Commit 4927db5
Parent(s): 75bc69f
fix: align all docs/tests to gpt-5.1 (actual current model)
- AGENTS.md +2 -3
- CLAUDE.md +2 -3
- GEMINI.md +2 -3
- docs/bugs/INVESTIGATION_INVALID_MODELS.md +13 -12
- tests/unit/agent_factory/test_judges_factory.py +3 -3
AGENTS.md
CHANGED
@@ -93,9 +93,8 @@ DeepBonerError (base)
 
 Given the rapid advancements, as of November 29, 2025, the DeepBoner project uses the following default LLM models in its configuration (`src/utils/config.py`):
 
-- **OpenAI:** `gpt-5`
--
-- While `gpt-5.1` (released November 2025) exists, it is currently gated, and attempts to use it resulted in a `403 model_not_found` error for typical API keys. Advanced users with access to `gpt-5.1-instant`, `gpt-5.1-thinking`, or `gpt-5.1-codex-max` may configure their `.env` accordingly.
+- **OpenAI:** `gpt-5.1`
+- Current flagship model (November 2025). Requires Tier 5 access.
 - **Anthropic:** `claude-sonnet-4-5-20250929`
 - This is the mid-range Claude 4.5 model, released on September 29, 2025.
 - The flagship `Claude Opus 4.5` (released November 24, 2025) is also available and can be configured by advanced users for enhanced capabilities.
CLAUDE.md
CHANGED
@@ -100,9 +100,8 @@ DeepBonerError (base)
 
 Given the rapid advancements, as of November 29, 2025, the DeepBoner project uses the following default LLM models in its configuration (`src/utils/config.py`):
 
-- **OpenAI:** `gpt-5`
--
-- While `gpt-5.1` (released November 2025) exists, it is currently gated, and attempts to use it resulted in a `403 model_not_found` error for typical API keys. Advanced users with access to `gpt-5.1-instant`, `gpt-5.1-thinking`, or `gpt-5.1-codex-max` may configure their `.env` accordingly.
+- **OpenAI:** `gpt-5.1`
+- Current flagship model (November 2025). Requires Tier 5 access.
 - **Anthropic:** `claude-sonnet-4-5-20250929`
 - This is the mid-range Claude 4.5 model, released on September 29, 2025.
 - The flagship `Claude Opus 4.5` (released November 24, 2025) is also available and can be configured by advanced users for enhanced capabilities.
GEMINI.md
CHANGED
@@ -74,9 +74,8 @@ Settings via pydantic-settings from `.env`:
 
 Given the rapid advancements, as of November 29, 2025, the DeepBoner project uses the following default LLM models in its configuration (`src/utils/config.py`):
 
-- **OpenAI:** `gpt-5`
--
-- While `gpt-5.1` (released November 2025) exists, it is currently gated, and attempts to use it resulted in a `403 model_not_found` error for typical API keys. Advanced users with access to `gpt-5.1-instant`, `gpt-5.1-thinking`, or `gpt-5.1-codex-max` may configure their `.env` accordingly.
+- **OpenAI:** `gpt-5.1`
+- Current flagship model (November 2025). Requires Tier 5 access.
 - **Anthropic:** `claude-sonnet-4-5-20250929`
 - This is the mid-range Claude 4.5 model, released on September 29, 2025.
 - The flagship `Claude Opus 4.5` (released November 24, 2025) is also available and can be configured by advanced users for enhanced capabilities.
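For orientation, the defaults that these three agent docs describe could be modeled roughly as follows. This is a stdlib-only sketch, not the project's actual `src/utils/config.py`, which (per the GEMINI.md hunk header) loads settings via pydantic-settings from `.env`. The `Settings` dataclass, the `load_settings` helper, the environment variable names, and the `"openai"` default provider are all assumptions made for illustration; only the field names `llm_provider`, `openai_model`, and `anthropic_model` and the two default model strings come from the diff.

```python
import os
from dataclasses import dataclass
from typing import Mapping, Optional

# Default model names as documented in this commit.
DEFAULT_OPENAI_MODEL = "gpt-5.1"
DEFAULT_ANTHROPIC_MODEL = "claude-sonnet-4-5-20250929"


@dataclass(frozen=True)
class Settings:
    """Hypothetical stand-in for the project's pydantic-settings class."""

    llm_provider: str = "openai"  # assumed default, not confirmed by the diff
    openai_model: str = DEFAULT_OPENAI_MODEL
    anthropic_model: str = DEFAULT_ANTHROPIC_MODEL


def load_settings(env: Optional[Mapping[str, str]] = None) -> Settings:
    """Build Settings, letting environment variables override the defaults.

    The variable names below are illustrative assumptions.
    """
    env = os.environ if env is None else env
    return Settings(
        llm_provider=env.get("LLM_PROVIDER", "openai"),
        openai_model=env.get("OPENAI_MODEL", DEFAULT_OPENAI_MODEL),
        anthropic_model=env.get("ANTHROPIC_MODEL", DEFAULT_ANTHROPIC_MODEL),
    )
```

Under this sketch, an account without access to `gpt-5.1` would override `OPENAI_MODEL` rather than edit the defaults in code.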
docs/bugs/INVESTIGATION_INVALID_MODELS.md
CHANGED
@@ -9,22 +9,23 @@
 
 ## Issue Description
 The user encountered a 403 error when running in Magentic mode:
-`Error code: 403 - {'error': {'message': 'Project ... does not have access to model gpt-5
-
-This indicates the application is trying to use `gpt-5.1`, which the user's API key did not have access to (likely a beta/gated model).
+`Error code: 403 - {'error': {'message': 'Project ... does not have access to model gpt-5', ... 'code': 'model_not_found'}}`
 
 ## Root Cause Analysis
-
-
-
-- `
-- `gpt-
-- `
+OpenAI deprecated the base `gpt-5` model. Tier 5 accounts now have access to:
+- `gpt-5.1` (current flagship)
+- `gpt-5-mini`
+- `gpt-5-nano`
+- `gpt-4.1`, `gpt-4.1-mini`, `gpt-4.1-nano`
+- `o3`, `o4-mini`
+
+The base `gpt-5` is NO LONGER available via API.
 
 ## Solution Implemented
 Updated `src/utils/config.py` to use:
-- `
-- `
+- `openai_model`: `gpt-5.1` (the actual current model)
+- `anthropic_model`: `claude-sonnet-4-5-20250929` (unchanged)
 
 ## Verification
-- `tests/unit/agent_factory/test_judges_factory.py` updated and passed.
+- `tests/unit/agent_factory/test_judges_factory.py` updated and passed.
+- User confirmed Tier 5 access to `gpt-5.1` via OpenAI dashboard.
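The failure mode described in the investigation doc (a 403 response whose error code is `model_not_found`) can be recognized programmatically. The sketch below assumes the error payload has been parsed into a dict with the shape quoted in the doc; the helper names `is_model_access_error` and `explain_model_access_error` are hypothetical and do not come from the project or from the OpenAI SDK.

```python
from collections.abc import Mapping
from typing import Any, Optional


def is_model_access_error(status_code: int, body: Mapping[str, Any]) -> bool:
    """Return True for a 403 response whose error code is 'model_not_found'."""
    error = body.get("error")
    if not isinstance(error, Mapping):
        return False
    return status_code == 403 and error.get("code") == "model_not_found"


def explain_model_access_error(
    status_code: int, body: Mapping[str, Any], configured_model: str
) -> Optional[str]:
    """Return a human-readable hint for a model-access error, else None."""
    if not is_model_access_error(status_code, body):
        return None
    return (
        f"The API key has no access to model '{configured_model}'. "
        "Check which models the account can use and update openai_model."
    )
```

A caller could log the returned hint instead of the raw payload, which makes a misconfigured model name easier to spot than a bare 403.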
tests/unit/agent_factory/test_judges_factory.py
CHANGED
@@ -25,11 +25,11 @@ def test_get_model_openai(mock_settings):
     """Test that OpenAI model is returned when provider is openai."""
     mock_settings.llm_provider = "openai"
     mock_settings.openai_api_key = "sk-test"
-    mock_settings.openai_model = "gpt-5"
+    mock_settings.openai_model = "gpt-5.1"
 
     model = get_model()
     assert isinstance(model, OpenAIChatModel)
-    assert model.model_name == "gpt-5"
+    assert model.model_name == "gpt-5.1"
 
 
 def test_get_model_anthropic(mock_settings):
@@ -58,7 +58,7 @@ def test_get_model_default_fallback(mock_settings):
     """Test fallback to OpenAI if provider is unknown."""
     mock_settings.llm_provider = "unknown_provider"
     mock_settings.openai_api_key = "sk-test"
-    mock_settings.openai_model = "gpt-5"
+    mock_settings.openai_model = "gpt-5.1"
 
     model = get_model()
     assert isinstance(model, OpenAIChatModel)
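The tests above exercise a provider dispatch with an OpenAI fallback. The real `get_model` is not part of this diff, so the following is only a minimal sketch of that behavior under stated assumptions: `FakeSettings`, `OpenAIModelStub`, `AnthropicModelStub`, and `get_model_sketch` are hypothetical stand-ins (the project uses `OpenAIChatModel` and a mocked settings object), and the `"anthropic"` provider string is assumed from the test name `test_get_model_anthropic`.

```python
from dataclasses import dataclass


@dataclass
class FakeSettings:
    """Hypothetical settings object mirroring the fields the tests set."""

    llm_provider: str = "openai"
    openai_model: str = "gpt-5.1"
    anthropic_model: str = "claude-sonnet-4-5-20250929"


@dataclass
class OpenAIModelStub:
    """Stand-in for the real OpenAI model wrapper."""

    model_name: str


@dataclass
class AnthropicModelStub:
    """Stand-in for the real Anthropic model wrapper."""

    model_name: str


def get_model_sketch(settings: FakeSettings):
    """Pick a model wrapper by provider, falling back to OpenAI."""
    if settings.llm_provider == "anthropic":
        return AnthropicModelStub(settings.anthropic_model)
    # "openai" and any unrecognized provider both end up here (the fallback).
    return OpenAIModelStub(settings.openai_model)
```

With this shape, changing the default from `gpt-5` to `gpt-5.1` only touches the settings value, which matches the fact that the commit edits test fixtures and docs rather than dispatch logic.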