VibecoderMcSwaggins commited on
Commit
4927db5
·
1 Parent(s): 75bc69f

fix: align all docs/tests to gpt-5.1 (actual current model)

Browse files
AGENTS.md CHANGED
@@ -93,9 +93,8 @@ DeepBonerError (base)
93
 
94
  Given the rapid advancements, as of November 29, 2025, the DeepBoner project uses the following default LLM models in its configuration (`src/utils/config.py`):
95
 
96
- - **OpenAI:** `gpt-5`
97
- - This is the stable flagship model released in August 2025.
98
- - While `gpt-5.1` (released November 2025) exists, it is currently gated, and attempts to use it resulted in a `403 model_not_found` error for typical API keys. Advanced users with access to `gpt-5.1-instant`, `gpt-5.1-thinking`, or `gpt-5.1-codex-max` may configure their `.env` accordingly.
99
  - **Anthropic:** `claude-sonnet-4-5-20250929`
100
  - This is the mid-range Claude 4.5 model, released on September 29, 2025.
101
  - The flagship `Claude Opus 4.5` (released November 24, 2025) is also available and can be configured by advanced users for enhanced capabilities.
 
93
 
94
  Given the rapid advancements, as of November 29, 2025, the DeepBoner project uses the following default LLM models in its configuration (`src/utils/config.py`):
95
 
96
+ - **OpenAI:** `gpt-5.1`
97
+ - Current flagship model (November 2025). Requires Tier 5 access.
 
98
  - **Anthropic:** `claude-sonnet-4-5-20250929`
99
  - This is the mid-range Claude 4.5 model, released on September 29, 2025.
100
  - The flagship `Claude Opus 4.5` (released November 24, 2025) is also available and can be configured by advanced users for enhanced capabilities.
CLAUDE.md CHANGED
@@ -100,9 +100,8 @@ DeepBonerError (base)
100
 
101
  Given the rapid advancements, as of November 29, 2025, the DeepBoner project uses the following default LLM models in its configuration (`src/utils/config.py`):
102
 
103
- - **OpenAI:** `gpt-5`
104
- - This is the stable flagship model released in August 2025.
105
- - While `gpt-5.1` (released November 2025) exists, it is currently gated, and attempts to use it resulted in a `403 model_not_found` error for typical API keys. Advanced users with access to `gpt-5.1-instant`, `gpt-5.1-thinking`, or `gpt-5.1-codex-max` may configure their `.env` accordingly.
106
  - **Anthropic:** `claude-sonnet-4-5-20250929`
107
  - This is the mid-range Claude 4.5 model, released on September 29, 2025.
108
  - The flagship `Claude Opus 4.5` (released November 24, 2025) is also available and can be configured by advanced users for enhanced capabilities.
 
100
 
101
  Given the rapid advancements, as of November 29, 2025, the DeepBoner project uses the following default LLM models in its configuration (`src/utils/config.py`):
102
 
103
+ - **OpenAI:** `gpt-5.1`
104
+ - Current flagship model (November 2025). Requires Tier 5 access.
 
105
  - **Anthropic:** `claude-sonnet-4-5-20250929`
106
  - This is the mid-range Claude 4.5 model, released on September 29, 2025.
107
  - The flagship `Claude Opus 4.5` (released November 24, 2025) is also available and can be configured by advanced users for enhanced capabilities.
GEMINI.md CHANGED
@@ -74,9 +74,8 @@ Settings via pydantic-settings from `.env`:
74
 
75
  Given the rapid advancements, as of November 29, 2025, the DeepBoner project uses the following default LLM models in its configuration (`src/utils/config.py`):
76
 
77
- - **OpenAI:** `gpt-5`
78
- - This is the stable flagship model released in August 2025.
79
- - While `gpt-5.1` (released November 2025) exists, it is currently gated, and attempts to use it resulted in a `403 model_not_found` error for typical API keys. Advanced users with access to `gpt-5.1-instant`, `gpt-5.1-thinking`, or `gpt-5.1-codex-max` may configure their `.env` accordingly.
80
  - **Anthropic:** `claude-sonnet-4-5-20250929`
81
  - This is the mid-range Claude 4.5 model, released on September 29, 2025.
82
  - The flagship `Claude Opus 4.5` (released November 24, 2025) is also available and can be configured by advanced users for enhanced capabilities.
 
74
 
75
  Given the rapid advancements, as of November 29, 2025, the DeepBoner project uses the following default LLM models in its configuration (`src/utils/config.py`):
76
 
77
+ - **OpenAI:** `gpt-5.1`
78
+ - Current flagship model (November 2025). Requires Tier 5 access.
 
79
  - **Anthropic:** `claude-sonnet-4-5-20250929`
80
  - This is the mid-range Claude 4.5 model, released on September 29, 2025.
81
  - The flagship `Claude Opus 4.5` (released November 24, 2025) is also available and can be configured by advanced users for enhanced capabilities.
docs/bugs/INVESTIGATION_INVALID_MODELS.md CHANGED
@@ -9,22 +9,23 @@
9
 
10
  ## Issue Description
11
  The user encountered a 403 error when running in Magentic mode:
12
- `Error code: 403 - {'error': {'message': 'Project ... does not have access to model gpt-5.1', ... 'code': 'model_not_found'}}`
13
-
14
- This indicates the application is trying to use `gpt-5.1`, which the user's API key did not have access to (likely a beta/gated model).
15
 
16
  ## Root Cause Analysis
17
- The default config used `gpt-5.1` (beta/preview) and `claude-sonnet-4-5-20250929`.
18
- Initial remediation mistakenly downgraded these to 2024 models (`gpt-4o`).
19
- Web search confirmed that in November 2025:
20
- - `claude-sonnet-4-5-20250929` IS valid.
21
- - `gpt-5.1` exists but access is restricted (leading to 403).
22
- - `gpt-5` (August 2025) is the stable flagship.
 
 
23
 
24
  ## Solution Implemented
25
  Updated `src/utils/config.py` to use:
26
- - `anthropic_model`: `claude-sonnet-4-5-20250929` (Restored correct Nov 2025 model)
27
- - `openai_model`: `gpt-5` (Changed from 5.1 to 5 to ensure stability/access).
28
 
29
  ## Verification
30
- - `tests/unit/agent_factory/test_judges_factory.py` updated and passed.
 
 
9
 
10
  ## Issue Description
11
  The user encountered a 403 error when running in Magentic mode:
12
+ `Error code: 403 - {'error': {'message': 'Project ... does not have access to model gpt-5', ... 'code': 'model_not_found'}}`
 
 
13
 
14
  ## Root Cause Analysis
15
+ OpenAI deprecated the base `gpt-5` model. Tier 5 accounts now have access to:
16
+ - `gpt-5.1` (current flagship)
17
+ - `gpt-5-mini`
18
+ - `gpt-5-nano`
19
+ - `gpt-4.1`, `gpt-4.1-mini`, `gpt-4.1-nano`
20
+ - `o3`, `o4-mini`
21
+
22
+ The base `gpt-5` is NO LONGER available via API.
23
 
24
  ## Solution Implemented
25
  Updated `src/utils/config.py` to use:
26
+ - `openai_model`: `gpt-5.1` (the actual current model)
27
+ - `anthropic_model`: `claude-sonnet-4-5-20250929` (unchanged)
28
 
29
  ## Verification
30
+ - `tests/unit/agent_factory/test_judges_factory.py` updated and passed.
31
+ - User confirmed Tier 5 access to `gpt-5.1` via OpenAI dashboard.
tests/unit/agent_factory/test_judges_factory.py CHANGED
@@ -25,11 +25,11 @@ def test_get_model_openai(mock_settings):
25
  """Test that OpenAI model is returned when provider is openai."""
26
  mock_settings.llm_provider = "openai"
27
  mock_settings.openai_api_key = "sk-test"
28
- mock_settings.openai_model = "gpt-5"
29
 
30
  model = get_model()
31
  assert isinstance(model, OpenAIChatModel)
32
- assert model.model_name == "gpt-5"
33
 
34
 
35
  def test_get_model_anthropic(mock_settings):
@@ -58,7 +58,7 @@ def test_get_model_default_fallback(mock_settings):
58
  """Test fallback to OpenAI if provider is unknown."""
59
  mock_settings.llm_provider = "unknown_provider"
60
  mock_settings.openai_api_key = "sk-test"
61
- mock_settings.openai_model = "gpt-5"
62
 
63
  model = get_model()
64
  assert isinstance(model, OpenAIChatModel)
 
25
  """Test that OpenAI model is returned when provider is openai."""
26
  mock_settings.llm_provider = "openai"
27
  mock_settings.openai_api_key = "sk-test"
28
+ mock_settings.openai_model = "gpt-5.1"
29
 
30
  model = get_model()
31
  assert isinstance(model, OpenAIChatModel)
32
+ assert model.model_name == "gpt-5.1"
33
 
34
 
35
  def test_get_model_anthropic(mock_settings):
 
58
  """Test fallback to OpenAI if provider is unknown."""
59
  mock_settings.llm_provider = "unknown_provider"
60
  mock_settings.openai_api_key = "sk-test"
61
+ mock_settings.openai_model = "gpt-5.1"
62
 
63
  model = get_model()
64
  assert isinstance(model, OpenAIChatModel)