Skip to content

chore: update model catalog from bot issues#833

Merged
Erin McNulty (erin2722) merged 4 commits into
mainfrom
chore/autofix-bot-issues-2026-06-19
Jun 22, 2026
Merged

chore: update model catalog from bot issues#833
Erin McNulty (erin2722) merged 4 commits into
mainfrom
chore/autofix-bot-issues-2026-06-19

Conversation

@github-actions

Copy link
Copy Markdown
Contributor

Automated daily batch of model catalog updates from bot issues.

Included issues

Summary

Issue Provider Primary model Changed models Added models Updated models Verification sources
#824 groq qwen/qwen3.6-27b qwen/qwen3.6-27b qwen/qwen3.6-27b None 1
2
#825 together Qwen/Qwen3.7-Plus Qwen/Qwen3.7-Plus Qwen/Qwen3.7-Plus None 1
#826 openai gpt-5.4-mini gpt-5.4-mini
gpt-5.4-nano
None gpt-5.4-mini
gpt-5.4-nano
1
2
3
#827 openai gpt-5.3-codex gpt-5.3-codex None gpt-5.3-codex 1
2
#828 mistral magistral-medium-2509 magistral-medium-2509
magistral-small-2509
devstral-2512
None magistral-medium-2509
magistral-small-2509
devstral-2512
1
2
#829 mistral open-mistral-nemo open-mistral-nemo
open-mistral-nemo-2407
None open-mistral-nemo
open-mistral-nemo-2407
1
2
#830 openai chatgpt-4o-latest chatgpt-4o-latest
codex-mini-latest
None chatgpt-4o-latest
codex-mini-latest
1
2

Verified metadata

#824: [BOT ISSUE] Groq: add missing qwen/qwen3.6-27b model

Model Display name Parent Providers Format Flavor Token limits Pricing Lifecycle
qwen/qwen3.6-27b Qwen 3.6 27B groq openai chat input=131072, output=32768 in/out=0.6/3 per 1M active

Verification notes

No LLM verification step ran; model metadata was already complete in the issue.

sync_models vs proposed update

sync_models cross-check found differences. Official provider verification was used for the applied values, and sync_models discrepancies are listed below for review.

Model Field Proposed update sync_models sync_models source models
qwen/qwen3.6-27b catalog entry present missing None

#825: [BOT ISSUE] Together: add missing Qwen/Qwen3.7-Plus model

Model Display name Parent Providers Format Flavor Token limits Pricing Lifecycle
Qwen/Qwen3.7-Plus Qwen 3.7 Plus together openai chat input=1000000, output=not provided in/out=0.32/1.28 per 1M active

Verification notes

No LLM verification step ran; model metadata was already complete in the issue.

sync_models vs proposed update

sync_models cross-check found differences. Official provider verification was used for the applied values, and sync_models discrepancies are listed below for review.

Model Field Proposed update sync_models sync_models source models
Qwen/Qwen3.7-Plus catalog entry present missing None

#826: [BOT ISSUE] OpenAI: update gpt-5.4-mini and gpt-5.4-nano max_input_tokens from 272K to 400K

Model Display name Parent Providers Format Flavor Token limits Pricing Lifecycle
gpt-5.4-mini GPT-5.4 mini openai, azure openai chat input=400000, output=128000 in/out=0.75/4.5 per 1M; cache read=0.075 per 1M multimodal=true; reasoning=true
gpt-5.4-nano GPT-5.4 nano openai, azure openai chat input=400000, output=128000 in/out=0.2/1.25 per 1M; cache read=0.02 per 1M multimodal=true; reasoning=true

Verification notes

No LLM verification step ran; model metadata was already complete in the issue.

sync_models vs proposed update

sync_models cross-check found differences. Official provider verification was used for the applied values, and sync_models discrepancies are listed below for review.

Model Field Proposed update sync_models sync_models source models
gpt-5.4-mini max_input_tokens 400000 272000 gpt-5.4-mini
gpt-5.4-nano max_input_tokens 400000 272000 gpt-5.4-nano

#827: [BOT ISSUE] OpenAI: update gpt-5.3-codex max_input_tokens from 272K to 400K

Model Display name Parent Providers Format Flavor Token limits Pricing Lifecycle
gpt-5.3-codex GPT-5.3 Codex openai, azure openai chat input=400000, output=128000 in/out=1.75/14 per 1M; cache read=0.175 per 1M multimodal=true; reasoning=true

Verification notes

No LLM verification step ran; model metadata was already complete in the issue.

sync_models vs proposed update

sync_models cross-check found differences. Official provider verification was used for the applied values, and sync_models discrepancies are listed below for review.

Model Field Proposed update sync_models sync_models source models
gpt-5.3-codex max_input_tokens 400000 272000 gpt-5.3-codex

#828: [BOT ISSUE] Mistral: add deprecation markers for magistral-medium-2509, magistral-small-2509, devstral-2512

Model Display name Parent Providers Format Flavor Token limits Pricing Lifecycle
magistral-medium-2509 Magistral Medium (2509) magistral-medium-latest mistral openai chat input=128000, output=not provided in/out=2/5 per 1M parent=magistral-medium-latest; date=2026-07-31
magistral-small-2509 Magistral Small (2509) magistral-small-latest mistral openai chat input=128000, output=not provided in/out=0.5/1.5 per 1M parent=magistral-small-latest; date=2026-07-31
devstral-2512 Devstral 2512 mistral openai chat input=262144, output=not provided in/out=0.4/2 per 1M date=2026-07-31

Verification notes

Verification

sync_models vs proposed update

sync_models cross-check found differences. Official provider verification was used for the applied values, and sync_models discrepancies are listed below for review.

Model Field Proposed update sync_models sync_models source models
magistral-medium-2509 max_input_tokens 128000 40000 mistral/magistral-medium-2509
magistral-medium-2509 max_output_tokens n/a 40000 mistral/magistral-medium-2509
magistral-medium-2509 deprecation_date 2026-07-31 n/a mistral/magistral-medium-2509
magistral-small-2509 catalog entry present missing None
devstral-2512 max_input_tokens 262144 256000 mistral/devstral-2512
devstral-2512 max_output_tokens n/a 256000 mistral/devstral-2512
devstral-2512 deprecation_date 2026-07-31 n/a mistral/devstral-2512

#829: [BOT ISSUE] Mistral: add deprecation markers for open-mistral-nemo and open-mistral-nemo-2407

Model Display name Parent Providers Format Flavor Token limits Pricing Lifecycle
open-mistral-nemo Mistral NeMo mistral openai chat input=128000, output=128000 in/out=0.15/0.15 per 1M date=2026-07-31
open-mistral-nemo-2407 mistral openai chat input=128000, output=128000 in/out=0.15/0.15 per 1M date=2026-07-31

Verification notes

Verification

sync_models vs proposed update

sync_models cross-check found differences. Official provider verification was used for the applied values, and sync_models discrepancies are listed below for review.

Model Field Proposed update sync_models sync_models source models
open-mistral-nemo input_cost_per_mil_tokens 0.15 0.3 mistral/open-mistral-nemo
open-mistral-nemo output_cost_per_mil_tokens 0.15 0.3 mistral/open-mistral-nemo
open-mistral-nemo deprecation_date 2026-07-31 n/a mistral/open-mistral-nemo
open-mistral-nemo-2407 input_cost_per_mil_tokens 0.15 0.3 mistral/open-mistral-nemo-2407
open-mistral-nemo-2407 output_cost_per_mil_tokens 0.15 0.3 mistral/open-mistral-nemo-2407
open-mistral-nemo-2407 deprecation_date 2026-07-31 n/a mistral/open-mistral-nemo-2407

#830: [BOT ISSUE] OpenAI: mark chatgpt-4o-latest and codex-mini-latest as deprecated (already shut down)

Model Display name Parent Providers Format Flavor Token limits Pricing Lifecycle
chatgpt-4o-latest ChatGPT-4o openai, azure openai chat input=128000, output=4096 in/out=5/15 per 1M deprecated=true; multimodal=true
codex-mini-latest openai, azure openai chat input=200000, output=100000 in/out=1.5/6 per 1M; cache read=0.375 per 1M deprecated=true; multimodal=true; reasoning=true

Verification notes

Verification

sync_models vs proposed update

sync_models cross-check found no pricing/token discrepancies for the changed models.

No sync_models discrepancies were noted.

@vercel

vercel Bot commented Jun 19, 2026

Copy link
Copy Markdown

The latest updates on your projects. Learn more about Vercel for GitHub.

Project Deployment Actions Updated (UTC)
ai-proxy Ready Ready Preview, Comment Jun 22, 2026 3:49pm

Request Review

@github-actions

Copy link
Copy Markdown
Contributor Author

Codex (@codex) review

@chatgpt-codex-connector

Copy link
Copy Markdown

Codex Review: Didn't find any major issues. You're on a roll.

Reviewed commit: 3b20425104

ℹ️ About Codex in GitHub

Codex has been enabled to automatically review pull requests in this repo. Reviews are triggered when you

  • Open a pull request for review
  • Mark a draft as ready
  • Comment "Codex (@codex) review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

When you sign up for Codex through ChatGPT, Codex can also answer questions or update the PR, like "Codex (@codex) address that feedback".

@github-actions

Copy link
Copy Markdown
Contributor Author

Codex review found no issues ("Didn't find any major issues"). No changes were made to repository files.

Erin McNulty (erin2722) and others added 2 commits June 22, 2026 14:25
The bot batch raised max_input_tokens 272000->400000 for the gpt-5.4-mini,
gpt-5.4-nano and gpt-5.3-codex aliases. 400000 is the GPT-5 family's TOTAL
context window (272000 input + 128000 output); 272000 is the input-token cap
(OpenAI returns "Input tokens exceed the configured limit of 272,000 tokens").
max_input_tokens must be the input cap. The dated snapshots these aliases
resolve to (gpt-5.4-mini-2026-03-17, gpt-5.4-nano-2026-03-17) and every other
mini/nano/codex entry in the family already use 272000; reverting restores
consistency. Same regression class as #780/#776.

Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>
…rmat to converse

Both model-sync workflows have a 'Respond to Codex review with Claude Code'
step that auto-applies Codex's suggestions to the catalog. When the index.ts
Bedrock mappings for openai.gpt-5.5 / openai.gpt-5.4 / xai.grok-4.3 are present,
Codex posts a P1 ('openai format on a Bedrock model fails') because the TS
proxy's fetchOpenAI rejects Bedrock secrets, and the auto-apply step 'fixes' it
by flipping format openai->converse. That is wrong: these are Bedrock Mantle
models served only via the OpenAI-compatible bedrock-mantle endpoint, which does
not support the Converse API or InvokeModel, so converse breaks invocation at
AWS. This recurred across #834/#836/#840/#843 (all reverting the validated #822
openai fix). The LiteLLM field sync never touches format (no checkAndUpdateFormat),
so this guardrail on the codex-response prompt is the correct lever.

Adds a 'Bedrock Mantle exception' instruction to the codex-response prompt in
both sync-models.yaml and fix-missing-model-bot-issues.yaml: keep format=openai
for bedrock-only openai-format models and explain in the summary that this is an
intentional Mantle case to be handled proxy-side, not a catalog change.

Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>
The bot batch wrote the new/updated endpoint entries (chatgpt-4o-latest,
gpt-5.3-codex, gpt-5.4-mini, gpt-5.4-nano, codex-mini-latest) as
["openai","azure"] without the space after the comma that the .prettierrc
(trailingComma: all) style requires. Run prettier to restore ["openai", "azure"],
fixing the lint job.

Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>
@erin2722 Erin McNulty (erin2722) merged commit d227cb9 into main Jun 22, 2026
7 of 8 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment