chore: update model catalog from bot issues by github-actions[bot] · Pull Request #833 · braintrustdata/braintrust-proxy

github-actions · 2026-06-19T11:52:48Z

Automated daily batch of model catalog updates from bot issues.

Included issues

Closes [BOT ISSUE] Groq: add missing qwen/qwen3.6-27b model #824: [BOT ISSUE] Groq: add missing qwen/qwen3.6-27b model
Closes [BOT ISSUE] Together: add missing Qwen/Qwen3.7-Plus model #825: [BOT ISSUE] Together: add missing Qwen/Qwen3.7-Plus model
Closes [BOT ISSUE] OpenAI: update gpt-5.4-mini and gpt-5.4-nano max_input_tokens from 272K to 400K #826: [BOT ISSUE] OpenAI: update gpt-5.4-mini and gpt-5.4-nano max_input_tokens from 272K to 400K
Closes [BOT ISSUE] OpenAI: update gpt-5.3-codex max_input_tokens from 272K to 400K #827: [BOT ISSUE] OpenAI: update gpt-5.3-codex max_input_tokens from 272K to 400K
Closes [BOT ISSUE] Mistral: add deprecation markers for magistral-medium-2509, magistral-small-2509, devstral-2512 #828: [BOT ISSUE] Mistral: add deprecation markers for magistral-medium-2509, magistral-small-2509, devstral-2512
Closes [BOT ISSUE] Mistral: add deprecation markers for open-mistral-nemo and open-mistral-nemo-2407 #829: [BOT ISSUE] Mistral: add deprecation markers for open-mistral-nemo and open-mistral-nemo-2407
Closes [BOT ISSUE] OpenAI: mark chatgpt-4o-latest and codex-mini-latest as deprecated (already shut down) #830: [BOT ISSUE] OpenAI: mark chatgpt-4o-latest and codex-mini-latest as deprecated (already shut down)

Summary

Issue	Provider	Primary model	Changed models	Added models	Updated models	Verification sources
#824	groq	qwen/qwen3.6-27b	`qwen/qwen3.6-27b`	`qwen/qwen3.6-27b`	None	1 2
#825	together	Qwen/Qwen3.7-Plus	`Qwen/Qwen3.7-Plus`	`Qwen/Qwen3.7-Plus`	None	1
#826	openai	gpt-5.4-mini	`gpt-5.4-mini` `gpt-5.4-nano`	None	`gpt-5.4-mini` `gpt-5.4-nano`	1 2 3
#827	openai	gpt-5.3-codex	`gpt-5.3-codex`	None	`gpt-5.3-codex`	1 2
#828	mistral	magistral-medium-2509	`magistral-medium-2509` `magistral-small-2509` `devstral-2512`	None	`magistral-medium-2509` `magistral-small-2509` `devstral-2512`	1 2
#829	mistral	open-mistral-nemo	`open-mistral-nemo` `open-mistral-nemo-2407`	None	`open-mistral-nemo` `open-mistral-nemo-2407`	1 2
#830	openai	chatgpt-4o-latest	`chatgpt-4o-latest` `codex-mini-latest`	None	`chatgpt-4o-latest` `codex-mini-latest`	1 2

Verified metadata

#824: [BOT ISSUE] Groq: add missing qwen/qwen3.6-27b model

Model	Display name	Parent	Providers	Format	Flavor	Token limits	Pricing	Lifecycle
qwen/qwen3.6-27b	Qwen 3.6 27B		groq	openai	chat	input=131072, output=32768	in/out=0.6/3 per 1M	active

Verification notes

No LLM verification step ran; model metadata was already complete in the issue.

sync_models vs proposed update

sync_models cross-check found differences. Official provider verification was used for the applied values, and sync_models discrepancies are listed below for review.

Model	Field	Proposed update	sync_models	sync_models source models
qwen/qwen3.6-27b	catalog entry	present	missing	None

#825: [BOT ISSUE] Together: add missing Qwen/Qwen3.7-Plus model

Model	Display name	Parent	Providers	Format	Flavor	Token limits	Pricing	Lifecycle
Qwen/Qwen3.7-Plus	Qwen 3.7 Plus		together	openai	chat	input=1000000, output=not provided	in/out=0.32/1.28 per 1M	active

Verification notes

No LLM verification step ran; model metadata was already complete in the issue.

sync_models vs proposed update

sync_models cross-check found differences. Official provider verification was used for the applied values, and sync_models discrepancies are listed below for review.

Model	Field	Proposed update	sync_models	sync_models source models
Qwen/Qwen3.7-Plus	catalog entry	present	missing	None

#826: [BOT ISSUE] OpenAI: update gpt-5.4-mini and gpt-5.4-nano max_input_tokens from 272K to 400K

Model	Display name	Parent	Providers	Format	Flavor	Token limits	Pricing	Lifecycle
gpt-5.4-mini	GPT-5.4 mini		openai, azure	openai	chat	input=400000, output=128000	in/out=0.75/4.5 per 1M; cache read=0.075 per 1M	multimodal=true; reasoning=true
gpt-5.4-nano	GPT-5.4 nano		openai, azure	openai	chat	input=400000, output=128000	in/out=0.2/1.25 per 1M; cache read=0.02 per 1M	multimodal=true; reasoning=true

Verification notes

No LLM verification step ran; model metadata was already complete in the issue.

sync_models vs proposed update

sync_models cross-check found differences. Official provider verification was used for the applied values, and sync_models discrepancies are listed below for review.

Model	Field	Proposed update	sync_models	sync_models source models
gpt-5.4-mini	max_input_tokens	400000	272000	gpt-5.4-mini
gpt-5.4-nano	max_input_tokens	400000	272000	gpt-5.4-nano

#827: [BOT ISSUE] OpenAI: update gpt-5.3-codex max_input_tokens from 272K to 400K

Model	Display name	Parent	Providers	Format	Flavor	Token limits	Pricing	Lifecycle
gpt-5.3-codex	GPT-5.3 Codex		openai, azure	openai	chat	input=400000, output=128000	in/out=1.75/14 per 1M; cache read=0.175 per 1M	multimodal=true; reasoning=true

Verification notes

No LLM verification step ran; model metadata was already complete in the issue.

sync_models vs proposed update

sync_models cross-check found differences. Official provider verification was used for the applied values, and sync_models discrepancies are listed below for review.

Model	Field	Proposed update	sync_models	sync_models source models
gpt-5.3-codex	max_input_tokens	400000	272000	gpt-5.3-codex

#828: [BOT ISSUE] Mistral: add deprecation markers for magistral-medium-2509, magistral-small-2509, devstral-2512

Model	Display name	Parent	Providers	Format	Flavor	Token limits	Pricing	Lifecycle
magistral-medium-2509	Magistral Medium (2509)	magistral-medium-latest	mistral	openai	chat	input=128000, output=not provided	in/out=2/5 per 1M	parent=magistral-medium-latest; date=2026-07-31
magistral-small-2509	Magistral Small (2509)	magistral-small-latest	mistral	openai	chat	input=128000, output=not provided	in/out=0.5/1.5 per 1M	parent=magistral-small-latest; date=2026-07-31
devstral-2512	Devstral 2512		mistral	openai	chat	input=262144, output=not provided	in/out=0.4/2 per 1M	date=2026-07-31

Verification notes

Verification

sync_models vs proposed update

sync_models cross-check found differences. Official provider verification was used for the applied values, and sync_models discrepancies are listed below for review.

Model	Field	Proposed update	sync_models	sync_models source models
magistral-medium-2509	max_input_tokens	128000	40000	mistral/magistral-medium-2509
magistral-medium-2509	max_output_tokens	n/a	40000	mistral/magistral-medium-2509
magistral-medium-2509	deprecation_date	2026-07-31	n/a	mistral/magistral-medium-2509
magistral-small-2509	catalog entry	present	missing	None
devstral-2512	max_input_tokens	262144	256000	mistral/devstral-2512
devstral-2512	max_output_tokens	n/a	256000	mistral/devstral-2512
devstral-2512	deprecation_date	2026-07-31	n/a	mistral/devstral-2512

#829: [BOT ISSUE] Mistral: add deprecation markers for open-mistral-nemo and open-mistral-nemo-2407

Model	Display name	Parent	Providers	Format	Flavor	Token limits	Pricing	Lifecycle
open-mistral-nemo	Mistral NeMo		mistral	openai	chat	input=128000, output=128000	in/out=0.15/0.15 per 1M	date=2026-07-31
open-mistral-nemo-2407			mistral	openai	chat	input=128000, output=128000	in/out=0.15/0.15 per 1M	date=2026-07-31

Verification notes

Verification

sync_models vs proposed update

sync_models cross-check found differences. Official provider verification was used for the applied values, and sync_models discrepancies are listed below for review.

Model	Field	Proposed update	sync_models	sync_models source models
open-mistral-nemo	input_cost_per_mil_tokens	0.15	0.3	mistral/open-mistral-nemo
open-mistral-nemo	output_cost_per_mil_tokens	0.15	0.3	mistral/open-mistral-nemo
open-mistral-nemo	deprecation_date	2026-07-31	n/a	mistral/open-mistral-nemo
open-mistral-nemo-2407	input_cost_per_mil_tokens	0.15	0.3	mistral/open-mistral-nemo-2407
open-mistral-nemo-2407	output_cost_per_mil_tokens	0.15	0.3	mistral/open-mistral-nemo-2407
open-mistral-nemo-2407	deprecation_date	2026-07-31	n/a	mistral/open-mistral-nemo-2407

#830: [BOT ISSUE] OpenAI: mark chatgpt-4o-latest and codex-mini-latest as deprecated (already shut down)

Model	Display name	Parent	Providers	Format	Flavor	Token limits	Pricing	Lifecycle
chatgpt-4o-latest	ChatGPT-4o		openai, azure	openai	chat	input=128000, output=4096	in/out=5/15 per 1M	deprecated=true; multimodal=true
codex-mini-latest			openai, azure	openai	chat	input=200000, output=100000	in/out=1.5/6 per 1M; cache read=0.375 per 1M	deprecated=true; multimodal=true; reasoning=true

Verification notes

Verification

sync_models vs proposed update

sync_models cross-check found no pricing/token discrepancies for the changed models.

No sync_models discrepancies were noted.

vercel · 2026-06-19T11:52:52Z

The latest updates on your projects. Learn more about Vercel for GitHub.

Project	Deployment	Actions	Updated (UTC)
ai-proxy	Ready	Preview, Comment	Jun 22, 2026 3:49pm

github-actions · 2026-06-19T11:52:58Z

Codex (@codex) review

chatgpt-codex-connector · 2026-06-19T11:59:51Z

Codex Review: Didn't find any major issues. You're on a roll.

Reviewed commit: 3b20425104

ℹ️ About Codex in GitHub

Codex has been enabled to automatically review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "Codex (@codex) review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

When you sign up for Codex through ChatGPT, Codex can also answer questions or update the PR, like "Codex (@codex) address that feedback".

github-actions · 2026-06-19T12:00:31Z

Codex review found no issues ("Didn't find any major issues"). No changes were made to repository files.

The bot batch raised max_input_tokens 272000->400000 for the gpt-5.4-mini, gpt-5.4-nano and gpt-5.3-codex aliases. 400000 is the GPT-5 family's TOTAL context window (272000 input + 128000 output); 272000 is the input-token cap (OpenAI returns "Input tokens exceed the configured limit of 272,000 tokens"). max_input_tokens must be the input cap. The dated snapshots these aliases resolve to (gpt-5.4-mini-2026-03-17, gpt-5.4-nano-2026-03-17) and every other mini/nano/codex entry in the family already use 272000; reverting restores consistency. Same regression class as #780/#776. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>

…rmat to converse Both model-sync workflows have a 'Respond to Codex review with Claude Code' step that auto-applies Codex's suggestions to the catalog. When the index.ts Bedrock mappings for openai.gpt-5.5 / openai.gpt-5.4 / xai.grok-4.3 are present, Codex posts a P1 ('openai format on a Bedrock model fails') because the TS proxy's fetchOpenAI rejects Bedrock secrets, and the auto-apply step 'fixes' it by flipping format openai->converse. That is wrong: these are Bedrock Mantle models served only via the OpenAI-compatible bedrock-mantle endpoint, which does not support the Converse API or InvokeModel, so converse breaks invocation at AWS. This recurred across #834/#836/#840/#843 (all reverting the validated #822 openai fix). The LiteLLM field sync never touches format (no checkAndUpdateFormat), so this guardrail on the codex-response prompt is the correct lever. Adds a 'Bedrock Mantle exception' instruction to the codex-response prompt in both sync-models.yaml and fix-missing-model-bot-issues.yaml: keep format=openai for bedrock-only openai-format models and explain in the summary that this is an intentional Mantle case to be handled proxy-side, not a catalog change. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>

The bot batch wrote the new/updated endpoint entries (chatgpt-4o-latest, gpt-5.3-codex, gpt-5.4-mini, gpt-5.4-nano, codex-mini-latest) as ["openai","azure"] without the space after the comma that the .prettierrc (trailingComma: all) style requires. Run prettier to restore ["openai", "azure"], fixing the lint job. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>

chore: update model catalog from bot issues

3b20425

github-actions Bot added the auto-sync label Jun 19, 2026

github-actions Bot requested review from Alex Z (CLowbrow), aswink, Caitlin Pinn (cpinn), Erin McNulty (erin2722) and Ken Jiang (knjiang) June 19, 2026 11:52

vercel Bot deployed to Preview June 19, 2026 11:53 View deployment

Erin McNulty (erin2722) and others added 2 commits June 22, 2026 14:25

Erin McNulty (erin2722) approved these changes Jun 22, 2026

View reviewed changes

vercel Bot deployed to Preview June 22, 2026 14:40 View deployment

vercel Bot deployed to Preview June 22, 2026 15:49 View deployment

Erin McNulty (erin2722) merged commit d227cb9 into main Jun 22, 2026
7 of 8 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

chore: update model catalog from bot issues#833

chore: update model catalog from bot issues#833
Erin McNulty (erin2722) merged 4 commits into
mainfrom
chore/autofix-bot-issues-2026-06-19

github-actions Bot commented Jun 19, 2026

Uh oh!

vercel Bot commented Jun 19, 2026 •

edited

Loading

Uh oh!

github-actions Bot commented Jun 19, 2026

Uh oh!

chatgpt-codex-connector Bot commented Jun 19, 2026

Uh oh!

github-actions Bot commented Jun 19, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

github-actions Bot commented Jun 19, 2026

Included issues

Summary

Verified metadata

#824: [BOT ISSUE] Groq: add missing qwen/qwen3.6-27b model

Verification notes

sync_models vs proposed update

#825: [BOT ISSUE] Together: add missing Qwen/Qwen3.7-Plus model

Verification notes

sync_models vs proposed update

#826: [BOT ISSUE] OpenAI: update gpt-5.4-mini and gpt-5.4-nano max_input_tokens from 272K to 400K

Verification notes

sync_models vs proposed update

#827: [BOT ISSUE] OpenAI: update gpt-5.3-codex max_input_tokens from 272K to 400K

Verification notes

sync_models vs proposed update

#828: [BOT ISSUE] Mistral: add deprecation markers for magistral-medium-2509, magistral-small-2509, devstral-2512

Verification notes

Verification

sync_models vs proposed update

#829: [BOT ISSUE] Mistral: add deprecation markers for open-mistral-nemo and open-mistral-nemo-2407

Verification notes

Verification

sync_models vs proposed update

#830: [BOT ISSUE] OpenAI: mark chatgpt-4o-latest and codex-mini-latest as deprecated (already shut down)

Verification notes

Verification

sync_models vs proposed update

Uh oh!

vercel Bot commented Jun 19, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions Bot commented Jun 19, 2026

Uh oh!

chatgpt-codex-connector Bot commented Jun 19, 2026

Uh oh!

github-actions Bot commented Jun 19, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

vercel Bot commented Jun 19, 2026 •

edited

Loading