[release] v0.103.5 by github-actions[bot] · Pull Request #4690 · Agenta-AI/agenta

github-actions · 2026-06-13T11:24:00Z

New version v0.103.5 in

web
- web/oss
- web/ee
services
api
sdks
- sdks/python
clients
- clients/python
- clients/typescript
kubernetes
- kubernetes/helm

Signed-off-by: axelray-dev <110029405+axelray-dev@users.noreply.github.com>

…535] Replace programmatic router.push with native href on the evaluator button so that clicking always navigates even if the trace drawer close handler does not complete. Removes the unused navigateToEvaluator call and preventDefault, keeping only stopPropagation to avoid triggering the parent popover's hover behavior.

One project is in scope at a time in the web app, so grouping batched requests by project and issuing one query per project handles a state that cannot exist. Every batchFn now takes the single project in scope, throws if coalesced requests disagree, and resolves all ids with one call. Documents the invariant in web/AGENTS.md.

…child selection features

…rawer config

…ncluding suffix node support and improved evaluator name resolution.

…dling and metadata display

The observability trace filter never listed annotation feedback fields (score, comment, etc.) from evaluators, so feedback sent via the API was not filterable. Two causes, both fixed on the frontend: - The filter read evaluator.metrics off thin list refs that carry no data; it now resolves each evaluator's latest revision via a new evaluatorFeedbackSchemasAtom. - Auto-created feedback evaluators store a genson-inferred output schema wrapped one level deeper ({outputs:{properties}}); resolveOutputSchema- Properties now unwraps that envelope so real metric keys surface. Also corrects docs that claimed evaluators are not auto-created.

… UI components

… and improved UI interactions

…nd improve parent checkbox state handling in PopoverCascaderVariant

A walkthrough demo for classifying CVs against a job spec with Agenta: - Curated test set of 30 real Markdown CVs (from the public opensporks/resumes dataset on Hugging Face, a mirror of the Kaggle Resume Dataset), hand-labeled against an IT Manager job spec - prepare_testset.py rebuilds the CSV reproducibly and can upload it to Agenta via the SDK - create_app.py creates the completion app with the screening prompt and structured-output JSON schema, and deploys it to production - Streamlit demo UI: PDF upload -> Markdown (markitdown) -> prompt fetched from the Agenta registry -> structured score dashboard - Sample CV PDFs (one per classification) generated from the test set https://claude.ai/code/session_01YMbf4sUb2VBFQHGNKv6yh3

The Streamlit app now shows a thumbs up/down form with an optional comment after each screening. Submitting it attaches the feedback to the screening's trace in Agenta as an annotation (evaluator slug 'user-feedback'), following the capture-user-feedback cookbook: the invocation link is captured inside the instrumented classify_cv call and the annotation is POSTed to /api/simple/traces/. Screening results now persist in session state so the result and feedback form survive Streamlit reruns. Entry scripts load .env via python-dotenv, matching the documented setup flow. https://claude.ai/code/session_01YMbf4sUb2VBFQHGNKv6yh3

…pt revision Move all the AI logic out of the Streamlit app into a new screening.py module (prompt fetch, the LLM call, tracing, feedback), leaving app.py as a UI-only shell. Any other frontend can import screening.py unchanged. Tracing improvements so screenings are easy to act on from the UI: - Auto-instrument the OpenAI client with OpenInference, so every trace has a child LLM span with the exact messages, token counts, and cost. - classify_cv takes its inputs as a dict whose keys match the prompt input variables ({"cv": ...}), and the prompt config is kept out of the trace (ignore_inputs). The span data then mirrors the completion app's inputs. - Link each span to the deployed prompt revision via ag.tracing.store_refs, so traces filter by app/environment and open in the playground on the right revision with inputs pre-filled. Also fix create_app.py to read variant.variant_version as an attribute (VariantManager now returns a ConfigurationResponse, not a dict).

The walkthrough needed a leaner story: the output schema is now tech_match / experience_match / overall_match, each with a short reason, plus the missing-requirements list. overall_match is a holistic hire-or-not judgment, so a requirement like a language can flip it while the other two stay true. The test set drops the bookkeeping columns and carries one expected_* column per dimension; empty cells are skipped by the code evaluator documented in the Readme.

…ty filter Evaluators without an output schema expose no feedback metrics to suggest, and the feedback-field Select cleared any typed value. The Select now surfaces the typed text as a '<typed> (custom)' option that commits and persists, so users can filter by a feedback name even when the schema can't provide one.

…t integration

… evaluations page contract

vercel · 2026-06-13T11:24:05Z

The latest updates on your projects. Learn more about Vercel for GitHub.

Project	Deployment	Actions	Updated (UTC)
agenta-documentation	Ready	Preview, Comment	Jun 15, 2026 4:53pm

…elector-ui [Feat]: improve cascade entity selector UI

… query and improve cache handling

…avigation [4535] fix(frontend): fix evaluator playground navigation from trace drawer

…-fetchers refactor(frontend): drop per-project fan-out from all batch fetchers

feat(examples): CV screening demo with feedback-to-deploy walkthrough

Co-authored-by: Copilot Autofix powered by AI <175728472+Copilot@users.noreply.github.com>

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

[fix] Resolve broken invites in OSS (again)

axelray-dev and others added 26 commits June 4, 2026 03:02

fix(frontend): split evaluator navigation by type [4535]

4de2b82

Signed-off-by: axelray-dev <110029405+axelray-dev@users.noreply.github.com>

fix(frontend): show correct breadcrumb label for SDK evals

15d65ec

Merge branch 'main' into fix/sdk-eval-breadcrumb-label

1d5adbd

Enhance Cascade Entity Selector UI with workflow metadata and parent-…

d1941ef

…child selection features

implemented create evaluator feature and added an isolated workflow d…

e8b04f7

…rawer config

Enhance evaluator selection and display features across components, i…

a77db37

…ncluding suffix node support and improved evaluator name resolution.

Refactor evaluator and workflow components for improved selection han…

1d1d109

…dling and metadata display

fix: evaluator workflow metadata handling and auto-selection logic in…

ca5c677

… UI components

Merge branch 'main' into fix/observability-feedback-filter

39993fc

feat: enhance PopoverCascaderVariant with default child panel opening…

056d3ce

… and improved UI interactions

feat: enhance AutoSelectHandler to support multiple selection modes a…

238fd8e

…nd improve parent checkbox state handling in PopoverCascaderVariant

Merge branch 'main' into feat/improve-cascade-entity-selector-ui

6b42df4

Fix make_sample_pdfs for the new test set columns

68ffcbd

docs(agents): add entity display name rules for workflows

74189e3

feat: streamline AutoSelectHandler and simplify PopoverCascaderVarian…

047b5f8

…t integration

Merge branch 'main' into fix/sdk-eval-breadcrumb-label

0b197b1

fix(frontend): use kind=custom for SDK evals breadcrumb link to match…

3c2a226

… evaluations page contract

v0.103.5

1e1c638

dosubot Bot added the size:XS This PR changes 0-9 lines, ignoring generated files. label Jun 13, 2026

vercel Bot deployed to Preview June 13, 2026 11:25 View deployment

Merge branch 'main' into fix/sdk-eval-breadcrumb-label

1dba9e4

Merge pull request #4630 from Agenta-AI/feat/improve-cascade-entity-s…

3254cbb

…elector-ui [Feat]: improve cascade entity selector UI

vercel Bot deployed to Preview June 15, 2026 13:47 View deployment

bekossy added 2 commits June 15, 2026 15:53

Refactor workflow latest revision atom family to prioritize dedicated…

21c9aa8

… query and improve cache handling

Merge pull request #4560 from axelray-dev/fix-4535-evaluator-button-n…

282fe4c

…avigation [4535] fix(frontend): fix evaluator playground navigation from trace drawer

vercel Bot deployed to Preview June 15, 2026 14:02 View deployment

Merge pull request #4637 from Agenta-AI/refactor/single-project-batch…

6d24636

…-fetchers refactor(frontend): drop per-project fan-out from all batch fetchers

vercel Bot deployed to Preview June 15, 2026 14:25 View deployment

jp-agenta and others added 3 commits June 15, 2026 16:27

revert crash from v0.100.4

d276fa1

clean up dead coded in api

9d8d3f7

Merge pull request #4607 from Agenta-AI/claude/cv-classifier-demo-oug3jb

1d42627

feat(examples): CV screening demo with feedback-to-deploy walkthrough

dosubot Bot added size:XXL This PR changes 1000+ lines, ignoring generated files. and removed size:L This PR changes 100-499 lines, ignoring generated files. labels Jun 15, 2026

vercel Bot deployed to Preview June 15, 2026 14:33 View deployment

jp-agenta and others added 5 commits June 15, 2026 16:42

fixes the issue in ee

da524fa

copy fix

7c0cfb6

Add INVITE_EMAIL_MISMATCH and clean up copy

93b2039

quick error fix

7890f85

web prettier fix

1ec4b0b

vercel Bot deployed to Preview June 15, 2026 15:10 View deployment

Merge branch 'main' into release/v0.103.5

325787b

vercel Bot deployed to Preview June 15, 2026 15:12 View deployment

jp-agenta and others added 5 commits June 15, 2026 17:12

fix NOT YOUR INVITE

8767cd2

Merge branch 'release/v0.103.5' into fix/broken-invites-in-oss-again

e0f731a

Potential fix for pull request finding

7c96095

Co-authored-by: Copilot Autofix powered by AI <175728472+Copilot@users.noreply.github.com>

fix(api): ruff format organization_service.py

c042b1c

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

Merge pull request #4704 from Agenta-AI/fix/broken-invites-in-oss-again

c558de5

[fix] Resolve broken invites in OSS (again)

vercel Bot deployed to Preview June 15, 2026 16:53 View deployment

bekossy approved these changes Jun 15, 2026

View reviewed changes

dosubot Bot added the lgtm This PR has been approved by a maintainer label Jun 15, 2026

bekossy merged commit 58a5cca into main Jun 15, 2026
31 of 32 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[release] v0.103.5#4690

[release] v0.103.5#4690
bekossy merged 77 commits into
mainfrom
release/v0.103.5

github-actions Bot commented Jun 13, 2026

Uh oh!

vercel Bot commented Jun 13, 2026 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

10 participants

Conversation

github-actions Bot commented Jun 13, 2026

Uh oh!

vercel Bot commented Jun 13, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

10 participants

vercel Bot commented Jun 13, 2026 •

edited

Loading