feat(traces): superadmin source-code context for the evaluator by albanm · Pull Request #35 · data-fair/agents

albanm · 2026-06-19T15:59:37Z

Give superadmins a restricted GitHub source-exploration tool in the trace evaluator, and fix the readArchitectureDoc tool it complements.

What changed:

Fix readArchitectureDoc: its enum-constrained topic contradicted a description telling the model to "pass an unknown topic" to list topics, so the AI SDK rejected the call before execute and the model looped on validation errors. Valid topics are now listed inline and the contradictory instruction removed.
Add a superadmin-only, read-only proxy GET /api/admin/github, scoped to data-fair/{agents,data-fair,portals} and json-layout/json-layout, with an optional GITHUB_TOKEN (unauthenticated fallback + a startup hint when unset).
Add an explore_github evaluator tool calling that proxy, gated on adminMode. Its description points at each repo's starting paths (incl. json-layout core/src/webmcp/tools/, the MCP form tools behind the *_form sub-agents). Account admins keep the docs-only evaluator; readArchitectureDoc is retained for everyone as the cheap first hop.

Why: the evaluator's curated architecture docs are lossy; superadmins need source as ground truth (actual prompts, tool schemas, the assistant's tools in data-fair/portals, and the json-layout form-MCP tools) to make precise recommendations.

Heads-up: the UI adminMode gate must stay in lockstep with the server's reqAdminMode — that parity is the access model. api/config/type/.type/ is generated by build-types; regenerate it whenever the config schema changes.

…e enum The tool's topic param is constrained to an enum of real doc names, but the description and system prompt told the model to "pass an unknown topic to list available topics". The AI SDK rejects out-of-enum args before execute() runs, so the model never reached the listing fallback — it just got repeated validation errors and gave up. List the valid topics inline (in the description and prompt) so the model picks a valid enum value on the first call. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

…nfig Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

…eTools Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

Whitelist json-layout/json-layout in the github proxy and point explore_github at core/src/webmcp/tools/ — the MCP form tools backing the *_form form sub-agents (pageConfig_form, portalConfig_form). Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

albanm and others added 7 commits June 19, 2026 16:38

feat(admin): pure helpers for github source proxy

5a3eac6

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

feat(admin): superadmin github source proxy route + optional token co…

8a80808

…nfig Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

feat(traces): explore_github source tool calling the admin proxy

1bbc16c

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

feat(traces): gate source tools + prompt addendum behind includeSourc…

fa90eb9

…eTools Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

feat(traces): offer source tools to superadmins in trace review

fd6b5dc

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

github-actions Bot added the feature label Jun 19, 2026

albanm merged commit 3632896 into main Jun 19, 2026
3 checks passed

albanm deleted the feat-eval-context branch June 19, 2026 15:59

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(traces): superadmin source-code context for the evaluator#35

feat(traces): superadmin source-code context for the evaluator#35
albanm merged 7 commits into
mainfrom
feat-eval-context

albanm commented Jun 19, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

albanm commented Jun 19, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant