feat(traces): give the trace evaluator data-fair data-exploration tools by albanm · Pull Request #37 · data-fair/agents

albanm · 2026-06-19T16:25:00Z

Adds read-only data-exploration tools to the admin trace evaluator so it can check a reviewed session against the actual data instead of judging from the trace text alone.

New evaluator-data-tools.ts builds list_datasets, describe_dataset, get_dataset_schema, search_data, aggregate_data, calculate_metric, get_field_values (reused from @data-fair/agent-tools-data-fair) plus an evaluator-specific get_dataset_metadata_raw for the full untrimmed metadata.
Tools call the same-origin data-fair API with the reviewer's session cookie, scoped to the conversation's owner account (owner=type:id[:department]), so a superadmin reviewing account X explores X's data. They are always registered and degrade gracefully when data-fair is unreachable.
Merged flat into EvaluatorChat's tools; department threaded through TraceReview; evaluator prompt updated.
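
For illustration only, the owner scoping and graceful degradation described above could be sketched roughly as follows. This is not the code from evaluator-data-tools.ts: the `Owner` shape, `ownerFilter`, `listDatasetsSafe`, the `ToolResult` type, the injected `fetcher` and the `/data-fair/api/v1` base path are all assumptions made for the example. Only the owner=type:id[:department] format and the same-origin cookie behaviour come from the description.

```typescript
// Hypothetical owner shape; the field names are assumptions for this sketch.
interface Owner {
  type: 'user' | 'organization'
  id: string
  department?: string
}

// Build the owner filter in the owner=type:id[:department] form.
function ownerFilter (owner: Owner): string {
  const base = `${owner.type}:${owner.id}`
  return owner.department ? `${base}:${owner.department}` : base
}

// Hypothetical result type: a tool never throws, it reports failure as data.
type ToolResult = { ok: true, data: unknown } | { ok: false, error: string }

// Minimal fetch-like signature so the sketch can run without a browser.
type Fetcher = (
  url: string,
  init?: { credentials?: 'same-origin' | 'include' | 'omit' }
) => Promise<{ ok: boolean, status: number, json: () => Promise<unknown> }>

// List datasets for the conversation's owner. The base path is an assumption.
async function listDatasetsSafe (
  owner: Owner,
  fetcher: Fetcher,
  basePath = '/data-fair/api/v1'
): Promise<ToolResult> {
  const url = `${basePath}/datasets?owner=${encodeURIComponent(ownerFilter(owner))}`
  try {
    // same-origin credentials send the reviewer's session cookie
    const res = await fetcher(url, { credentials: 'same-origin' })
    if (!res.ok) {
      return { ok: false, error: `data-fair responded with status ${res.status}` }
    }
    return { ok: true, data: await res.json() }
  } catch (err) {
    // Network failure: degrade to an error result instead of throwing
    const message = err instanceof Error ? err.message : String(err)
    return { ok: false, error: `data-fair is unreachable: ${message}` }
  }
}
```

Returning an error result rather than throwing is one way to keep the tools registered unconditionally: the evaluator can still call them and simply learns that the data was not available.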

Why: so the evaluator, when opened in a data-fair context, has data-exploration tools like the normal agent and can extend its evaluation to the real data (e.g. missing dataset descriptions, schema quality, whether a search would return rows).

Heads-up: superadmin cross-account exploration relies on an active adminMode session (no data-fair change); publicationSite scoping is deferred (not captured in traces). Adds the @data-fair/agent-tools-data-fair UI dependency.

Read-only data-exploration tools for the trace evaluator, reusing @data-fair/agent-tools-data-fair here and calling the same-origin data-fair API scoped to the conversation's owner account. Includes a raw metadata tool for metadata-quality and tool-coverage evaluation. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

albanm and others added 9 commits June 19, 2026 17:34

docs: implementation plan for evaluator data-exploration tools

e8a7266

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

feat(evaluator): owner-scoped list_datasets data tool

6e70bfc

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

feat(evaluator): add dataset describe/schema/search/aggregate/metric/…

2506d1f

…field-values tools Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

feat(evaluator): add get_dataset_metadata_raw tool

3299182

Also covers the search_data next-pagination branch with a test. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

feat(evaluator): expose data-exploration tools in the trace review chat

b459241

Merge buildEvaluatorDataTools into the evaluator localTools and thread the conversation owner's department through TraceReview -> EvaluatorChat. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

fix(evaluator): widen TraceReview loaded-event owner type for department

5294344

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

feat(evaluator): document data-exploration tools in the prompt

cdb16d5

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

test(evaluator): assert search_data next uses the real absolute URL c…

1308cb5

…ontract Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

github-actions Bot added the feature label Jun 19, 2026

albanm added 2 commits June 19, 2026 18:27

Merge remote-tracking branch 'origin/main' into feat-eval-exploration

3a3073d

# Conflicts: # ui/src/components/EvaluatorChat.vue # ui/src/components/TraceReview.vue

Merge remote-tracking branch 'origin/main' into feat-eval-exploration

fc8203f

# Conflicts: # ui/src/components/TraceReview.vue

albanm merged commit 38aacda into main Jun 19, 2026
3 checks passed

albanm deleted the feat-eval-exploration branch June 19, 2026 17:35

albanm merged 11 commits into main from feat-eval-exploration
