Skip to content

Pull requests: EleutherAI/lm-evaluation-harness

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

fix: preserve first line in mbpp code extraction
#3710 opened Apr 15, 2026 by yu2001-s Loading…
Add diagnostic columns for answer-not-found and invalid-filter tracking
#3709 opened Apr 15, 2026 by fxmarty-amd Contributor Loading…
2 tasks
2
Add MolecularIQ chemistry reasoning benchmark task
#3707 opened Apr 15, 2026 by cbartmann Loading…
[dev] 0.5 review
#3703 opened Apr 14, 2026 by fxmarty-amd Contributor Loading…
Adding Cruxeval
#3699 opened Apr 12, 2026 by ThomasHeap Loading…
5 of 6 tasks
Fix median aggregation returning arbitrary element instead of median
#3696 opened Apr 12, 2026 by Chessing234 Loading…
1 of 2 tasks
fix(vllm): add ray to vllm extras (#3688)
#3694 opened Apr 10, 2026 by kvr06-ai Loading…
refactor(5.0)
#3690 opened Apr 8, 2026 by baberabb Contributor Loading…
ProTip! Find all pull requests that aren't related to any open issues with -linked:issue.