Skip to content

Pull requests: EleutherAI/lm-evaluation-harness

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

Vllm get tokenizer
#1794 opened May 6, 2024 by AguirreNicolas Loading…
Minor features
#2249 opened Aug 25, 2024 by artemorloff Loading…
Add loncxt tasks
#2629 opened Jan 17, 2025 by baberabb Draft
[API] Add octoai back-end
#936 opened Oct 19, 2023 by vvchernov Loading…
Added no-softmax entries to MODEL_REGISTRY
#1052 opened Dec 2, 2023 by denizyuret Loading…
Add Selfcheckgpt evaluation to tasks
#1080 opened Dec 7, 2023 by erenup Loading…
add all vlsp
#1123 opened Dec 14, 2023 by qnguyen3 Draft
Add various social bias tasks
#1185 opened Dec 21, 2023 by oskarvanderwal Loading…
1 task
Adding new task: Boxes
#1557 opened Mar 11, 2024 by irafayabdul Loading…
add context-based requests processing
#1571 opened Mar 13, 2024 by artemorloff Loading…
Physics GRE task added
#1655 opened Apr 1, 2024 by ShayekhBinIslam Loading…
Addition of BedrockChatModel
#1708 opened Apr 16, 2024 by jacquelinegarrahan Loading…
Add ability to inject OpenAI client to LM
#1732 opened Apr 22, 2024 by ciaranby Loading…
Fix cost_estimate.py
#1810 opened May 8, 2024 by xksteven Loading…
Financial PhraseBank (FPB) Eval Metric
#1815 opened May 9, 2024 by bcicc Loading…
Implement Exams benchmark
#1852 opened May 17, 2024 by snova-zoltanc Loading…
ProTip! Add no:assignee to see everything that’s not assigned.