Pull requests: stanford-crfm/helm

Pull requests list

Changed MMLU Pro for Non-COT Version
#3108 opened Oct 28, 2024 by siyagoel
Build frontend
#3105 opened Oct 27, 2024 by github-actions bot
Fix typo in downloading_raw_results.md
#3102 opened Oct 25, 2024 by arseniy-klimovskiy
GPQA Few-shot CoT, adapter part
#3099 opened Oct 24, 2024 by liamjxu
GPQA Few-shot CoT, spec part
#3097 opened Oct 24, 2024 by liamjxu
GPQA Few shot CoT, scenario part
#3096 opened Oct 24, 2024 by liamjxu
Added scenario for MMLU Pro
#3077 opened Oct 21, 2024 by siyagoel
IBM Enterprise Scenarios
#3064 opened Oct 16, 2024 by yifanmai Draft
Medhelm
#3038 opened Oct 2, 2024 by aunell
New safety scenario: HarmBench GCG-T
#3035 opened Oct 1, 2024 by farzaank
fix R/B channel switch in skin tone calculation
#2589 opened Apr 24, 2024 by rbitr
Documentation: Evaluation run lifecycle
#2506 opened Mar 25, 2024 by yifanmai
Remove AdapterSpec from metrics
#2244 opened Jan 17, 2024 by yifanmai Draft
Numeracy scenario update
#1978 opened Nov 2, 2023 by friedeggs