Pull requests: allenai/open-instruct

Pull requests list

remove_non_tensor_columns
#831 opened Jul 26, 2025 by garrett361
look at changes
#825 opened Jul 24, 2025 by jacob-morrison
Fix head so grpo_fast.py can run.
#821 opened Jul 24, 2025 by finbarrtimbers
first iteration
#812 opened Jul 22, 2025 by jacob-morrison Draft
perf penalty
#805 opened Jul 21, 2025 by saurabh111233212
[WIP] replay buffer
#776 opened Jul 11, 2025 by mnoukhov Draft
Saurbahs/diff filtering
#774 opened Jul 11, 2025 by saurabh111233212 Draft
next olmo and rl from base
#767 opened Jul 9, 2025 by mnoukhov
4 tasks done
[WIP] add long-form rl-rag reward
#729 opened Jun 19, 2025 by RulinShao
[WIP] better single gpu performance
#725 opened Jun 16, 2025 by mnoukhov Draft
Add adaptive majority voting for GRPO training
#684 opened May 22, 2025 by AfraAmini