-
Notifications
You must be signed in to change notification settings - Fork 1.9k
Pull requests: huggingface/open-r1
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Simplify dependencies set up for development mode: update makefile and readme
#449
opened Feb 28, 2025 by
ocramz
Loading…
feat: make reward functions to support parallel computation
#398
opened Feb 23, 2025 by
0x404
Loading…
New GRPO dataset and task: formally-verified program correctness
#379
opened Feb 20, 2025 by
ocramz
Loading…
Fix: Default value of
cosine_min_value_wrong
parameter
#305
opened Feb 13, 2025 by
zhangsheng377
Loading…
Simplified installation requirements to support more accelerators
#303
opened Feb 13, 2025 by
ji-huazhong
Loading…
[GRPO] generate with prompt containing the first <think> tag
#283
opened Feb 11, 2025 by
kashif
Loading…
Fix: Avoid empty keyword argument in VLLMModelConfig from Makefile
#246
opened Feb 8, 2025 by
mattdepaolis
Loading…
Replace the base model deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B to Qwen/Qwen2.5-1.5B-Instruct in GRPO
#198
opened Feb 5, 2025 by
DVampire
Loading…
Previous Next
ProTip!
What’s not been updated in a month: updated:<2025-01-28.