Issues · huggingface/trl

[Project] Training Agents with GRPO

#2723 opened Jan 31, 2025 by August-murr

Open 10

[Tracking issue] Integrate native liger-kernel losses

#2495 opened Dec 17, 2024 by qgallouedec

Open 6

[Tracking issue] Wrong loss scaling when accumulating gradient

#2617 opened Jan 23, 2025 by qgallouedec

Open

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

256 Open 1,292 Closed

⚡accelerate 🚀 deepspeed 🏋 DPO ✨ enhancement

#2985 opened Feb 28, 2025 by cyr0930

🚀 deepspeed 📚 documentation ✨ enhancement 🦥 unsloth

#2978 opened Feb 27, 2025 by ParagEkbote

⚡accelerate 🐛 bug 🏋 GRPO

#2977 opened Feb 27, 2025 by zaddy6

5 tasks done

🏋 GRPO 🏋 Reward

#2976 opened Feb 27, 2025 by L1n111ya

5 tasks done

🏋 GRPO ❓ question

#2972 opened Feb 27, 2025 by Tuziking

🏋 DPO ✨ enhancement

#2964 opened Feb 26, 2025 by ggbetz

DeepSpeedZeRoOffload is incompatible with DeepSpeed>=0.16.4 🐛 bug ⚡ PEFT

#2962 opened Feb 26, 2025 by jamesbraza

5 tasks done

⚡accelerate 🐛 bug

#2960 opened Feb 26, 2025 by Kfkcome

5 tasks done

compute_metrics in GRPOTrainer ✨ enhancement 🏋 GRPO

#2959 opened Feb 26, 2025 by dipta007

🏋 DPO ✨ enhancement

#2958 opened Feb 25, 2025 by jkx19

🏋 GRPO ❓ question

#2944 opened Feb 24, 2025 by MlSAKA-MlKOTO

🐛 bug 🏋 GRPO

#2942 opened Feb 24, 2025 by Marsella8

5 tasks done

🏋 GRPO ❓ question

#2941 opened Feb 24, 2025 by Tomsawyerhu

🏋 GRPO ❓ question

#2927 opened Feb 21, 2025 by Tuziking

🐛 bug 🏋 GRPO

#2923 opened Feb 21, 2025 by edwardzjl

ProTip! Add no:assignee to see everything that’s not assigned.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Issues: huggingface/trl

Issues list