Issues · huggingface/trl

[Tracking issue] General dataset support

#2071 opened Sep 15, 2024 by qgallouedec

Open

[Tracking issue] Integrate native liger-kernel losses

#2495 opened Dec 17, 2024 by qgallouedec

Open 2

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

133 Open 1,196 Closed

🐛 bug 🏋 GKD

#2586 opened Jan 17, 2025 by Omar-Deepshard

5 tasks done

🏋 ORPO ❓ question

#2570 opened Jan 15, 2025 by vitalyshalumov

7 of 9 tasks

🐛 bug 🏋 DPO 👁️ VLM

#2563 opened Jan 12, 2025 by liuchaohu

5 of 9 tasks

⚡accelerate 🏋 PPO 🏋 RLOO

#2555 opened Jan 10, 2025 by dawidm

7 of 9 tasks

✨ enhancement 🏋 KTO

#2554 opened Jan 10, 2025 by starmpcc

🐛 bug 🏋 DPO

#2553 opened Jan 9, 2025 by solume

7 of 9 tasks

❓ question 🏋 SFT

#2545 opened Jan 6, 2025 by okhat

Is truncation_mode used in DPOTrainer? 🏋 DPO ❓ question

#2538 opened Jan 2, 2025 by anakin87

🏋 DPO 🙋 help from community wanted ⚡ PEFT

#2536 opened Jan 2, 2025 by maoulee

7 of 9 tasks

🙋 help from community wanted 🏋 PPO

#2534 opened Dec 31, 2024 by SachinVashisth

🐛 bug 🚀 deepspeed ⏳ needs more info 🏋 Online DPO

#2532 opened Dec 30, 2024 by yiyepiaoling0715

5 of 9 tasks

✨ enhancement 🏋 Online DPO 🏋 PPO 🏋 RLOO

#2529 opened Dec 28, 2024 by dawidm

✨ enhancement

#2525 opened Dec 28, 2024 by August-murr

3 tasks done

ProTip! Updated in the last three days: updated:>2025-01-14.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Issues: huggingface/trl

Issues list