Jeremy Haschal
JermemyHaschal
AI & ML interests
None yet
Recent Activity
reacted
to
albertvillanova's
post with π€ about 4 hours ago
π TRL v0.29.0 introduces trl-training: an agent-native training skill.
This makes the TRL CLI a structured, agent-readable capability, allowing AI agents to reliably execute training workflows such as:
- Supervised Fine-Tuning (SFT)
- Direct Preference Optimization (DPO)
- Group Relative Policy Optimization (GRPO)
Weβre excited to see what the community builds on top of this.
If youβre working on AI agents, alignment research, or scalable RL training infrastructure: give TRL v0.29.0 a try! π€
The future of ML tooling is agent-native.
π https://github.com/huggingface/trl/releases/tag/v0.29.0 new activity
14 days ago
TheDrummer/Rocinante-X-12B-v1-GGUF:Comparison with Rivermind-Lux-12B-v1b? Organizations
None yet