yuxinlu1/gemma-4-12B-coder-fable5-composer2.5-v1-GGUF Text Generation • 12B • Updated 13 days ago • 597k • 2.55k
view article Article Enabling Long Context Training with Sequence Parallelism in Axolotl axolotl-ai-co • Apr 4, 2025 • 17
Running 195 The ultimate guide to RL environments: building and scaling them in the LLM era 📝 195 Building and scaling RL environments for LLM training
view article Article Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries +7 aminediroHF, qgallouedec, kashif, lewtun, edbeeching, albertvillanova, nouamanetazi, lvwerra, sergiopaniego • Mar 10 • 165