view article Article Enabling Long Context Training with Sequence Parallelism in Axolotl axolotl-ai-co • Apr 4, 2025 • 17
Running 127 The ultimate guide to RL environments: building and scaling them in the LLM era 📝 127 Building and scaling RL environments for LLM training
view article Article Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries +7 aminediroHF, qgallouedec, kashif, lewtun, edbeeching, albertvillanova, nouamanetazi, lvwerra, sergiopaniego • Mar 10 • 147
nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-FP8 Text Generation • 124B • Updated 12 days ago • 372k • 244