Running 12 Defeating the trainer-generator precision mismatch in TRL 🎯 12 Download research PDF (Pro access required)
nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-BF16 Text Generation • 124B • Updated 10 days ago • 577k • 334
nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-Base-BF16 Text Generation • 124B • Updated Mar 14 • 15k • 27