nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-BF16 Text Generation • 124B • Updated Apr 29 • 1.04M • • 385
RedHatAI/Meta-Llama-3.1-8B-Instruct-FP8-dynamic Text Generation • 8B • Updated 11 days ago • 47.2k • 9
view article Article Building Tensors from Scratch in Rust (Part 1.2): View Operations KeighBee • Jun 18, 2025 • 4
Running 601 Scaling test-time compute 📈 601 Boost LLM answers with flexible test‑time search strategies
Search-R1 Collection Preliminary checkpoints with outcome-only RL. • 15 items • Updated Aug 12, 2025 • 18
Running Agents 432 Reward Bench Leaderboard 📐 432 Explore and compare model scores on RewardBench benchmarks
Skywork/Skywork-Reward-Llama-3.1-8B-v0.2 Text Classification • 8B • Updated Oct 25, 2024 • 71.6k • 43
Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention Paper • 2502.11089 • Published Feb 16, 2025 • 170