Running 3.86k The Ultra-Scale Playbook π 3.86k The ultimate guide to training LLM on large GPU Clusters
Running Featured 1.35k FineWeb: decanting the web for the finest text data at scale π· 1.35k Explore and download the FineWeb webβscale text dataset
nvidia/Llama-3_3-Nemotron-Super-49B-v1_5-NVFP4 Text Generation β’ 26B β’ Updated Nov 27, 2025 β’ 3.88k β’ 20
Running on CPU Upgrade Featured 3.2k The Smol Training Playbook π 3.2k The secrets to building world-class LLMs
IlyaGusev/gemma-2-2b-it-abliterated Text Generation β’ 3B β’ Updated Jul 31, 2024 β’ 2.26k β’ β’ 51
Vikhrmodels/Vikhr-Llama-3.2-1B-Instruct Text Generation β’ 1B β’ Updated Sep 27, 2024 β’ 2.15k β’ β’ 47
Vikhrmodels/Vikhr-Qwen-2.5-0.5B-instruct-GGUF Text Generation β’ 0.5B β’ Updated Oct 6, 2024 β’ 328 β’ 9