- The Ultra-Scale Playbook (3.85k): The ultimate guide to training LLM on large GPU Clusters
- The Smol Training Playbook (3.18k): The secrets to building world-class LLMs
- Evaluation Guidebook (321): Explore LLM benchmark trends over time
- The Synthetic Data Playbook: Generating Trillions of the Finest Tokens (236): Explore synthetic data experiments on a virtual bookshelf
Nima Nooshiri
nimanzik
AI & ML interests
None yet
Recent Activity
- upvoted an article about 7 hours ago: KV Caching Explained: Optimizing Transformer Inference Efficiency
- updated a collection 1 day ago: Hugging Face Playbooks & Guidebooks
- published a model 15 days ago: nimanzik/totem-reproduction-vqvae