HuggingFaceM4/siglip-so400m-14-980-flash-attn2-navit Zero-Shot Image Classification • 0.9B • Updated Mar 7, 2024 • 340 • 53
Running on CPU Upgrade Featured 3.07k The Smol Training Playbook 📚 3.07k The secrets to building world-class LLMs
Running 3.76k The Ultra-Scale Playbook 🌌 3.76k The ultimate guide to training LLM on large GPU Clusters
deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B Text Generation • 2B • Updated Feb 24, 2025 • 831k • • 1.46k
Running 595 Scaling test-time compute 📈 595 Run advanced search strategies to boost LLM problem solving