Article How We Built a Semantic Highlight Model To Save Token Cost for RAG 21 days ago • 64
Featured 2.95k The Smol Training Playbook The secrets to building world-class LLMs
Article OpenReasoning-Nemotron: A Family of State-of-the-Art Distilled Reasoning Models Jul 18, 2025 • 50
Hf-native ColVision Models Collection Models that can be used with the native transformers 🤗 implementation instead of colpali-engine. • 4 items • Updated 27 days ago • 8
Article Training and Finetuning Reranker Models with Sentence Transformers v4 Mar 26, 2025 • 181
3.67k The Ultra-Scale Playbook The ultimate guide to training LLMs on large GPU clusters
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper • 2502.02737 • Published Feb 4, 2025 • 254
ColPali: Efficient Document Retrieval with Vision Language Models Paper • 2407.01449 • Published Jun 27, 2024 • 51