Running Featured 48 QED-Nano: Teaching a Tiny Model to Prove Hard Theorems ๐ 48 Who needs 1T parameters? Olympiad proofs with a 4B model
Code Llama Family Collection This collection hosts the transformers repos of the Code Llama release โข 12 items โข Updated Dec 6, 2024 โข 68
Llama 3.1 Evals Collection This collection provides detailed information on how we derived the reported benchmark metrics for the Llama 3.1 models, including the configurations, โข 6 items โข Updated Dec 6, 2024 โข 20
Llama 3.1 Collection This collection hosts the transformers and original repos of the Llama 3.1, Llama Guard 3 and Prompt Guard models โข 11 items โข Updated Dec 6, 2024 โข 709
view article Article GGML and llama.cpp join HF to ensure the long-term progress of Local AI +4 6 days ago โข 422
view post Post 7985 We're kick-starting the process of Transformers v5, with @ArthurZ and @cyrilvallez !v5 should be significant: we're using it as a milestone for performance optimizations, saner defaults, and a much cleaner code base worthy of 2025.Fun fact: v4.0.0-rc-1 came out on Nov 19, 2020, nearly five years ago! See translation 6 replies ยท ๐ 19 19 ๐ 9 9 ๐ฅ 6 6 + Reply
MapTrace: Scalable Data Generation for Route Tracing on Maps Paper โข 2512.19609 โข Published Dec 22, 2025 โข 2
Nanbeige4.1-3B: A Small General Model that Reasons, Aligns, and Acts Paper โข 2602.13367 โข Published 12 days ago โข 30
Audio dataset Collection N datasets showcase how to configure and load audio datasets โข 11 items โข Updated Aug 2, 2024 โข 6
Format: CSV and TSV Collection 6 datasets showcase how to configure and load CSV and TSV files. โข 6 items โข Updated Nov 23, 2023 โข 9