view article Article Extract Text and Knowledge from Images with Open Vision Language Models dvilasuero • Oct 23, 2025 • 5
view article Article Unlock the power of images with AI Sheets +4 Ameeeee, dvilasuero, frascuchon, damianpumar, lvwerra, thomwolf • Oct 21, 2025 • 33
view article Article How to Choose the Best Open Source LLM for Your Project in 2025 dvilasuero • Sep 9, 2025 • 77
view article Article Introducing AI Sheets: a tool to work with datasets using open AI models! +4 dvilasuero, Ameeeee, frascuchon, damianpumar, lvwerra, thomwolf • Aug 8, 2025 • 109
view article Article Vibe coding for data science: how to label a dataset with Kimi K2 dvilasuero • Jul 22, 2025 • 22
view article Article LLM Hallucinations: bug or feature? The US Supreme Court 2025 cases experiment dvilasuero • Jul 8, 2025 • 19
view article Article FineWeb-C: A Community-Driven Dataset for Educational Quality Annotations in 122 Languages davanstrien • Jul 8, 2025 • 35
view article Article FineWeb2-C: Help Build Better Language Models in Your Language davanstrien • Dec 23, 2024 • 21
view article Article Introducing the Synthetic Data Generator - Build Datasets with Natural Language +4 davidberenstein1957, sdiazlor, Leiyre, dvilasuero, Ameeeee, burtenshaw • Dec 16, 2024 • 158
view article Article Open Preference Dataset for Text-to-Image Generation by the 🤗 Community +5 davidberenstein1957, burtenshaw, dvilasuero, davanstrien, sayakpaul, Ameeeee, linoyts • Dec 9, 2024 • 70
view article Article Let’s make a generation of amazing image generation models burtenshaw • Nov 26, 2024 • 33
view article Article Argilla 2.4: Easily Build Fine-Tuning and Evaluation Datasets on the Hub — No Code Required +1 nataliaElv, burtenshaw, dvilasuero • Nov 4, 2024 • 45
view article Article How to build a custom text classifier without days of human labeling sdiazlor • Oct 17, 2024 • 57
view article Article How to optimize your data labelling project with custom interfaces burtenshaw • Oct 16, 2024 • 20
view article Article 🔥 Argilla 2.0: the data-centric tool for AI makers 🤗 dvilasuero • Jul 30, 2024 • 39
view article Article Llama 3.1 - 405B, 70B & 8B with multilinguality and long context +6 philschmid, osanseviero, alvarobartt, lvwerra, dvilasuero, reach-vb, marcsun13, pcuenq • Jul 23, 2024 • 241
view article Article How we leveraged distilabel to create an Argilla 2.0 Chatbot +3 plaguss, gabrielmbmb, sdiazlor, osanseviero, dvilasuero • Jul 16, 2024 • 33