view article Article PrediBench: Testing AI models on prediction markets charles-azam • Sep 24, 2025 • 5
view article Article ScreenEnv: Deploy your full stack Desktop Agent A-Mahla, m-ric • Jul 10, 2025 • 76
view article Article SmolLM3: smol, multilingual, long-context reasoner +21 eliebak, cmpatino, anton-l, edbeeching, m-ric, nouamanetazi, akseljoonas, guipenedo, hynky, clefourrier, SaylorTwift, kashif, qgallouedec, hlarcher, glutamatt, Xenova, reach-vb, ngxson, craffel, lewtun, loubnabnl, lvwerra, thomwolf • Jul 8, 2025 • 773
view article Article ScreenSuite - The most comprehensive evaluation suite for GUI Agents! +1 A-Mahla, m-ric, thomwolf • Jun 6, 2025 • 56
view article Article CodeAgents + Structure: A Better Way to Execute Actions akseljoonas, m-ric • May 28, 2025 • 82
view article Article Trace & Evaluate your Agent with Arize Phoenix +1 schavalii, jgilhuly16, m-ric • Feb 28, 2025 • 41
view article Article Open-source DeepResearch – Freeing our search agents +3 m-ric, albertvillanova, merve, thomwolf, clefourrier • Feb 4, 2025 • 1.32k
view article Article DABStep: Data Agent Benchmark for Multi-step Reasoning +5 eggie5, martinigoyanes, frisokingma, andreumora, lvwerra, thomwolf, m-ric • Feb 4, 2025 • 128
view article Article We now support VLMs in smolagents! +1 m-ric, merve, albertvillanova • Jan 24, 2025 • 113
view article Article Introducing smolagents: simple agents that write actions in code. +1 m-ric, merve, thomwolf • Dec 31, 2024 • 1.19k
view article Article Expert Support case study: Bolstering a RAG app with LLM-as-a-Judge +1 Vinsingh, rajgreen, m-ric • Oct 28, 2024 • 29
view article Article Expert Support case study: Bolstering a RAG app with LLM-as-a-Judge +1 Vinsingh, rajgreen, m-ric • Oct 28, 2024 • 29
view article Article Our Transformers Code Agent beats the GAIA benchmark 🏅 m-ric, sergeipetrov • Jul 1, 2024 • 100
view article Article Extracting Concepts from LLMs: Anthropic’s recent discoveries 📖 m-ric • Jun 20, 2024 • 26
view article Article CodeAgents + Structure: A Better Way to Execute Actions akseljoonas, m-ric • May 28, 2025 • 82
view article Article License to Call: Introducing Transformers Agents 2.0 +1 m-ric, lysandre, pcuenq • May 13, 2024 • 137
view article Article Open-source LLMs as LangChain Agents +1 m-ric, Jofthomas, andrewrreed • Jan 24, 2024 • 78
view article Article Open-source LLMs as LangChain Agents +1 m-ric, Jofthomas, andrewrreed • Jan 24, 2024 • 78