Runtime error Featured 141 smolagents LLM leaderboard 🏆 141 A leaderboard for LLMs powering smolagents
Sleeping Agents 4 CompassJudger Subjective Evaluation Learderboard 🌎 4 CompassJudger Subjective Evaluation Learderboard
PatronusAI/Llama-3-Patronus-Lynx-70B-Instruct Text Generation • 71B • Updated Jul 22, 2024 • 13 • • 32
Running on CPU Upgrade Agents 76 AIR-Bench Leaderboard 🥇 76 Explore and compare QA and long doc benchmarks