Will It Still Be True Tomorrow? Multilingual Evergreen Question Classification to Improve Trustworthy QA Paper • 2505.21115 • Published May 27, 2025 • 144
AgentSocialBench: Evaluating Privacy Risks in Human-Centered Agentic Social Networks Paper • 2604.01487 • Published Apr 1 • 10
The AI Scientist-v2: Workshop-Level Automated Scientific Discovery via Agentic Tree Search Paper • 2504.08066 • Published Apr 10, 2025 • 22
MemGovern: Enhancing Code Agents through Learning from Governed Human Experiences Paper • 2601.06789 • Published Jan 11 • 82
Running Agents 43 Ko-FreshQA Leaderboard 🚀 43 Explore, submit, and download Korean QA leaderboard data
kresnik/wav2vec2-large-xlsr-korean Automatic Speech Recognition • 0.3B • Updated Jul 3, 2023 • 753k • 56