view article Article CircleGuardBench: New Standard for Evaluating AI Moderation Models May 7, 2025 • 60
Grounding World Simulation Models in a Real-World Metropolis Paper • 2603.15583 • Published Mar 16 • 153
view article Article Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries +7 Mar 10 • 126
togethercomputer/CoderForge-Preview-32B-SWE-Bench-Verified-Evaluation-trajectories Viewer • Updated Feb 2 • 500 • 84 • 13
LateOn-Code 💻 Collection State-of-the-art late interaction code retrieval models • 6 items • Updated 10 days ago • 17
view article Article LateOn-Code & ColGrep: LightOn unveils state-of-the-art code retrieval models and code search tooling Feb 12 • 53
Robust Speech Recognition via Large-Scale Weak Supervision Paper • 2212.04356 • Published Dec 6, 2022 • 53