BERT-as-a-Judge: A Robust Alternative to Lexical Methods for Efficient Reference-Based LLM Evaluation Paper • 2604.09497 • Published Apr 10 • 29
BidirLM: Turning Generative LLMs into the Best Open-Source Omnimodal Encoders Article • Nicolas-BZRD • Apr 7 • 26
A framework and leaderboard for Retrieval Pipelines evaluation on ViDoRe v3 Article • antoineedy • Feb 27 • 12
ViDoRe V3: A Comprehensive Evaluation of Retrieval Augmented Generation in Complex Real-World Scenarios Paper • 2601.08620 • Published Jan 13 • 12
Does It Tie Out? Towards Autonomous Legal Agents in Venture Capital Paper • 2512.18658 • Published Dec 21, 2025 • 11
Vidore Leaderboard 🥇 Space • Running • Agents • 207 • Browse and compare visual document retrieval model scores
ViDoRe V3: a comprehensive evaluation of retrieval for enterprise use-cases Article • QuentinJG • Nov 5, 2025 • 64