GradMem: Learning to Write Context into Memory with Test-Time Gradient Descent Paper • 2603.13875 • Published Mar 14 • 36
Prism-Reranker: Beyond Relevance Scoring -- Jointly Producing Contributions and Evidence for Agentic Retrieval Paper • 2604.23734 • Published 20 days ago • 3
view article Article DenseOn with the LateOn: Open State-of-the-Art Single and Multi-Vector Models lightonai • 24 days ago • 38
view article Article Training and Finetuning Multimodal Embedding & Reranker Models with Sentence Transformers tomaarsen • 30 days ago • 71
view article Article Multimodal Embedding & Reranker Models with Sentence Transformers tomaarsen • Apr 9 • 59
Beyond Hard Negatives: The Importance of Score Distribution in Knowledge Distillation for Dense Retrieval Paper • 2604.04734 • Published Apr 6 • 12
view article Article BidirLM: Turning Generative LLMs into the Best Open-Source Omnimodal Encoders Nicolas-BZRD • Apr 7 • 27
BidirLM: From Text to Omnimodal Bidirectional Encoders by Adapting and Composing Causal LLMs Paper • 2604.02045 • Published Apr 2 • 37
view article Article **ColBERT-Zero: To Pre-train Or Not To Pre-train ColBERT models?** lightonai • Feb 19 • 21
ColBERT-Zero: To Pre-train Or Not To Pre-train ColBERT models Paper • 2602.16609 • Published Feb 18 • 7
LateOn-Code 💻 Collection State-of-the-art late interaction code retrieval models • 6 items • Updated Apr 7 • 20
view article Article LateOn-Code & ColGrep: LightOn unveils state-of-the-art code retrieval models and code search tooling lightonai • Feb 12 • 56
view article Article Community Evals: Because we're done trusting black-box leaderboards over the community +5 burtenshaw, SaylorTwift, kramp, merve, davanstrien, nielsr, julien-c • Feb 4 • 89
view article Article Nemotron ColEmbed V2: Raising the Bar for Multimodal Retrieval with ViDoRe V3’s Top Model nvidia • Feb 4 • 28
T-pro 2.0: An Efficient Russian Hybrid-Reasoning Model and Playground Paper • 2512.10430 • Published Dec 11, 2025 • 119