Why Does Self-Distillation (Sometimes) Degrade the Reasoning Capability of LLMs? Paper • 2603.24472 • Published 25 days ago • 53 • 7
Jina-VLM: Small Multilingual Vision Language Model Paper • 2512.04032 • Published Dec 3, 2025 • 15 • 4
MemMamba: Rethinking Memory Patterns in State Space Model Paper • 2510.03279 • Published Sep 28, 2025 • 74 • 3