When to Memorize and When to Stop: Gated Recurrent Memory for Long-Context Reasoning Paper • 2602.10560 • Published 28 days ago • 29
OPUS: Towards Efficient and Principled Data Selection in Large Language Model Pre-training in Every Iteration Paper • 2602.05400 • Published Feb 5 • 347
Weak-Driven Learning: How Weak Agents make Strong Agents Stronger Paper • 2602.08222 • Published about 1 month ago • 282