Rethinking Memory as Continuously Evolving Connectivity • Paper • 2605.28773 • Published 5 days ago • 27
DVAO: Dynamic Variance-adaptive Advantage Optimization for Multi-reward Reinforcement Learning • Paper • 2605.25604 • Published 7 days ago • 132
ProRL: Effective Reinforcement Learning for Proactive Recommendation via Rectified Policy Gradient Estimation • Paper • 2605.28293 • Published 5 days ago • 80
From Pixels to Words -- Towards Native One-Vision Models at Scale • Paper • 2605.28820 • Published 5 days ago • 68
MemTrace: Tracing and Attributing Errors in Large Language Model Memory Systems • Paper • 2605.28732 • Published 5 days ago • 37
Agent Explorative Policy Optimization for Multimodal Agentic Reasoning • Paper • 2605.28774 • Published 5 days ago • 79
JLT: Clean-Latent Prediction in Latent Diffusion Transformers • Paper • 2605.27102 • Published 6 days ago • 30
Negligible in Size, Significant in Effect: On Scale Vectors in Large Language Models • Paper • 2605.26895 • Published 6 days ago • 17
Self-Improving Language Models with Bidirectional Evolutionary Search • Paper • 2605.28814 • Published 5 days ago • 54
Length Value Model: Scalable Value Pretraining for Token-Level Length Modeling • Paper • 2604.27039 • Published Apr 29 • 25
Why Fine-Tuning Encourages Hallucinations and How to Fix It • Paper • 2604.15574 • Published Apr 16 • 25
AgriIR: A Scalable Framework for Domain-Specific Knowledge Retrieval • Paper • 2604.16353 • Published Mar 17 • 4
GQLA: Group-Query Latent Attention for Hardware-Adaptive Large Language Model Decoding • Paper • 2605.15250 • Published 18 days ago • 13
EndPrompt: Efficient Long-Context Extension via Terminal Anchoring • Paper • 2605.14589 • Published 18 days ago • 17