Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability Paper • 2604.06628 • Published 13 days ago • 317
SkillClaw: Let Skills Evolve Collectively with Agentic Evolver Paper • 2604.08377 • Published 12 days ago • 280
Graph of Skills: Dependency-Aware Structural Retrieval for Massive Agent Skills Paper • 2604.05333 • Published 14 days ago • 22
HY-Embodied-0.5: Embodied Foundation Models for Real-World Agents Paper • 2604.07430 • Published 13 days ago • 184
MolmoWeb: Open Visual Web Agent and Open Data for the Open Web Paper • 2604.08516 • Published 12 days ago • 42
LatentUM: Unleashing the Potential of Interleaved Cross-Modal Reasoning via a Latent-Space Unified Model Paper • 2604.02097 • Published 19 days ago • 32
LongCat-Next: Lexicalizing Modalities as Discrete Tokens Paper • 2603.27538 • Published 22 days ago • 144
VGGRPO: Towards World-Consistent Video Generation with 4D Latent Reward Paper • 2603.26599 • Published 24 days ago • 64
EgoSim: Egocentric World Simulator for Embodied Interaction Generation Paper • 2604.01001 • Published 20 days ago • 38
Out of Sight but Not Out of Mind: Hybrid Memory for Dynamic Video World Models Paper • 2603.25716 • Published 25 days ago • 156
Brevity Constraints Reverse Performance Hierarchies in Language Models Paper • 2604.00025 • Published Mar 11 • 23
The Model Says Walk: How Surface Heuristics Override Implicit Constraints in LLM Reasoning Paper • 2603.29025 • Published 21 days ago • 13
Reasoning Shift: How Context Silently Shortens LLM Reasoning Paper • 2604.01161 • Published 19 days ago • 32
Proactive Agent Research Environment: Simulating Active Users to Evaluate Proactive Assistants Paper • 2604.00842 • Published 19 days ago • 14
MiroEval: Benchmarking Multimodal Deep Research Agents in Process and Outcome Paper • 2603.28407 • Published 21 days ago • 69
Marco DeepResearch: Unlocking Efficient Deep Research Agents via Verification-Centric Design Paper • 2603.28376 • Published 21 days ago • 23
The Latent Space: Foundation, Evolution, Mechanism, Ability, and Outlook Paper • 2604.02029 • Published 19 days ago • 142
A Survey of On-Policy Distillation for Large Language Models Paper • 2604.00626 • Published 19 days ago • 11