Large-Scale Terminal Agentic Trajectory Generation from Dockerized Environments • Paper • 2602.01244 • Published 11 days ago • 14
OdysseyArena: Benchmarking Large Language Models For Long-Horizon, Active and Inductive Interactions • Paper • 2602.05843 • Published 7 days ago • 57
Self-Improving World Modelling with Latent Actions • Paper • 2602.06130 • Published 7 days ago • 27
MOVA: Towards Scalable and Synchronized Video-Audio Generation • Paper • 2602.08794 • Published 3 days ago • 145
HER: Human-like Reasoning and Reinforcement Learning for LLM Role-playing • Paper • 2601.21459 • Published 15 days ago • 9
SPARKLING: Balancing Signal Preservation and Symmetry Breaking for Width-Progressive Learning • Paper • 2602.02472 • Published 10 days ago • 44
CoDiQ: Test-Time Scaling for Controllable Difficult Question Generation • Paper • 2602.01660 • Published 11 days ago • 7
ConceptMoE: Adaptive Token-to-Concept Compression for Implicit Compute Allocation • Paper • 2601.21420 • Published 15 days ago • 42
Towards Reasoning Era: A Survey of Long Chain-of-Thought for Reasoning Large Language Models • Paper • 2503.09567 • Published Mar 12, 2025 • 1
Controlled Self-Evolution for Algorithmic Code Optimization • Paper • 2601.07348 • Published Jan 12 • 114
DeepResearchEval: An Automated Framework for Deep Research Task Construction and Agentic Evaluation • Paper • 2601.09688 • Published 29 days ago • 126
OS-Symphony: A Holistic Framework for Robust and Generalist Computer-Using Agent • Paper • 2601.07779 • Published Jan 12 • 28
MMLongCite: A Benchmark for Evaluating Fidelity of Long-Context Vision-Language Models • Paper • 2510.13276 • Published Oct 15, 2025
Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm • Paper • 2511.04570 • Published Nov 6, 2025 • 215
MAC-SLU: Multi-Intent Automotive Cabin Spoken Language Understanding Benchmark • Paper • 2512.01603 • Published Dec 1, 2025
Agent2World: Learning to Generate Symbolic World Models via Adaptive Multi-Agent Feedback • Paper • 2512.22336 • Published Dec 26, 2025 • 2