LLMs as Noisy Channels: A Shannon Perspective on Model Capacity and Scaling Laws Paper • 2605.23901 • Published 10 days ago • 13
Gamma-World: Generative Multi-Agent World Modeling Beyond Two Players Paper • 2605.28816 • Published 5 days ago • 408
OpenComputer: Verifiable Software Worlds for Computer-Use Agents Paper • 2605.19769 • Published 13 days ago • 81
EvalVerse: Pipeline-Aware and Expert-Calibrated Benchmarking for Professional Cinematic Video Generation Paper • 2605.23271 • Published 10 days ago • 78
Anti-Self-Distillation for Reasoning RL via Pointwise Mutual Information Paper • 2605.11609 • Published 20 days ago • 195
DelTA: Discriminative Token Credit Assignment for Reinforcement Learning from Verifiable Rewards Paper • 2605.21467 • Published 12 days ago • 204
Capturing LLM Capabilities via Evidence-Calibrated Query Clustering Paper • 2605.17110 • Published 16 days ago • 2
Video2GUI: Synthesizing Large-Scale Interaction Trajectories for Generalized GUI Agent Pretraining Paper • 2605.14747 • Published 18 days ago • 145
Does Synthetic Layered Design Data Benefit Layered Design Decomposition? Paper • 2605.15167 • Published 18 days ago • 8
BalCapRL: A Balanced Framework for RL-Based MLLM Image Captioning Paper • 2605.07394 • Published 24 days ago • 5
Leveraging Verifier-Based Reinforcement Learning in Image Editing Paper • 2604.27505 • Published Apr 30 • 57
Beyond Accuracy: Unveiling Inefficiency Patterns in Tool-Integrated Reasoning Paper • 2604.05404 • Published Apr 7 • 43