Reason, Then Re-reason: Cross-view Revisiting Improves Spatial Reasoning Paper • 2606.11683 • Published 2 days ago • 28
TRL-Bench: Standardizing Cross-Paradigm Representation-Level Evaluation of Tabular Encoders Paper • 2606.09323 • Published 4 days ago • 48
Beyond Scalar Rewards by Internalizing Reasoning into Score Distributions Paper • 2606.09076 • Published 4 days ago • 51
Toward Generalist Autonomous Research via Hypothesis-Tree Refinement Paper • 2606.11926 • Published 2 days ago • 68
Redesign Mixture-of-Experts Routers with Manifold Power Iteration Paper • 2606.12397 • Published 2 days ago • 75
Why Muon Outperforms Adam: A Curvature Perspective Paper • 2606.04662 • Published 9 days ago • 8
A Geometric Account of Activation Steering through Angle-Norm Decomposition Paper • 2606.06735 • Published 8 days ago • 20
Whisper Hallucination Detection and Mitigation via Hidden Representation Steering and Sparse AutoEncoders Paper • 2606.07473 • Published 7 days ago • 12
Human Psychometric Questionnaires Mischaracterize LLM Behavior Paper • 2509.10078 • Published 14 days ago • 33
FlashMemory-DeepSeek-V4: Lightning Index Ultra-Long Context via Lookahead Sparse Attention Paper • 2606.09079 • Published 4 days ago • 56
LatentSkill: From In-Context Textual Skills to In-Weight Latent Skills for LLM Agents Paper • 2606.06087 • Published 8 days ago • 59
WTF GENIUS PAPERS Collection Papers that made me appreciate my major and my life a little more. obs=Observation, innov=Innovation. Most papers are abt improving tiny models. • 169 items • Updated 2 days ago • 35
WTF GENIUS PAPERS Collection Papers that made me appreciate my major and my life a little more. obs=Observation, innov=Innovation. Most papers are abt improving tiny models. • 169 items • Updated 2 days ago • 35
JustRL: Scaling a 1.5B LLM with a Simple RL Recipe Paper • 2512.16649 • Published Dec 18, 2025 • 31
LayerRoute: Input-Conditioned Adaptive Layer Skipping via LoRA Fine-Tuning for Agentic Language Models Paper • 2606.01838 • Published 11 days ago • 2
Entropy as a Structural Prior: How a Log-Barrier on DiT Belief Space Drives Musical Diversity and Development Paper • 2606.07207 • Published 7 days ago • 3