Baichuan-M3: Modeling Clinical Inquiry for Reliable Medical Decision-Making Paper • 2602.06570 • Published 4 days ago • 56 • 3
Canzona: A Unified, Asynchronous, and Load-Balanced Framework for Distributed Matrix-based Optimizers Paper • 2602.06079 • Published 6 days ago • 15 • 3
On the Entropy Dynamics in Reinforcement Fine-Tuning of Large Language Models Paper • 2602.03392 • Published 7 days ago • 48 • 4
CoPE: Clipped RoPE as A Scalable Free Lunch for Long Context LLMs Paper • 2602.05258 • Published 5 days ago • 6 • 4
SocialVeil: Probing Social Intelligence of Language Agents under Communication Barriers Paper • 2602.05115 • Published 5 days ago • 18 • 8
Privileged Information Distillation for Language Models Paper • 2602.04942 • Published 6 days ago • 23 • 4
DFlash: Block Diffusion for Flash Speculative Decoding Paper • 2602.06036 • Published 5 days ago • 38 • 2
Learning Rate Matters: Vanilla LoRA May Suffice for LLM Fine-tuning Paper • 2602.04998 • Published 5 days ago • 5 • 5
MemSkill: Learning and Evolving Memory Skills for Self-Evolving Agents Paper • 2602.02474 • Published 8 days ago • 51 • 4
Agent Primitives: Reusable Latent Building Blocks for Multi-Agent Systems Paper • 2602.03695 • Published 7 days ago • 1 • 1
Spider-Sense: Intrinsic Risk Sensing for Efficient Agent Defense with Hierarchical Adaptive Screening Paper • 2602.05386 • Published 5 days ago • 68 • 4
LightRAG: Simple and Fast Retrieval-Augmented Generation Paper • 2410.05779 • Published Oct 8, 2024 • 28 • 1
Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction Paper • 2404.02905 • Published Apr 3, 2024 • 74 • 4
Token Sparse Attention: Efficient Long-Context Inference with Interleaved Token Selection Paper • 2602.03216 • Published 7 days ago • 12 • 4
CodeOCR: On the Effectiveness of Vision Language Models in Code Understanding Paper • 2602.01785 • Published 8 days ago • 90 • 4
Vision-DeepResearch: Incentivizing DeepResearch Capability in Multimodal Large Language Models Paper • 2601.22060 • Published 12 days ago • 149 • 4
On the Theoretical Limitations of Embedding-Based Retrieval Paper • 2508.21038 • Published Aug 28, 2025 • 21 • 2
VisionTrim: Unified Vision Token Compression for Training-Free MLLM Acceleration Paper • 2601.22674 • Published 11 days ago • 5 • 3