The Devil Behind Moltbook: Anthropic Safety is Always Vanishing in Self-Evolving AI Societies • Paper • 2602.09877 • Published 3 days ago • 164 • 2
A Survey of LLM-based Deep Search Agents: Paradigm, Optimization, Evaluation, and Challenges • Paper • 2508.05668 • Published Aug 3, 2025 • 1 • 1
Step 3.5 Flash: Open Frontier-Level Intelligence with 11B Active Parameters • Paper • 2602.10604 • Published 2 days ago • 168 • 5
Baichuan-M3: Modeling Clinical Inquiry for Reliable Medical Decision-Making • Paper • 2602.06570 • Published 7 days ago • 59 • 3
Canzona: A Unified, Asynchronous, and Load-Balanced Framework for Distributed Matrix-based Optimizers • Paper • 2602.06079 • Published 9 days ago • 18 • 3
On the Entropy Dynamics in Reinforcement Fine-Tuning of Large Language Models • Paper • 2602.03392 • Published 10 days ago • 52 • 7
CoPE: Clipped RoPE as A Scalable Free Lunch for Long Context LLMs • Paper • 2602.05258 • Published 8 days ago • 7 • 4
SocialVeil: Probing Social Intelligence of Language Agents under Communication Barriers • Paper • 2602.05115 • Published 9 days ago • 18 • 9
Privileged Information Distillation for Language Models • Paper • 2602.04942 • Published 9 days ago • 24 • 4
DFlash: Block Diffusion for Flash Speculative Decoding • Paper • 2602.06036 • Published 8 days ago • 41 • 2
Learning Rate Matters: Vanilla LoRA May Suffice for LLM Fine-tuning • Paper • 2602.04998 • Published 9 days ago • 6 • 5
MemSkill: Learning and Evolving Memory Skills for Self-Evolving Agents • Paper • 2602.02474 • Published 11 days ago • 53 • 4
Agent Primitives: Reusable Latent Building Blocks for Multi-Agent Systems • Paper • 2602.03695 • Published 10 days ago • 1 • 1
Spider-Sense: Intrinsic Risk Sensing for Efficient Agent Defense with Hierarchical Adaptive Screening • Paper • 2602.05386 • Published 8 days ago • 69 • 4
LightRAG: Simple and Fast Retrieval-Augmented Generation • Paper • 2410.05779 • Published Oct 8, 2024 • 28 • 1
Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction • Paper • 2404.02905 • Published Apr 3, 2024 • 74 • 4
Token Sparse Attention: Efficient Long-Context Inference with Interleaved Token Selection • Paper • 2602.03216 • Published 10 days ago • 12 • 4
CodeOCR: On the Effectiveness of Vision Language Models in Code Understanding • Paper • 2602.01785 • Published 11 days ago • 92 • 4
Vision-DeepResearch: Incentivizing DeepResearch Capability in Multimodal Large Language Models • Paper • 2601.22060 • Published 15 days ago • 150 • 4