4 1656

Shaobai Jiang

shaobaij

AI & ML interests

None yet

Recent Activity

upvoted a paper about 8 hours ago

MEMENTO: Teaching LLMs to Manage Their Own Context

upvoted a paper about 17 hours ago

Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability

upvoted a paper about 17 hours ago

SkillClaw: Let Skills Evolve Collectively with Agentic Evolver

View all activity

Organizations

None yet

upvoted a paper about 8 hours ago

MEMENTO: Teaching LLMs to Manage Their Own Context

Paper • 2604.09852 • Published 11 days ago • 1

upvoted 5 papers about 17 hours ago

Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability

Paper • 2604.06628 • Published 13 days ago • 317

upvoted 14 papers about 20 hours ago

LatentUM: Unleashing the Potential of Interleaved Cross-Modal Reasoning via a Latent-Space Unified Model

Paper • 2604.02097 • Published 19 days ago • 32

LongCat-Next: Lexicalizing Modalities as Discrete Tokens

Paper • 2603.27538 • Published 22 days ago • 144

VGGRPO: Towards World-Consistent Video Generation with 4D Latent Reward

Paper • 2603.26599 • Published 24 days ago • 64

EgoSim: Egocentric World Simulator for Embodied Interaction Generation

Paper • 2604.01001 • Published 20 days ago • 38

Out of Sight but Not Out of Mind: Hybrid Memory for Dynamic Video World Models

Paper • 2603.25716 • Published 25 days ago • 156

Brevity Constraints Reverse Performance Hierarchies in Language Models

Paper • 2604.00025 • Published Mar 11 • 23

The Model Says Walk: How Surface Heuristics Override Implicit Constraints in LLM Reasoning

Paper • 2603.29025 • Published 21 days ago • 13

Reasoning Shift: How Context Silently Shortens LLM Reasoning

Paper • 2604.01161 • Published 19 days ago • 32

Proactive Agent Research Environment: Simulating Active Users to Evaluate Proactive Assistants

Paper • 2604.00842 • Published 19 days ago • 14

MiroEval: Benchmarking Multimodal Deep Research Agents in Process and Outcome

Paper • 2603.28407 • Published 21 days ago • 69

Terminal Agents Suffice for Enterprise Automation

Paper • 2604.00073 • Published 20 days ago • 96

Marco DeepResearch: Unlocking Efficient Deep Research Agents via Verification-Centric Design

Paper • 2603.28376 • Published 21 days ago • 23

The Latent Space: Foundation, Evolution, Mechanism, Ability, and Outlook

Paper • 2604.02029 • Published 19 days ago • 142

A Survey of On-Policy Distillation for Large Language Models

Paper • 2604.00626 • Published 19 days ago • 11

Shaobai Jiang

AI & ML interests

Recent Activity

Organizations

shaobaij's activity