Repo for paper Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability.
Qihan Ren
jasonrqh
AI & ML interests
XAI, LLM reasoning & safety, Coding agent
Recent Activity
upvoted a paper 39 minutes ago
The MiniMax-M2 Series: Mini Activations Unleashing Max Real-World Intelligence upvoted a paper 13 days ago
MMSkills: Towards Multimodal Skills for General Visual Agents