MOOSE-Star: Unlocking Tractable Training for Scientific Discovery by Breaking the Complexity Barrier Paper • 2603.03756 • Published 3 days ago • 74
Understanding and Leveraging the Expert Specialization of Context Faithfulness in Mixture-of-Experts LLMs Paper • 2508.19594 • Published Aug 27, 2025 • 3
RuleReasoner: Reinforced Rule-based Reasoning via Domain-aware Dynamic Sampling Paper • 2506.08672 • Published Jun 10, 2025 • 30 • 4
LM-Lexicon: Improving Definition Modeling via Harmonizing Semantic Experts Paper • 2602.14060 • Published 20 days ago • 2
LM-Lexicon: Improving Definition Modeling via Harmonizing Semantic Experts Paper • 2602.14060 • Published 20 days ago • 2 • 3
LM-Lexicon: Improving Definition Modeling via Harmonizing Semantic Experts Paper • 2602.14060 • Published 20 days ago • 2
LM-Lexicon: Improving Definition Modeling via Harmonizing Semantic Experts Paper • 2602.14060 • Published 20 days ago • 2
TongSIM: A General Platform for Simulating Intelligent Machines Paper • 2512.20206 • Published Dec 23, 2025 • 28
Native Parallel Reasoner: Reasoning in Parallelism via Self-Distilled Reinforcement Learning Paper • 2512.07461 • Published Dec 8, 2025 • 78 • 4
ReflectEvo: Improving Meta Introspection of Small LLMs by Learning Self-Reflection Paper • 2505.16475 • Published May 22, 2025 • 3
RuleReasoner: Reinforced Rule-based Reasoning via Domain-aware Dynamic Sampling Paper • 2506.08672 • Published Jun 10, 2025 • 30