LoopRPT: Reinforcement Pre-Training for Looped Language Models Paper • 2603.19714 • Published 13 days ago • 13
The End of Manual Decoding: Towards Truly End-to-End Language Models Paper • 2510.26697 • Published Oct 30, 2025 • 119
allenai/mid-training-OpenMathReasoning-rewrite-teacher-student-lecture-filtered Viewer • Updated Jul 6, 2025 • 291k • 18 • 3