arxiv:2603.11178
XYX
xuyd16
AI & ML interests
None yet
Recent Activity
submitted
a paper
38 minutes ago
PACED: Distillation at the Frontier of Student Competence authored
a paper
about 4 hours ago
Overconfident Errors Need Stronger Correction: Asymmetric Confidence Penalties for Reinforcement Learning authored
a paper
about 4 hours ago
On-Policy Self-Distillation for Reasoning Compression Organizations
None yet