Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
4
5
7
Hejian Sang
pb09204048
Follow
m0m0chen's profile picture
JasonZhu13's profile picture
ariG23498's profile picture
9 followers
ยท
6 following
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
5 days ago
On-Policy Self-Distillation for Reasoning Compression
submitted
a paper
5 days ago
On-Policy Self-Distillation for Reasoning Compression
authored
a paper
7 days ago
Overconfident Errors Need Stronger Correction: Asymmetric Confidence Penalties for Reinforcement Learning
View all activity
Organizations
Articles
1
Article
64
Unlocking Agentic RL Training for GPT-OSS: A Practical Retrospective
Papers
2
arxiv:
2602.21420
arxiv:
2510.00237
models
0
None public yet
datasets
0
None public yet