ZhangXiaoyun
DadaCloud01
AI & ML interests
None yet
Recent Activity
upvoted a paper about 1 month ago
SCOPE: Signal-Calibrated On-Policy Distillation Enhancement with Dual-Path Adaptive Weighting
authored a paper about 2 months ago
Rediscovering Entropy Regularization: Adaptive Coefficient Unlocks Its Potential for LLM Reinforcement Learning
authored a paper about 2 months ago
Step 3.5 Flash: Open Frontier-Level Intelligence with 11B Active Parameters