Songhao Wu
shwu
AI & ML interests
None yet
Recent Activity
upvoted a paper about 13 hours ago
DelTA: Discriminative Token Credit Assignment for Reinforcement Learning from Verifiable Rewards upvoted a paper 3 months ago
Learning beyond Teacher: Generalized On-Policy Distillation with Reward Extrapolation upvoted a paper 3 months ago
AgentCPM-Report: Interleaving Drafting and Deepening for Open-Ended Deep ResearchOrganizations
None yet