guirong chen
aaaGUI
AI & ML interests
None yet
Recent Activity
upvoted a paper about 1 month ago
Learning beyond Teacher: Generalized On-Policy Distillation with Reward Extrapolation upvoted a paper 5 months ago
LaSeR: Reinforcement Learning with Last-Token Self-Rewarding upvoted a paper 11 months ago
DeepCritic: Deliberate Critique with Large Language ModelsOrganizations
None yet