Submitted by
Penghui Qi
Sea AI Lab
company
Verified
AI & ML interests
None defined yet.
Recent Activity
View all activity
Papers
Rethinking the Trust Region in LLM Reinforcement Learning
Revisiting Parameter Server in LLM Post-Training