RLHFlow

university

AI & ML interests

Workflow of Reinforcement Learning from Human Feedback (RLHF). Blog: https://rlhflow.github.io/

Papers

Reinforce-Ada: An Adaptive Sampling Framework for Reinforce-Style LLM Training

View all Papers

RLHFlow 's Papers 1

Submitted by

Wei Xiong

Reinforce-Ada: An Adaptive Sampling Framework for Reinforce-Style LLM Training

RLHFlow