GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization Paper • 2601.05242 • Published Jan 8 • 225 • 9
Running on Zero Featured 2.07k PuLID-FLUX 🤗 2.07k Generate customized images from text and reference photo
Running on Zero 1.66k Flux.1-dev Upscaler 🔎 1.66k Upscale low‑resolution images to high‑resolution with AI