UnifiedReward Flex
Collection
8 items
β’
Updated
β’
3
UnifiedReward-Flex-qwen3vl-32b is a unified personalized reward model for vision generation that couples reward modeling with flexible and context-adaptive reasoning!!
π The inference code is available at Github.
For further details, please refer to the following resources:
@article{unifiedreward-flex,
title={Unified Personalized Reward Model for Vision Generation},
author={Wang, Yibin and Zang, Yuhang and Han, Feng and Zhou, Yujie and Bu, Jiazi and Jin, Cheng and Wang, Jiaqi},
journal={arXiv preprint arXiv:2602.02380},
year={2026}
}