arxiv:2403.13684
whj363636
whj363636
·
AI & ML interests
None yet
Recent Activity
upvoted a paper 3 days ago
Uni-OPD: Unifying On-Policy Distillation with a Dual-Perspective Recipe upvoted a paper about 1 month ago
LLaDA2.0-Uni: Unifying Multimodal Understanding and Generation with Diffusion Large Language Model upvoted a paper 2 months ago
MHPO: Modulated Hazard-aware Policy Optimization for Stable Reinforcement Learning