Yu-Shiang Huang
hyusterr
ยท
AI & ML interests
NLP, IR, RecSys, FinTech
Recent Activity
upvoted a paper about 1 month ago
Less is More: Recursive Reasoning with Tiny Networks upvoted a collection about 1 month ago
Deepseek Papers upvoted a paper about 2 months ago
GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization Organizations
None yet