floyed shen
floyed
AI & ML interests
None yet
Recent Activity
commented on a paper about 16 hours ago
VESPO: Variational Sequence-Level Soft Policy Optimization for Stable Off-Policy LLM Training
submitted a paper 1 day ago
VESPO: Variational Sequence-Level Soft Policy Optimization for Stable Off-Policy LLM Training
upvoted a paper 1 day ago
DeepEyesV2: Toward Agentic Multimodal Model