arxiv:2504.00891
Jiafei Lyu
dmux
AI & ML interests
Reinforcement Learning
Recent Activity
upvoted a paper about 15 hours ago
Exploratory Memory-Augmented LLM Agent via Hybrid On- and Off-Policy Optimization liked
a model 22 days ago
biang889/ProAct upvoted a paper 22 days ago
ProAct: Agentic Lookahead in Interactive Environments