arxiv:2601.20209
Jinyang Wu
Jinyang23
AI & ML interests
large language models, reasoning, agentic rl
Recent Activity
upvoted
a
paper
about 20 hours ago
HER: Human-like Reasoning and Reinforcement Learning for LLM Role-playing
upvoted
a
paper
about 20 hours ago
TIDE: Trajectory-based Diagnostic Evaluation of Test-Time Improvement in LLM Agents
upvoted
a
paper
about 22 hours ago
SafeGround: Know When to Trust GUI Grounding Models via Uncertainty Calibration
Organizations
None yet