arxiv:2605.29447
Hao Jiang
Lutalica
AI & ML interests
Multimodal LLMs, LLM Reasoning, Reinforcement Learning, Efficient Inference
Recent Activity
authored a paper about 9 hours ago
Recovering Policy-Induced Errors: Benchmarking and Trajectory Synthesis for Robust GUI Agents
upvoted a paper about 16 hours ago
Pyramid Texture Filtering
authored a paper 4 days ago
D-CORE: Incentivizing Task Decomposition in Large Reasoning Models for Complex Tool Use