Yury Panikov
panikov
AI & ML interests
None yet
Recent Activity
commentedon a paper 41 minutes ago
Can RL Teach Long-Horizon Reasoning to LLMs? Expressiveness Is Key commentedon a paper 44 minutes ago
Reinforcement Learning for LLM-based Multi-Agent Systems through Orchestration Traces commentedon a paper about 1 hour ago
Generate, Filter, Control, Replay: A Comprehensive Survey of Rollout Strategies for LLM Reinforcement LearningOrganizations
None yet