Yury Panikov
panikov
AI & ML interests
None yet
Recent Activity
commented on a paper about 7 hours ago
Can RL Teach Long-Horizon Reasoning to LLMs? Expressiveness Is Key
commented on a paper about 7 hours ago
Reinforcement Learning for LLM-based Multi-Agent Systems through Orchestration Traces
commented on a paper about 7 hours ago
Generate, Filter, Control, Replay: A Comprehensive Survey of Rollout Strategies for LLM Reinforcement Learning
Organizations
None yet