Agent - a Chevolier Collection

Chevolier 's Collections

Self-Improving AI

Image Generation

Video Generation

Agent

updated about 17 hours ago

Agent Learning via Early Experience

Paper • 2510.08558 • Published Oct 9, 2025 • 276
Learning on the Job: An Experience-Driven Self-Evolving Agent for Long-Horizon Tasks

Paper • 2510.08002 • Published Oct 9, 2025 • 24
Self-Improving LLM Agents at Test-Time

Paper • 2510.07841 • Published Oct 9, 2025 • 10
The Denario project: Deep knowledge AI agents for scientific discovery

Paper • 2510.26887 • Published Oct 30, 2025 • 8
The Landscape of Agentic Reinforcement Learning for LLMs: A Survey

Paper • 2509.02547 • Published Sep 2, 2025 • 238
WebWeaver: Structuring Web-Scale Evidence with Dynamic Outlines for Open-Ended Deep Research

Paper • 2509.13312 • Published Sep 16, 2025 • 106
LIMI: Less is More for Agency

Paper • 2509.17567 • Published Sep 22, 2025 • 104
SimpleTIR: End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning

Paper • 2509.02479 • Published Sep 2, 2025 • 84
VerlTool: Towards Holistic Agentic Reinforcement Learning with Tool Use

Paper • 2509.01055 • Published Sep 1, 2025 • 81
Agent0: Unleashing Self-Evolving Agents from Zero Data via Tool-Integrated Reasoning

Paper • 2511.16043 • Published Nov 20, 2025 • 110
Scaling Agent Learning via Experience Synthesis

Paper • 2511.03773 • Published Nov 5, 2025 • 83
From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence

Paper • 2511.18538 • Published Nov 23, 2025 • 304
QuantaAlpha: An Evolutionary Framework for LLM-Driven Alpha Mining

Paper • 2602.07085 • Published Feb 6 • 190
GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning

Paper • 2604.02721 • Published Apr 3 • 628
SkillClaw: Let Skills Evolve Collectively with Agentic Evolver

Paper • 2604.08377 • Published Apr 9 • 290
Claw-Eval: Toward Trustworthy Evaluation of Autonomous Agents

Paper • 2604.06132 • Published Apr 7 • 121
CORAL: Towards Autonomous Multi-Agent Evolution for Open-Ended Discovery

Paper • 2604.01658 • Published Apr 2 • 56
Neural Computers

Paper • 2604.06425 • Published Apr 7 • 31
GLM-5V-Turbo: Toward a Native Foundation Model for Multimodal Agents

Paper • 2604.26752 • Published 20 days ago • 106
Heterogeneous Scientific Foundation Model Collaboration

Paper • 2604.27351 • Published 19 days ago • 215
FAMA: Failure-Aware Meta-Agentic Framework for Open-Source LLMs in Interactive Tool Use Environments

Paper • 2604.25135 • Published 21 days ago • 12
The Last Harness You'll Ever Build

Paper • 2604.21003 • Published 27 days ago • 5
Agentic World Modeling: Foundations, Capabilities, Laws, and Beyond

Paper • 2604.22748 • Published 25 days ago • 226
Recursive Multi-Agent Systems

Paper • 2604.25917 • Published 21 days ago • 268
From Skills to Talent: Organising Heterogeneous Agents as a Real-World Company

Paper • 2604.22446 • Published 25 days ago • 121
The Last Human-Written Paper: Agent-Native Research Artifacts

Paper • 2604.24658 • Published 20 days ago • 20
SkillOS: Learning Skill Curation for Self-Evolving Agents

Paper • 2605.06614 • Published 12 days ago • 44
Auto Research with Specialist Agents Develops Effective and Non-Trivial Training Recipes

Paper • 2605.05724 • Published 12 days ago • 15
A^2TGPO: Agentic Turn-Group Policy Optimization with Adaptive Turn-level Clipping

Paper • 2605.06200 • Published 12 days ago • 14
Self-Distilled Agentic Reinforcement Learning

Paper • 2605.15155 • Published 5 days ago • 94
STALE: Can LLM Agents Know When Their Memories Are No Longer Valid?

Paper • 2605.06527 • Published 12 days ago • 42
ATLAS: Agentic or Latent Visual Reasoning? One Word is Enough for Both

Paper • 2605.15198 • Published 5 days ago • 18
Orchard: An Open-Source Agentic Modeling Framework

Paper • 2605.15040 • Published 5 days ago • 16
Dynamic Latent Routing

Paper • 2605.14323 • Published 5 days ago • 4