arxiv:2510.04550
Pengfei He
bigboss24
AI & ML interests
Trustworthy
Recent Activity
authored
a paper
11 days ago
TRAJECT-Bench:A Trajectory-Aware Benchmark for Evaluating Agentic Tool
Use
upvoted
a
paper
12 days ago
TRAJECT-Bench:A Trajectory-Aware Benchmark for Evaluating Agentic Tool
Use
upvoted
a
paper
12 days ago
Co-RedTeam: Orchestrated Security Discovery and Exploitation with LLM Agents
Organizations
None yet