Heterogeneous Agent Collaborative Reinforcement Learning Paper • 2603.02604 • Published 5 days ago • 138
huihui-ai/Huihui-Qwen3.5-27B-abliterated Image-Text-to-Text • 28B • Updated 5 days ago • 12k • 85
TeichAI/claude-4.5-opus-high-reasoning-250x Viewer • Updated Nov 28, 2025 • 250 • 5.81k • 315
view article Article OpenEnv in Practice: Evaluating Tool-Using Agents in Real-World Environments +3 24 days ago • 30
DavidAU/Mistral-Nemo-Inst-2407-12B-Thinking-Uncensored-HERETIC-HI-Claude-Opus Text Generation • 12B • Updated Jan 12 • 535 • 18
view article Article Red Teaming with RL: Exploiting Tinker API for Harmful RL on 235B Model Jan 1 • 18
AIRTBench: Measuring Autonomous AI Red Teaming Capabilities in Language Models Paper • 2506.14682 • Published Jun 17, 2025