ToolRL The ToolRL model trained for tool use through GRPO chengq9/ToolRL-Llama3.2-3B 4B • Updated Apr 22, 2025 • 13 chengq9/ToolRL-Qwen2.5-3B 3B • Updated Apr 22, 2025 • 1.77k • 2 chengq9/ToolRL-Qwen2.5-1.5B 2B • Updated Apr 22, 2025 • 15
ToolRL The ToolRL model trained for tool use through GRPO chengq9/ToolRL-Llama3.2-3B 4B • Updated Apr 22, 2025 • 13 chengq9/ToolRL-Qwen2.5-3B 3B • Updated Apr 22, 2025 • 1.77k • 2 chengq9/ToolRL-Qwen2.5-1.5B 2B • Updated Apr 22, 2025 • 15