DCAgent2/DCAgent2_aider_polyglot_penfever_kimi-k2-swesmith_with_plain_docker-sandboxes-mb5c427b4 Viewer • Updated Jan 23 • 627 • 2
DCAgent2/DCAgent2_bfcl-parity_laion_Kimi-K2T-neulab-agenttuning-kg-sandboxes-maxeps-32k_d941d235 Viewer • Updated Feb 28 • 366 • 5
DCAgent2/DCAgent2_bfcl-parity_laion_Kimi-K2T-neulab-agenttuning-kg-sandboxes-maxeps-32k_fa4622ee Viewer • Updated Feb 26 • 369 • 6
DCAgent2/DCAgent2_bfcl-parity_laion_Kimi-K2T-neulab-agenttuning-mind2web-sandboxes-maxep3ae6067b Viewer • Updated Feb 26 • 368 • 4
DCAgent2/DCAgent2_bfcl-parity_laion_Kimi-K2T-neulab-agenttuning-webshop-sandboxes-maxepse369590c Viewer • Updated Feb 28 • 367 • 7
DCAgent2/DCAgent2_bfcl-parity_laion_Kimi-K2T-neulab-agenttuning-webshop-sandboxes-maxepsf03dce66 Viewer • Updated Feb 26 • 369 • 6
DCAgent2/DCAgent2_bfcl-parity_laion_glm46-neulab-agenttuning-alfworld-sandboxes-maxeps-131k Viewer • Updated Mar 6 • 7
DCAgent2/DCAgent2_bfcl-parity_laion_glm46-neulab-agenttuning-alfworld-sandboxes-maxeps-1d9196fed Viewer • Updated Mar 1 • 299 • 7
DCAgent2/DCAgent2_bfcl-parity_laion_kimi-k2-r2egym_sandboxes-maxeps-32k_20260227_205943 Viewer • Updated Feb 28 • 366 • 4
DCAgent2/DCAgent2_bfcl-parity_penfever_kimi-k2-swesmith_with_plain_docker-sandboxes-maxe30f4d3c8 Viewer • Updated Feb 27 • 368 • 3
DCAgent2/DCAgent2_bfcl-parity_penfever_kimi-k2-swesmith_with_plain_docker-sandboxes-maxe3e76f5f3 Viewer • Updated Feb 27 • 368 • 4
DCAgent2/DCAgent2_bfcl-parity_penfever_kimi-k2-swesmith_with_plain_docker-sandboxes-maxeb18299fa Viewer • Updated Feb 27 • 367 • 2
DCAgent2/DCAgent_dev_set_71_tasks_laion_glm46-neulab-agenttuning-alfworld-sandboxes-maxe1b0bc321 Updated Dec 19, 2025 • 2
DCAgent2/DCAgent_dev_set_71_tasks_laion_kimi-k2-r2egym_sandboxes-maxeps-32k_20251226_004652 Viewer • Updated Dec 26, 2025 • 2
DCAgent2/DCAgent_dev_set_71_tasks_penfever_kimi-k2-swesmith_with_plain_docker-sandboxes30912d06 Updated Dec 19, 2025 • 2
DCAgent2/DCAgent_dev_set_71_tasks_penfever_kimi-k2-swesmith_with_plain_docker-sandboxesc4fbd018 Updated Dec 19, 2025 • 2
DCAgent2/DCAgent_dev_set_v2_laion_glm46-neulab-agenttuning-alfworld-sandboxes-maxeps-131aaba1500 Viewer • Updated Mar 2 • 188 • 5
DCAgent2/DCAgent_dev_set_v2_laion_glm46-neulab-agenttuning-alfworld-sandboxes-maxeps-131k Viewer • Updated Mar 5 • 300 • 5
DCAgent2/DCAgent_dev_set_v2_laion_kimi-k2-r2egym_sandboxes-maxeps-32k Viewer • Updated Mar 6 • 300 • 4
DCAgent2/dev_set_71_tasks_Kimi_K2T_neulab_agenttuning_kg_sandboxes_maxeps_32k_20260224_204722 Viewer • Updated Feb 25 • 210 • 5
DCAgent2/dev_set_71_tasks_Kimi_K2T_neulab_agenttuning_mind2web_sandboxes_maxeps_32k_20264c003700 Viewer • Updated Feb 25 • 210 • 8
DCAgent2/dev_set_71_tasks_Kimi_K2T_neulab_agenttuning_webshop_sandboxes_maxeps_32k_20260aea47664 Viewer • Updated Feb 25 • 210 • 9
DCAgent2/dev_set_v2_Kimi_2_5_r2egym_sandboxes_maxeps_32k__Qwen3_8B_20260318_094506 Viewer • Updated Mar 19 • 300 • 3
DCAgent2/dev_set_v2_Kimi_K2T_neulab_agenttuning_kg_sandboxes_maxeps_32k_20260221_005345 Viewer • Updated Feb 21 • 297 • 4
DCAgent2/dev_set_v2_Kimi_K2T_neulab_agenttuning_mind2web_sandboxes_maxeps_32k_20260221_005343 Viewer • Updated Feb 21 • 297 • 7
DCAgent2/dev_set_v2_Kimi_K2T_neulab_agenttuning_webshop_sandboxes_maxeps_32k_20260221_005349 Viewer • Updated Feb 21 • 295 • 8
DCAgent2/dev_set_v2_kimi_k2_swesmith_with_plain_docker_sandboxes_maxeps_32k_20260227_230150 Viewer • Updated Feb 28 • 297 • 4
DCAgent2/dev_set_v2_sft__Kimi_2_5_inferredbugs_sandboxes_maxeps_32k__Qwen3_8B_20260330_012647 Viewer • Updated Mar 30 • 290 • 6
penfever/Kimi-K2T-neulab-agenttuning-kg-sandboxes-maxeps-32k_neulab-agenttuning-kg-sandboxes Viewer • Updated Jan 16 • 7.95k • 2
penfever/Kimi-K2T-neulab-agenttuning-mind2web-sandboxes-maxeps-32k_neulab-agenttuning-db-sandboxes Viewer • Updated Jan 16 • 15.8k • 11
penfever/Kimi-K2T-neulab-agenttuning-webshop-sandboxes-maxeps-32k Viewer • Updated Jan 19 • 5.72k • 3
penfever/glm46-neulab-agenttuning-alfworld-sandboxes-maxeps-131k Viewer • Updated Dec 17, 2025 • 4.84k • 7
DCAgent2/dev_set_v2_e1_embedding_d1_original_sandboxes_20260415_055339 Viewer • Updated Apr 15 • 287 • 3
centrepourlasecuriteia/content-moderation-output-dataset Viewer • Updated 14 days ago • 1.3k • 27 • 4
interstellarninja/interleaved_tool_use_execution_feedback Viewer • Updated Jul 2, 2025 • 3.34k • 13 • 2
SeppeV/mistral_instruct_ft_dpo_joke_outputs_with_tom_swiftie_prompt_and_context Viewer • Updated Oct 26, 2024 • 20 • 13
SeppeV/mistral_instruct_joke_outputs_with_tom_swiftie_prompt_and_context Viewer • Updated Oct 26, 2024 • 20 • 5
SeppeV/mistral_instruct_sft_dpo_joke_outputs_with_tom_swiftie_prompt_and_context Viewer • Updated Nov 2, 2024 • 20 • 5