lllqaq/R2EGym-32B-Agent-Coder-Instruct_merged_bucketab_4sources_20260228_101548_32768_8gpu Text Generation • 1.12M • Updated 5 days ago • 14
lllqaq/R2EGym-32B-Agent-Coder-Instruct_merged_bucketab_4sources_20260228_101548_32768_8gpu Text Generation • 1.12M • Updated 5 days ago • 14
lllqaq/R2EGym-7B-Agent-Coder-Instruct-merged_bucketab_4sources_20260228_101548_32768_3gpu_oomfix Text Generation • 333k • Updated 6 days ago • 38
lllqaq/R2EGym-7B-Agent-Coder-Instruct-merged_bucketab_4sources_20260228_101548_32768_3gpu_oomfix Text Generation • 333k • Updated 6 days ago • 38
lllqaq/R2EGym-14B-Agent-Coder-Instruct1-traj_reward1_loose_4sources_shuf42_ckpt2400 841k • Updated 9 days ago • 11
lllqaq/R2EGym-14B-Agent-Coder-Instruct1-traj_reward1_loose_4sources_shuf42_ckpt2400 841k • Updated 9 days ago • 11
lllqaq/R2EGym-14B-Agent-Coder-Instruct1-merged_bucketab_4sources_20260228_101548_32768_4gpu_oomfix Text Generation • 841k • Updated 10 days ago • 14
lllqaq/R2EGym-14B-Agent-Coder-Instruct1-merged_bucketab_4sources_20260228_101548_32768_4gpu_oomfix Text Generation • 841k • Updated 10 days ago • 14
lllqaq/R2EGym-14B-Agent-Coder-Instruct-traj_bucketAB_multi_3sources_bucketAB_sft_shuf42 Text Generation • 841k • Updated 10 days ago • 12
lllqaq/R2EGym-14B-Agent-Coder-Instruct-traj_bucketAB_multi_3sources_bucketAB_sft_shuf42 Text Generation • 841k • Updated 10 days ago • 12
lllqaq/R2EGym-32B-Agent-Coder-Instruct-fimMidPostV2-r2egym-32k-ckpt808 1.12M • Updated 15 days ago • 10
lllqaq/R2EGym-32B-Agent-Coder-Instruct-fimMidPostV2-r2egym-32k-ckpt808 1.12M • Updated 15 days ago • 10