mssfj/Qwen2.5-7B-Instruct_grpo_alfworld_trajectory_dataset • Text Generation • 8B • Updated 31 minutes ago • 19
mssfj/Qwen2.5-7B-Instruct_sft_alfworld_trajectory_dataset_v5-15 • Text Generation • 8B • Updated 2 days ago
mssfj/Qwen2.5-7B-Instruct_sft_alfworld_trajectory_dataset_v5-14 • Text Generation • 8B • Updated 2 days ago
mssfj/Qwen2.5-7B-Instruct_sft_alfworld_trajectory_dataset_v5-13 • Text Generation • 8B • Updated 3 days ago
mssfj/Qwen2.5-7B-Instruct_sft_alfworld_trajectory_dataset_v5-12 • Text Generation • 8B • Updated 3 days ago
mssfj/Qwen2.5-7B-Instruct_sft_alfworld_trajectory_dataset_v5-11 • Text Generation • 8B • Updated 3 days ago
mssfj/Qwen2.5-7B-Instruct_sft_alfworld_trajectory_dataset_v5-10 • Text Generation • 8B • Updated 5 days ago
mssfj/Qwen2.5-7B-Instruct_sft_alfworld_trajectory_dataset_v5-9 • Text Generation • 8B • Updated 5 days ago
mssfj/Qwen2.5-7B-Instruct_sft_alfworld_trajectory_dataset_v5-8 • Text Generation • 8B • Updated 5 days ago
mssfj/Qwen2.5-7B-Instruct_sft_alfworld_trajectory_dataset_v5-7 • Text Generation • 8B • Updated 6 days ago
mssfj/Qwen2.5-7B-Instruct_sft_alfworld_trajectory_dataset_v5-6 • Text Generation • 8B • Updated 6 days ago
mssfj/Qwen2.5-7B-Instruct_sft_alfworld_trajectory_dataset_v5-5 • Text Generation • 8B • Updated 6 days ago
mssfj/Qwen2.5-7B-Instruct_sft_alfworld_trajectory_dataset_v5-4 • Text Generation • 8B • Updated 6 days ago
mssfj/Qwen2.5-7B-Instruct_sft_alfworld_trajectory_dataset_v5-3 • Text Generation • 8B • Updated 6 days ago
mssfj/Qwen2.5-7B-Instruct_sft_alfworld_trajectory_dataset_v5-2 • Text Generation • 8B • Updated 6 days ago
mssfj/Qwen2.5-7B-Instruct_sft_alfworld_trajectory_dataset_v5 • Text Generation • 8B • Updated 6 days ago
mssfj/Qwen3-4B_formatted-miromind-1000-sft-scot-grpo-1epoch • Text Generation • 4B • Updated Sep 1, 2025 • 2
mssfj/Qwen3-4B_formatted-miromind-1000-sft-prompt-grpo • Text Generation • 4B • Updated Aug 31, 2025 • 1