arxiv:2410.01180
Kangda Wei
kangdawei
AI & ML interests
None yet
Organizations
models 50
kangdawei/DRA-GRPO-8B
8B • Updated • 2
kangdawei/DRA-GRPO-7B
Text Generation • 8B • Updated • 12
kangdawei/MMR-Sigmoid-DAPO-7B
Text Generation • 8B • Updated • 17 •
kangdawei/MMR-Sigmoid-DR-GRPO-8B
Text Generation • 8B • Updated • 3
kangdawei/MMR-Sigmoid-DAPO-8B
Text Generation • 8B • Updated • 174 •
kangdawei/MMR-Sigmoid-DAPO
Text Generation • 2B • Updated • 3 •
kangdawei/MMR-Sigmoid-GRPO-8B
Text Generation • 8B • Updated • 4 • 1
kangdawei/MMR-Sigmoid-GRPO-7B
Text Generation • 8B • Updated • 3
kangdawei/MMR-Sigmoid-DR-GRPO-7B
Text Generation • 8B • Updated • 5
kangdawei/DAPO-8B
Text Generation • 8B • Updated • 4 •
datasets 0
None public yet