In a Training Loop 🔄

sirynoma

uavleeva

AI & ML interests

None yet

uavleeva's models (13)

uavleeva/grpo_merged_math_sql_code_ties_001

Text Generation • Updated Feb 8

uavleeva/grpo_mixed_run_002

uavleeva/grpo_sql_run_005

uavleeva/grpo_merged_math_sql_code_linear_001

Text Generation • Updated Feb 8

uavleeva/grpo_code_run_002

uavleeva/grpo_mixed_run_004

uavleeva/grpo_math_run_level3_all_rewards_001

uavleeva/grpo_sql_run_002

uavleeva/grpo_sql_run_004

uavleeva/grpo_mixed_run_001

uavleeva/grpo_sudoku_run_003

uavleeva/grpo_math_run_level3_accformat_001

uavleeva/grpo_code_run_001