Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
In a Training Loop 🔄
sirynoma
uavleeva
Follow
0 followers
·
1 following
Suchotin
AI & ML interests
None yet
Recent Activity
updated
a collection
1 day ago
Multitask RLVR using GRPO (HSE Project)
updated
a collection
1 day ago
Multitask RLVR using GRPO (HSE Project)
updated
a collection
1 day ago
Multitask RLVR using GRPO (HSE Project)
View all activity
Organizations
uavleeva
's models
13
Sort:Â Recently updated
uavleeva/grpo_merged_math_sql_code_ties_001
Text Generation
•
Updated
1 day ago
•
6
uavleeva/grpo_mixed_run_002
Updated
1 day ago
uavleeva/grpo_sql_run_005
Updated
1 day ago
uavleeva/grpo_merged_math_sql_code_linear_001
Text Generation
•
Updated
1 day ago
uavleeva/grpo_code_run_002
Updated
1 day ago
uavleeva/grpo_mixed_run_004
Updated
1 day ago
uavleeva/grpo_math_run_level3_all_rewards_001
Updated
1 day ago
uavleeva/grpo_sql_run_002
Updated
1 day ago
uavleeva/grpo_sql_run_004
Updated
2 days ago
uavleeva/grpo_mixed_run_001
Updated
2 days ago
uavleeva/grpo_sudoku_run_003
Updated
3 days ago
uavleeva/grpo_math_run_level3_accformat_001
Updated
3 days ago
uavleeva/grpo_code_run_001
Updated
3 days ago