Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
4
Nguyễn Minh Phúc
DatPySci
Follow
Oztobuzz's profile picture
dark-pen's profile picture
2 followers
·
1 following
AI & ML interests
Reinforcement learning, NLP
Recent Activity
updated
a model
1 day ago
DatPySci/code_r1
published
a model
1 day ago
DatPySci/code_r1
updated
a model
3 days ago
DatPySci/RLVR-SGDM-Gap
View all activity
Organizations
DatPySci
's models
96
Sort: Recently updated
DatPySci/tiny-llama-kto-iter0-0.1-epoch1
Text Generation
•
1B
•
Updated
Mar 9, 2024
DatPySci/zephyr-7b-kto-iter0-0.2-epoch1
Updated
Mar 8, 2024
DatPySci/pythia-160m-sft-full
Updated
Mar 1, 2024
DatPySci/pythia-1b-kto-iter0
Text Generation
•
1B
•
Updated
Feb 27, 2024
•
3
DatPySci/pythia-1b-self-kto-iter1
Text Generation
•
1B
•
Updated
Feb 27, 2024
•
3
DatPySci/pythia-1b-self-kto-iter0
Text Generation
•
1B
•
Updated
Feb 26, 2024
•
4
Previous
1
2
3
4
Next