Jinyang Wu
Jinyang23
11 followers · 8 following
https://orcid.org/my-orcid?orcid=0009-0006-0220-616X
jinyangwu
AI & ML interests
large language models, reasoning, agentic rl
Recent Activity
authored a paper about 5 hours ago: SKILL0: In-Context Agentic Reinforcement Learning for Skill Internalization
authored a paper about 5 hours ago: Self-Distilled Agentic Reinforcement Learning
authored a paper about 5 hours ago: OPID: On-Policy Skill Distillation for Agentic Reinforcement Learning
Organizations
None yet
Jinyang23's models (5)
Jinyang23/OPID-ALFWorld-1.7B • Reinforcement Learning • 2B • Updated 1 day ago • 31 • 1
Jinyang23/Maestro-4B • 5B • Updated May 22 • 5
Jinyang23/Spark-1.5B-ScienceWorld • Reinforcement Learning • 2B • Updated Jan 30 • 9
Jinyang23/Spark-1.5B-WebShop • Reinforcement Learning • 2B • Updated Jan 30 • 2
Jinyang23/Spark-1.5B-ALFWorld • Reinforcement Learning • 2B • Updated Jan 30 • 4