Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
4
9
Yuezhou Hu
yuezhouhu
Follow
iAwAiL's profile picture
foreverpiano's profile picture
Nietzsche6700's profile picture
4 followers
·
3 following
https://yuezhouhu.github.io/
yuezhouhu
yuezhouhu
AI & ML interests
My research interests include efficient machine learning, particularly efficient training and inference.
Recent Activity
authored
a paper
2 days ago
SpargeAttention2: Trainable Sparse Attention via Hybrid Top-k+Top-p Masking and Distillation Fine-Tuning
upvoted
a
paper
3 days ago
SpargeAttention2: Trainable Sparse Attention via Hybrid Top-k+Top-p Masking and Distillation Fine-Tuning
submitted
a paper
18 days ago
Residual Context Diffusion Language Models
View all activity
Organizations
yuezhouhu
's models
12
Sort: Recently updated
yuezhouhu/RCD-LLaDA-8B-Instruct
1B
•
Updated
20 days ago
•
23
yuezhouhu/SeqD-LLaDA-8B-Instruct
1B
•
Updated
20 days ago
•
16
yuezhouhu/RCD-SDAR-8B-b64-Thinking
8B
•
Updated
23 days ago
•
12
yuezhouhu/RCD-SDAR-8B-b32-Thinking
8B
•
Updated
23 days ago
•
7
yuezhouhu/RCD-SDAR-4B-b64-Thinking
4B
•
Updated
23 days ago
•
26
yuezhouhu/RCD-SDAR-4B-b32-Thinking
4B
•
Updated
23 days ago
•
31
yuezhouhu/SeqD-SDAR-8B-b64-Thinking
8B
•
Updated
23 days ago
•
15
yuezhouhu/SeqD-SDAR-8B-b32-Thinking
8B
•
Updated
23 days ago
•
13
yuezhouhu/SeqD-SDAR-4B-b64-Thinking
4B
•
Updated
23 days ago
•
34
yuezhouhu/SeqD-SDAR-4B-b32-Thinking
4B
•
Updated
23 days ago
•
10
yuezhouhu/SeqD-SDAR-1.7B-b64-Thinking
2B
•
Updated
23 days ago
•
40
yuezhouhu/SeqD-SDAR-1.7B-b32-Thinking
2B
•
Updated
23 days ago
•
100