Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
34
5
1
Mostafa Elhoushi
melhoushi
Follow
clem's profile picture
Mahfuz33's profile picture
VisakhJayakumar's profile picture
39 followers
·
8 following
m_elhoushi
mostafaelhoushi
mostafaelhoushi
AI & ML interests
Make ML faster, smaller, smarter.
Recent Activity
updated
a model
6 days ago
melhoushi/gpt_cp_h1152_d23_gbs76_tpp20.0_lp0.4_linear_linear_reverse
published
a model
6 days ago
melhoushi/gpt_cp_h1152_d23_gbs76_tpp20.0_lp0.4_linear_linear_reverse
updated
a model
6 days ago
melhoushi/common_pile_h1152_d23_gbs76_tpp20.0_lp0.4_linear_linear_reverse
View all activity
Organizations
Articles
1
Article
65
Faster Text Generation with Self-Speculative Decoding
Papers
11
arxiv:
2507.04610
arxiv:
2506.00204
arxiv:
2505.20309
arxiv:
2410.00215
View 11 papers
models
151
Sort: Recently updated
melhoushi/gpt_cp_h1152_d23_gbs76_tpp20.0_lp0.4_linear_linear_reverse
Updated
6 days ago
•
124
melhoushi/common_pile_h1152_d23_gbs76_tpp20.0_lp0.4_linear_linear_reverse
Updated
6 days ago
•
32
melhoushi/gpt_cp_h1152_d23_gbs76_tpp20.0_lp0.4_linear_null
Updated
6 days ago
•
119
melhoushi/common_pile_h1152_d23_gbs76_tpp20.0_lp0.4_linear_null
Updated
6 days ago
•
27
melhoushi/gpt_cp_h896_d17_gbs66_tpp20.0_lp0.4_linear_null
Updated
6 days ago
•
127
melhoushi/common_pile_h896_d17_gbs66_tpp20.0_lp0.4_linear_null
Updated
6 days ago
•
33
melhoushi/gpt_cp_h640_d13_gbs48_tpp20.0_lp0.4_linear_null
Updated
6 days ago
•
122
melhoushi/common_pile_h640_d13_gbs48_tpp20.0_lp0.4_linear_null
Updated
6 days ago
•
38
melhoushi/gpt_cp_h640_d13_gbs48_tpp20.0_lp0.4_linear_linear_reverse
Updated
6 days ago
•
125
melhoushi/common_pile_h640_d13_gbs48_tpp20.0_lp0.4_linear_linear_reverse
Updated
6 days ago
•
42
View 151 models
datasets
138
Sort: Recently updated
melhoushi/OpenThoughts3_science_qwen7binst_traj_n16w16_2048
Viewer
•
Updated
May 3
•
4.5M
•
55
melhoushi/OpenThoughts3_math_qwen7binst_traj_n16w16_2048
Viewer
•
Updated
May 3
•
4.05M
•
35
melhoushi/OpenThoughts3_code_qwen7binst_traj_n16w16_2048
Viewer
•
Updated
May 3
•
9.63M
•
52
melhoushi/OpenThoughts3_science_qwen7binst_sft_2048
Viewer
•
Updated
May 2
•
91.8k
•
30
melhoushi/OpenThoughts3_math_qwen7binst_sft_2048
Viewer
•
Updated
May 2
•
88.3k
•
16
melhoushi/OpenThoughts3_code_qwen7binst_sft_2048
Viewer
•
Updated
May 2
•
200k
•
47
melhoushi/OpenThoughts3_code_qwen_sft_2048
Viewer
•
Updated
Apr 23
•
212k
•
40
•
1
melhoushi/OpenThoughts3_math_qwen_sft_2048
Viewer
•
Updated
Apr 23
•
79.5k
•
17
melhoushi/OpenThoughts3_math_qwen_sft
Viewer
•
Updated
Apr 22
•
80.7k
•
202
melhoushi/baseline_gptq_dataset_codesmath
Viewer
•
Updated
Nov 24, 2025
•
1.02k
•
16
View 138 datasets