Hugging Face
Mostafa Elhoushi
melhoushi
39 followers · 8 following
m_elhoushi
mostafaelhoushi
mostafaelhoushi
AI & ML interests
Make ML faster, smaller, smarter.
Recent Activity
updated a model 22 days ago: melhoushi/JacobiForcing_Code_10k
updated a model about 1 month ago: melhoushi/gpt_cp_h1152_d23_gbs76_tpp20.0_lp0.4_linear_linear_reverse
published a model about 1 month ago: melhoushi/gpt_cp_h1152_d23_gbs76_tpp20.0_lp0.4_linear_linear_reverse
Organizations
Articles
1
Article
66
Faster Text Generation with Self-Speculative Decoding
Papers
11
arxiv:2507.04610
arxiv:2506.00204
arxiv:2505.20309
arxiv:2410.00215
models
151
melhoushi/gpt_cp_h1152_d23_gbs76_tpp20.0_lp0.4_linear_linear_reverse • Updated Jun 1 • 95
melhoushi/common_pile_h1152_d23_gbs76_tpp20.0_lp0.4_linear_linear_reverse • Updated Jun 1 • 2
melhoushi/gpt_cp_h1152_d23_gbs76_tpp20.0_lp0.4_linear_null • Updated Jun 1 • 92
melhoushi/common_pile_h1152_d23_gbs76_tpp20.0_lp0.4_linear_null • Updated Jun 1 • 2
melhoushi/gpt_cp_h896_d17_gbs66_tpp20.0_lp0.4_linear_null • Updated May 31 • 95
melhoushi/common_pile_h896_d17_gbs66_tpp20.0_lp0.4_linear_null • Updated May 31
melhoushi/gpt_cp_h640_d13_gbs48_tpp20.0_lp0.4_linear_null • Updated May 31 • 92
melhoushi/common_pile_h640_d13_gbs48_tpp20.0_lp0.4_linear_null • Updated May 31 • 1
melhoushi/gpt_cp_h640_d13_gbs48_tpp20.0_lp0.4_linear_linear_reverse • Updated May 31 • 92
melhoushi/common_pile_h640_d13_gbs48_tpp20.0_lp0.4_linear_linear_reverse • Updated May 31 • 1
datasets
138
melhoushi/OpenThoughts3_science_qwen7binst_traj_n16w16_2048 • Viewer • Updated May 3 • 4.5M • 13
melhoushi/OpenThoughts3_math_qwen7binst_traj_n16w16_2048 • Viewer • Updated May 3 • 4.05M • 15
melhoushi/OpenThoughts3_code_qwen7binst_traj_n16w16_2048 • Viewer • Updated May 3 • 9.63M • 18
melhoushi/OpenThoughts3_science_qwen7binst_sft_2048 • Viewer • Updated May 2 • 91.8k • 13
melhoushi/OpenThoughts3_math_qwen7binst_sft_2048 • Viewer • Updated May 2 • 88.3k • 9
melhoushi/OpenThoughts3_code_qwen7binst_sft_2048 • Viewer • Updated May 2 • 200k • 10
melhoushi/OpenThoughts3_code_qwen_sft_2048 • Viewer • Updated Apr 23 • 212k • 31 • 1
melhoushi/OpenThoughts3_math_qwen_sft_2048 • Viewer • Updated Apr 23 • 79.5k • 12
melhoushi/OpenThoughts3_math_qwen_sft • Viewer • Updated Apr 22 • 80.7k • 12
melhoushi/baseline_gptq_dataset_codesmath • Viewer • Updated Nov 24, 2025 • 1.02k • 22