Yasmin Moslem PRO
ymoslem
AI & ML interests
Machine Translation, Speech Translation, Large Language Models, Natural Language Processing
Recent Activity
updated a model about 3 hours ago: AfriNLP/AfriNLLB-12enc-12dec-full-ft
upvoted a collection about 5 hours ago: AfriNLLB
updated a collection 1 day ago: AfriNLLB
MT Quality Estimation
Models for reference-free quality estimation of machine translation
- ymoslem/ModernBERT-base-long-context-qe-v1
  Text Classification • 0.1B • Updated • 11 • 5
- ymoslem/ModernBERT-large-qe-v1
  Text Classification • 0.4B • Updated • 3 • 2
- ymoslem/xlm-roberta-large-qe-v1
  Text Classification • 0.6B • Updated • 5 • 1
- ymoslem/ModernBERT-large-qe-maxlen512-v1
  Text Classification • 0.4B • Updated • 5 • 1
WMT-Model-Compression
- Iterative Layer Pruning for Efficient Translation Inference
  Paper • 2510.22763 • Published
- ymoslem/wmt25-ces-deu-24layers-2e-5lr-news-commentary
  Text Generation • 6B • Updated • 1
- ymoslem/wmt25-ces-deu-20layers-2e-5lr-news-commentary
  Text Generation • 5B • Updated • 3
- ymoslem/wmt25-ces-deu-16layers-2e-5lr-news-commentary
  Text Generation • 5B • Updated • 1
Models (69)
- ymoslem/wmt25-eng-arz-16layers-2e-5lr-news-commentary
  Text Generation • 5B • Updated • 4
- ymoslem/wmt25-eng-arz-20layers-2e-5lr-news-commentary
  Text Generation • 5B • Updated • 4
- ymoslem/wmt25-eng-arz-24layers-2e-5lr-news-commentary
  Text Generation • 6B • Updated • 3
- ymoslem/aya-expanse-8b-eng-arz-16layers
  Text Generation • 5B • Updated • 2
- ymoslem/aya-expanse-8b-eng-arz-20layers
  Text Generation • 5B • Updated • 1
- ymoslem/aya-expanse-8b-eng-arz-24layers
  Text Generation • 6B • Updated • 3
- ymoslem/aya-expanse-8b-20layers-cs-de-iter
  Text Generation • 5B • Updated • 1
- ymoslem/wmt25-ces-deu-16layers-2e-5lr-news-commentary
  Text Generation • 5B • Updated • 1
- ymoslem/wmt25-ces-deu-20layers-2e-5lr-news-commentary
  Text Generation • 5B • Updated • 3
- ymoslem/wmt25-ces-deu-24layers-2e-5lr-news-commentary
  Text Generation • 6B • Updated • 1
Datasets (38)
- ymoslem/flores-test-pruning
  Viewer • Updated • 1.1k • 4
- ymoslem/TeleQnA-processed
  Viewer • Updated • 10k • 310
- ymoslem/news-commentary-eng-arz
  Viewer • Updated • 83.7k • 134
- ymoslem/Anhui-Telecom-QA
  Viewer • Updated • 157k • 4 • 2
- ymoslem/Law-StackExchange
  Viewer • Updated • 24.4k • 394 • 31
- ymoslem/IWSLT2025-Test
  Viewer • Updated • 772 • 17
- ymoslem/news-commentary-en-ar
  Viewer • Updated • 84.3k • 7 • 1
- ymoslem/news-commentary-cs-de
  Viewer • Updated • 201k • 33
- ymoslem/paragraph-cs-de-src-50k
  Viewer • Updated • 44.1k • 3
- ymoslem/paragraph-cs-de-src-tgt-50k
  Viewer • Updated • 44.6k • 2