Tiny models used for testing
Inference Optimization
community
AI & ML interests
None defined yet.
Recent Activity
View all activity
-
inference-optimization/Qwen3-Next-80B-A3B-Instruct-MTP-ultrachat-epoch3
2B • Updated • 10 -
inference-optimization/Qwen3-Next-80B-A3B-Instruct-MTP-ultrachat-epoch1
2B • Updated • 9 -
inference-optimization/Qwen3-Next-80B-A3B-Instruct-MTP-ultrachat-epoch2
2B • Updated • 7 -
inference-optimization/Qwen3-Next-80B-A3B-Instruct-GSM8K-MTP-finetuned
81B • Updated • 5
Tiny models used for testing
-
inference-optimization/Qwen3-Next-80B-A3B-Instruct-MTP-ultrachat-epoch3
2B • Updated • 10 -
inference-optimization/Qwen3-Next-80B-A3B-Instruct-MTP-ultrachat-epoch1
2B • Updated • 9 -
inference-optimization/Qwen3-Next-80B-A3B-Instruct-MTP-ultrachat-epoch2
2B • Updated • 7 -
inference-optimization/Qwen3-Next-80B-A3B-Instruct-GSM8K-MTP-finetuned
81B • Updated • 5
models 180
inference-optimization/Qwen3-8B-speculator.dflash.swa.causal-qwen235b-instruct-bs16-muon-ckpt4
2B • Updated
inference-optimization/Qwen3-8B-FP8-Dynamic
Text Generation • 8B • Updated
inference-optimization/Qwen3-8B-speculator.dflash.fullattn-qwen235b-instruct-bs16-ckpt0
2B • Updated
inference-optimization/dflash-DeepSeek-V4-Flash-swa-muon-speculators-50k
2B • Updated • 121
inference-optimization/dflash-DeepSeek-V4-Flash-all-swa-muon-speculators-50k
2B • Updated • 94 • 1
inference-optimization/Qwen3.5-397B-A17B-FP8-dynamic-data-subset-speculator.dflash
2B • Updated • 16
inference-optimization/Qwen3-8B-from-Qwen3-8B_regen-speculators.eagle31-fcnorm-ckpt1
1B • Updated • 27
inference-optimization/Qwen3-8B-speculator.dflash.swa.causal-qwen235b-instruct-bs16-ckpt4
2B • Updated • 23
inference-optimization/Qwen3-8B-from-Qwen3-8B_regen-speculators.eagle31-qwen3arch-3e4-ckpt1
1B • Updated • 14
inference-optimization/Qwen3-8B-speculator.dflash.swa.causal-qwen235b-instruct-bs16-ckpt3
2B • Updated • 226
datasets 26
inference-optimization/every-eval-ever-demo
Viewer • Updated • 1 • 58
inference-optimization/DeepSeek-V4-Flash-responses
Viewer • Updated • 508k • 49
inference-optimization/Qwen3.5-4B-responses
Viewer • Updated • 7.47k • 100
inference-optimization/Qwen3.5-0.8B-responses
Viewer • Updated • 7.47k • 159
inference-optimization/Qwen3.5-9B-responses
Viewer • Updated • 7.67k • 67
inference-optimization/Qwen3-8B-Regenerated-Collection
Preview • Updated • 252
inference-optimization/Qwen3-30B-A3B-responses
Preview • Updated • 70
inference-optimization/gpt-oss-120b-responses
Preview • Updated • 20
inference-optimization/Qwen3-32B-responses
Preview • Updated • 54
inference-optimization/ctest-Qwen3.6-27B-speculator-dataset
Viewer • Updated • 5.61k • 51 • 1