Cerebras REAP Collection Sparse MoE models compressed using the REAP (Router-weighted Expert Activation Pruning) method • 30 items • Updated 11 days ago • 129
Qwen3 DWQ Quants Collection High-quality 4-bit quants of the Qwen3 model family. • 8 items • Updated Jul 11, 2025 • 7
Qwen2.5-VL Collection Vision-language model series based on Qwen2.5 • 10 items • Updated 5 days ago • 557