inference-optimization/Qwen3-30B-A3B-Instruct-2507_5.5_bits_mode_noise • 22B • Updated about 1 hour ago
inference-optimization/Qwen3-30B-A3B-Instruct-2507_5.5_bits_mode_hybrid • 22B • Updated about 3 hours ago
inference-optimization/Qwen3-30B-A3B-Instruct-2507_5.0_bits_mode_heuristic • 20B • Updated about 4 hours ago
inference-optimization/Qwen3-30B-A3B-Instruct-2507_5.0_bits_mode_noise • 20B • Updated about 5 hours ago
inference-optimization/Qwen3-30B-A3B-Instruct-2507_5.0_bits_mode_hybrid • 20B • Updated about 10 hours ago • 10
inference-optimization/Qwen3-30B-from-Qwen3-235B_resps-speculators.eagle3-ckpt3 • 0.5B • Updated 3 days ago • 23
inference-optimization/gpt-oss-120b-from-qwen235b-ckpt3-speculator.eagle3 • 0.9B • Updated 3 days ago • 36
inference-optimization/gpt-oss-120b-from-qwen235b-ckpt1-speculator.eagle3 • 0.9B • Updated 3 days ago • 22
inference-optimization/gpt-oss-120b-from-qwen235b-ckpt0-speculator.eagle3 • 0.9B • Updated 3 days ago • 21
inference-optimization/Qwen3-30B-from-Qwen3-235B_resps-speculators.eagle3-ckpt2 • 0.5B • Updated 3 days ago • 21
inference-optimization/Qwen3-30B-from-Qwen3-235B_resps-speculators.eagle3-ckpt1 • 0.5B • Updated 3 days ago • 19