arxiv:2409.00492
Michael Goin
mgoin
AI & ML interests
LLM inference optimization, compression, quantization, pruning, distillation
Recent Activity
updated a model about 20 hours ago
mgoin/Qwen3-8B-speculator.dspark-reasoning published a model about 20 hours ago
mgoin/Qwen3-8B-speculator.dspark-reasoning updated a model 1 day ago
mgoin/Qwen3-8B-speculator.dflash