Collection of Quantized Models for MoE
Krishna Teja Chitty-Venkata
AI & ML interests
LLM Optimization, Neural Architecture Search, Quantization, Pruning
Recent Activity
updated
a model about 2 hours ago
inference-optimization/Llama-3.2-3B-Instruct_7_bits_mode_heuristic published
a model about 2 hours ago
inference-optimization/Llama-3.2-3B-Instruct_7_bits_mode_heuristic updated
a model about 2 hours ago
inference-optimization/Llama-3.2-3B-Instruct_7_bits_mode_noise