ISTA-DASLab/Qwen3-8B-FPQuant-RTN-MXFP4
Text Generation • 5B • Updated • 4
None defined yet.
MatGPTQ: Accurate and Efficient Post-Training Matryoshka Quantization
DASH: Faster Shampoo via Batched Block Preconditioning and Efficient Inverse-Root Solvers