Distilling the Essence: Efficient Reasoning Distillation via Sequence Truncation Paper • 2512.21002 • Published Dec 24, 2025
Scaling Down, Serving Fast: Compressing and Deploying Efficient LLMs for Recommendation Systems Paper • 2502.14305 • Published Oct 26, 2025