FastKernels: Benchmarking GPU Kernel Generation in Production Paper • 2605.23215 • Published 12 days ago • 8
Article Context Engineering & Reuse Pattern Under the Hood of Claude Code kobe0938 • Dec 22, 2025 • 7
Switch-Transformers release Collection This release included various MoE (Mixture of Experts) models based on the T5 architecture. The base models use between 8 and 256 experts. • 9 items • Updated Mar 12 • 19