Scaling Properties of Continuous Diffusion Spoken Language Models Paper • 2604.24416 • Published 15 days ago • 1
Article easyaligner: Forced alignment of text and audio, made easy KBLab • 26 days ago • 5
SEC EDGAR Collection A collection of all major filings available through the SEC EDGAR database. • 11 items • Updated Apr 7 • 7
Article TRL v1.0: Post-Training Library Built to Move with the Field +2 qgallouedec, stevhliu, pcuenq, sergiopaniego • Mar 31 • 51
EdgeCrafter: Compact ViTs for Edge Dense Prediction via Task-Specialized Distillation Paper • 2603.18739 • Published Mar 19 • 11
Article Using Storage Buckets as a Working Layer for Data Pipelines davanstrien • Mar 26 • 3
ndl-core-collection Collection A collection of UK government structured datasets and textual sources for research, analysis, and AI applications. • 6 items • Updated Jan 12 • 3
Article Raw Robot Video to VLA-Ready Training Data: Annotating LeRobot Datasets with Nomadic and HuggingFace Buckets nomadicml • Mar 21 • 17
Article Introducing SPEED-Bench: A Unified and Diverse Benchmark for Speculative Decoding nvidia • Mar 19 • 47