LLM Training Datasets Collection A collection of datasets for training LLMs. • 127 items • Updated about 7 hours ago • 30
Papers Collection Large Language Model (LLM) and NLP related papers. • 347 items • Updated 4 days ago • 14
Building AI Coding Agents for the Terminal: Scaffolding, Harness, Context Engineering, and Lessons Learned Paper • 2603.05344 • Published 10 days ago • 6
Papers Collection Large Language Model (LLM) and NLP related papers. • 347 items • Updated 4 days ago • 14
Can Aha Moments Be Fake? Identifying True and Decorative Thinking Steps in Chain-of-Thought Paper • 2510.24941 • Published Oct 28, 2025 • 4
view article Article Ulysses Sequence Parallelism: Training with Million-Token Contexts 7 days ago • 20
view article Article NEO-unify: Building Native Multimodal Unified Models End to End 10 days ago • 93
Qwen3.5 Collection Qwen3.5 is Qwen's new model family including Qwen3.5 Small: 0.8B, 2B, 4B, 9B and Qwen3.5 Medium: 35B-A3B, 27B, 122B-A10B and 397B-A17B. • 25 items • Updated 4 days ago • 113
pplx-embed Collection Diffusion-Pretrained Dense and Contextual Embeddings • 7 items • Updated 17 days ago • 87
LLM Tools Collection A collection of tools as various HF Spaces on LLMs. • 142 items • Updated 13 days ago • 3
Running Featured 90 LFM2.5 1.2B Thinking WebGPU 💧 90 Run LFM2.5-1.2B-Thinking directly in your browser on WebGPU