EpochX: Building the Infrastructure for an Emergent Agent Civilization Paper • 2603.27304 • Published 4 days ago • 40
Weak-Driven Learning: How Weak Agents make Strong Agents Stronger Paper • 2602.08222 • Published Feb 9 • 285
Thinking to Recall: How Reasoning Unlocks Parametric Knowledge in LLMs Paper • 2603.09906 • Published 21 days ago • 75
Heterogeneous Agent Collaborative Reinforcement Learning Paper • 2603.02604 • Published 29 days ago • 191
Beyond Language Modeling: An Exploration of Multimodal Pretraining Paper • 2603.03276 • Published 28 days ago • 102
Intern-S1-Pro: Scientific Multimodal Foundation Model at Trillion Scale Paper • 2603.25040 • Published 6 days ago • 122
Parakeet Collection NeMo Parakeet ASR Models attain strong speech recognition accuracy while being efficient for inference. Available in CTC and RNN-Transducer variants. • 16 items • Updated about 9 hours ago • 61
Scaling Open Discrete Audio Foundation Models with Interleaved Semantic, Acoustic, and Text Tokens Paper • 2602.16687 • Published Feb 18 • 5
LLaMA-Omni2: LLM-based Real-time Spoken Chatbot with Autoregressive Streaming Speech Synthesis Paper • 2505.02625 • Published May 5, 2025 • 23
AzeroS: Extending LLM to Speech with Self-Generated Instruction-Free Tuning Paper • 2601.06086 • Published Dec 31, 2025 • 1
TASTE: Text-Aligned Speech Tokenization and Embedding for Spoken Language Modeling Paper • 2504.07053 • Published Apr 9, 2025 • 6