Macaron-A2UI: A Model for Generative UI in Personal Agents Paper • 2605.24830 • Published 10 days ago • 80
SOD: Step-wise On-policy Distillation for Small Language Model Agents Paper • 2605.07725 • Published 26 days ago • 25
DelTA: Discriminative Token Credit Assignment for Reinforcement Learning from Verifiable Rewards Paper • 2605.21467 • Published 14 days ago • 204
Babsie/Qwen3.6-27B-Heretic2-Uncensored-Finetune-Thinking Image-Text-to-Text • 27B • Updated 13 days ago • 38 • 2
CiteVQA: Benchmarking Evidence Attribution for Trustworthy Document Intelligence Paper • 2605.12882 • Published 21 days ago • 270
Squeez: Task-Conditioned Tool-Output Pruning for Coding Agents Paper • 2604.04979 • Published Apr 4 • 10
When Numbers Speak: Aligning Textual Numerals and Visual Instances in Text-to-Video Diffusion Models Paper • 2604.08546 • Published Apr 9 • 115