FIRST: Teach A Reliable Large Language Model Through Efficient Trustworthy Distillation Paper • 2408.12168 • Published Aug 22, 2024
ScaleBiO: Scalable Bilevel Optimization for LLM Data Reweighting Paper • 2406.19976 • Published Jun 28, 2024
R&D-Agent-Quant: A Multi-Agent Framework for Data-Centric Factors and Model Joint Optimization Paper • 2505.15155 • Published May 21, 2025
R&D-Agent: An LLM-Agent Framework Towards Autonomous Data Science Paper • 2505.14738 • Published May 20, 2025
EnvFactory: Scaling Tool-Use Agents via Executable Environments Synthesis and Robust RL Paper • 2605.18703 • Published 4 days ago