arxiv:2306.03460
Apurva Gandhi
apurvaga
·
AI & ML interests
Agents, LLMs, Reinforcement Learning
Recent Activity
published
a model about 2 months ago
apurvaga/code-search-qwen-4b-distilled-from-14b-str-output updated
a model about 2 months ago
apurvaga/code-search-qwen-4b-distilled-from-14b-str-output upvoted a paper 5 months ago
Agent Data Protocol: Unifying Datasets for Diverse, Effective
Fine-tuning of LLM Agents