arxiv:2504.07128
Dongchan Shin
ShinDC
AI & ML interests
NLP
Recent Activity
updated a dataset 2 days ago
ShinDC/mdaqa_corpus published a dataset 2 days ago
ShinDC/mdaqa_corpus upvoted a paper about 1 year ago
AgentRewardBench: Evaluating Automatic Evaluations of Web Agent
Trajectories