AI & ML interests

We’re building a research coalition on evaluating evaluations (EvalEval)! Hosted by Hugging Face, University of Edinburgh, and EleutherAI.

Recent Activity

aris-hofmann  updated a dataset about 11 hours ago
evaleval/auto-benchmarkcards
evijit  updated a bucket about 16 hours ago
evaleval/general-eval-card-storage
deepmage121  updated a collection about 22 hours ago
To-be processed datasets
View all activity

Articles