EvalEval Coalition

community

https://evalevalai.com/

evaluatingevals

AI & ML interests

We’re building a research coalition on evaluating evaluations (EvalEval), hosted by Hugging Face, the University of Edinburgh, and EleutherAI.

Recent Activity

aris-hofmann updated a dataset about 11 hours ago

evaleval/auto-benchmarkcards

evijit updated a bucket about 16 hours ago

evaleval/general-eval-card-storage

deepmage121 updated a collection about 22 hours ago

To-be processed datasets

Papers

Evaluation Cards: An Interpretive Layer for AI Evaluation Reporting

Articles

Introducing Evaluation Cards: A Live Interpretive Layer for Understanding the AI Evaluations Ecosystem

AI evals are becoming the new compute bottleneck

Paper submitted by taesiri: Evaluation Cards: An Interpretive Layer for AI Evaluation Reporting