Next Generation Internet Group
university
https://ging.github.io/
Followers: 11
AI & ML interests
Evaluation of LLMs
Recent Activity
mariagrandury authored a paper 2 days ago: BabyBabelLM: A Multilingual Benchmark of Developmentally Plausible Training Data
mariagrandury authored a paper 2 days ago: Measuring what Matters: Construct Validity in Large Language Model Benchmarks
mariagrandury authored a paper 5 months ago: Adding LLMs to the psycholinguistic norming toolbox: A practical guide to getting the most out of human ratings
Team members: 7
GING-UPM's datasets
None public yet