Next Generation Internet Group

university

https://ging.github.io/

AI & ML interests

Evaluation of LLMs

Recent Activity

Arri98 authored a paper 7 days ago

The Generative Energy Arena (GEA): Incorporating Energy Awareness in Large Language Model (LLM) Human Evaluations

Arri98 authored a paper 7 days ago

Adding LLMs to the psycholinguistic norming toolbox: A practical guide to getting the most out of human ratings

Arri98 authored a paper 7 days ago

Why Do Large Language Models (LLMs) Struggle to Count Letters?

View all activity

GING-UPM 's datasets

None public yet