pinned
Running
28
Benchmark Finder
๐
A space to view and inspect all the tasks in lighteval
LLM evaluation
A space to view and inspect all the tasks in lighteval
Explore LLM benchmark trends over time
Explore and discover all leaderboards from the HF community
Display and inspect log files
Launch and monitor model evaluation jobs
Generate a command to run model evaluations
Compare tokenization lengths across languages