Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
30
EvalEval Bot
EvalEvalBot
Follow
evijit's profile picture
1 follower
·
2 following
AI & ML interests
None yet
Recent Activity
new
activity
about 18 hours ago
evaleval/EEE_datastore:
Upload Theory of Mind
updated
a dataset
about 18 hours ago
evaleval/EEE_datastore
new
activity
about 21 hours ago
evaleval/EEE_datastore:
Upload Theory of Mind
View all activity
Organizations
EvalEvalBot
's activity
All
Models
Datasets
Spaces
Buckets
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
New activity in
evaleval/EEE_datastore
about 18 hours ago
Upload Theory of Mind
4
#53 opened about 19 hours ago by
SirGankalot
updated
a dataset
about 18 hours ago
evaleval/EEE_datastore
Viewer
•
Updated
about 18 hours ago
•
5.4k
•
8.43k
•
14
New activity in
evaleval/EEE_datastore
about 21 hours ago
Upload Theory of Mind
19
#38 opened 12 days ago by
SirGankalot
New activity in
evaleval/EEE_datastore
2 days ago
Upload 5 files
1
#52 opened 5 days ago by
lmushro
[ACL Shared Task] Add Wordle Arena & Fibble Arena evaluation results
10
#35 opened 19 days ago by
drchangliu
New activity in
evaleval/EEE_datastore
5 days ago
Upload 5 files
1
#51 opened 5 days ago by
lmushro
New activity in
evaleval/EEE_datastore
7 days ago
Upload 5 files
9
#34 opened 20 days ago by
lmushro
Parquet for dataset viewer
#49 opened 7 days ago by
EvalEvalBot
Update HELM Leaderboards
3
#45 opened 9 days ago by
Damian96
Parquet for dataset viewer
#48 opened 7 days ago by
EvalEvalBot
Parquet for dataset viewer
#44 opened 9 days ago by
EvalEvalBot
Parquet for dataset viewer
#47 opened 7 days ago by
EvalEvalBot
[Mercor] APEX eval results (apex-agents, ace, apex-v1)
5
#36 opened 19 days ago by
madhavan113
[Submission] Update Exgentic Open Agent Leaderboard results (fix duplicate agent names)
2
#46 opened 7 days ago by
Elron
New activity in
evaleval/EEE_datastore
9 days ago
Parquet for dataset viewer
#42 opened 9 days ago by
EvalEvalBot
Parquet for dataset viewer
#41 opened 9 days ago by
EvalEvalBot
New activity in
evaleval/EEE_datastore
10 days ago
[Submission] Terminal-Bench 2.0 leaderboard data (115 agent+model results)
4
#28 opened 20 days ago by
StevenDillmann
[Submission] Terminal-Bench 2.0 leaderboard data (schema v0.2.2, eval_library=harbor)
6
#37 opened 13 days ago by
StevenDillmann
New activity in
evaleval/EEE_datastore
11 days ago
Upload 5 files
9
#34 opened 20 days ago by
lmushro
[Submission] Terminal-Bench 2.0 leaderboard data (schema v0.2.2, eval_library=harbor)
6
#37 opened 13 days ago by
StevenDillmann
Load more