Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
Antieval
non-profit
Activity Feed
Follow
4
AI & ML interests
None defined yet.
Recent Activity
evalevanto
updated
a dataset
1 day ago
antieval/repro
rb
updated
a dataset
6 days ago
antieval/generator_confound_capped
evalevanto
published
a dataset
7 days ago
antieval/repro
View all activity
Team members
4
antieval
's activity
All
Models
Datasets
Spaces
Buckets
Papers
Collections
Community
Posts
Articles
evalevanto
updated
a dataset
1 day ago
antieval/repro
Updated
1 day ago
•
130
rb
updated
a dataset
6 days ago
antieval/generator_confound_capped
Updated
13 days ago
•
369
evalevanto
published
a dataset
7 days ago
antieval/repro
Updated
1 day ago
•
130
rb
updated
a dataset
8 days ago
antieval/repro_deploy
Updated
8 days ago
•
197
rb
in
antieval/repro
9 days ago
Add dataclaw deployment dataset (diverse 40 per model with tools)
1
#1 opened 9 days ago by
rb
rb
published
a dataset
9 days ago
antieval/repro_deploy
Updated
8 days ago
•
197
rb
published
a dataset
13 days ago
antieval/generator_confound_capped
Updated
13 days ago
•
369
rb
updated
a dataset
15 days ago
antieval/generator_confound
Updated
15 days ago
•
107
rb
published
a dataset
23 days ago
antieval/generator_confound
Updated
15 days ago
•
107
rb
updated
a dataset
26 days ago
antieval/frontier_sweep_evals
Updated
26 days ago
•
94
rb
published
a dataset
28 days ago
antieval/frontier_sweep_evals
Updated
26 days ago
•
94
rb
updated
a dataset
29 days ago
antieval/swebench-trajectories
Viewer
•
Updated
29 days ago
•
200
•
24
rb
published
a dataset
29 days ago
antieval/swebench-trajectories
Viewer
•
Updated
29 days ago
•
200
•
24
rb
updated
a dataset
29 days ago
antieval/cybench-trajectories
Viewer
•
Updated
29 days ago
•
190
•
17
rb
published
a dataset
29 days ago
antieval/cybench-trajectories
Viewer
•
Updated
29 days ago
•
190
•
17
rb
updated
a dataset
29 days ago
antieval/agentharm-trajectories
Viewer
•
Updated
29 days ago
•
160
•
19
rb
published
a dataset
29 days ago
antieval/agentharm-trajectories
Viewer
•
Updated
29 days ago
•
160
•
19
Load more