Hugging Face
David Heineman
davidheineman
4 followers · 4 following
https://davidheineman.com
heinemandavidj
davidheineman
Recent Activity
updated a dataset 7 days ago: davidheineman/minieval
published a dataset 7 days ago: davidheineman/minieval
authored a paper 8 days ago: Signal and Noise: A Framework for Reducing Uncertainty in Language Model Evaluation
davidheineman's datasets (23)
davidheineman/minieval • Viewer • Updated 7 days ago • 247k • 16
davidheineman/colm-2025 • Viewer • Updated 21 days ago • 418 • 36
davidheineman/artisanal-texts • Viewer • Updated Sep 21 • 10.4k • 42
davidheineman/neurips-2025 • Viewer • Updated Sep 20 • 5.79k • 58
davidheineman/scifact-open • Viewer • Updated Sep 11 • 279 • 4
davidheineman/inverse-scaling • Viewer • Updated Sep 3 • 37.5k • 68
davidheineman/plato • Viewer • Updated Aug 20 • 1.64k • 12
davidheineman/hackernews • Viewer • Updated Jun 18 • 221k • 13
davidheineman/deepseek-leetcode • Viewer • Updated Jun 16 • 180 • 33
davidheineman/medqa-en • Viewer • Updated May 30 • 12.7k • 224
davidheineman/nsf-awards • Viewer • Updated May 27 • 525k • 23
davidheineman/ponder-this • Viewer • Updated May 23 • 324 • 5
davidheineman/jane-street-puzzles • Viewer • Updated May 23 • 120 • 20
davidheineman/irt-evals • Updated Apr 13 • 27
davidheineman/consistent-ranking-evals • Viewer • Updated Mar 24 • 366k • 2
davidheineman/aime • Viewer • Updated Feb 2 • 933 • 9
davidheineman/hle • Viewer • Updated Feb 1 • 2.68k • 5
davidheineman/gpqa • Viewer • Updated Feb 1 • 448 • 3
davidheineman/autobencher-math • Viewer • Updated Jan 20 • 18k • 11
davidheineman/deepmind-math-large • Viewer • Updated Jan 19 • 38.1k • 15
davidheineman/autobencher-knowledge-qa • Viewer • Updated Jan 8 • 33.7k • 4
davidheineman/georgia-environmental-complaints • Viewer • Updated Jan 5 • 836 • 16
davidheineman/consumer-finance-complaints-large • Viewer • Updated Jan 4 • 7.18M • 28 • 1