EDITORIAL
886
Anthropic
Anthropic's Claude Beats ChemDraw at Reading Molecular Spectra Without Chemistry Training
llms
1 hr ago
117
Kaggle
Kaggle Lets Developers Build AI Benchmarks Locally With Claude Code
benchmarks
1 day ago
404
Microsoft AI
Microsoft's MAI-Transcribe-1.5 Transcribes an Hour of Audio in 15 Seconds
audio
1 day ago
452
Arena.ai
LM Arena Ships Agent Mode to Rank AI Models on Real Multi-Step Tasks
agents
1 day ago
1,171
Anthropic
Anthropic's Study Shows Claude Turning Unskilled Hackers Into Advanced Threats
security
2 days ago
541
Microsoft AI
Microsoft's MAI-Image-2.5 Beats GPT-Image-1.5 With 75-Point Arena Jump
image
2 days ago
189
Artificial Analysis
Alibaba's Fun-Realtime-TTS Beats Google to Claim the #1 Speech Throne
audio
2 days ago
953
Epoch AI
Epoch Finds Open-Weight AI Models Trail Closed Frontier by Four Months
benchmarks
7 days ago