EDITORIAL
LEADERBOARD
SIGN IN
SIGN UP
Login
/
Sign Up
Editorial
Leaderboard
About
Benchmarks
Type
Type
LATEST
UPVOTES
LAST 30 DAYS
LAST 30 DAYS
117
Kaggle
Kaggle Lets Developers Build AI Benchmarks Locally With Claude
Code
Kaggle Lets Developers Build AI Benchmarks Locally With Claude
Code
Kaggle
benchmarks
1 day ago
117
1D AGO
396
Microsoft AI
Microsoft's MAI-Transcribe-1.5 Transcribes an Hour of Audio in 15
Seconds
Microsoft's MAI-Transcribe-1.5 Transcribes an Hour of Audio in 15
Seconds
Microsoft AI
audio
1 day ago
396
1D AGO
446
Arena.ai
LM Arena Ships Agent Mode to Rank AI Models on Real Multi-Step
Tasks
LM Arena Ships Agent Mode to Rank AI Models on Real Multi-Step
Tasks
Arena.ai
agents
1 day ago
446
1D AGO
1,171
Anthropic
Anthropic's Study Shows Claude Turning Unskilled Hackers Into Advanced
Threats
Anthropic's Study Shows Claude Turning Unskilled Hackers Into Advanced
Threats
Anthropic
security
2 days ago
1,171
2D AGO
540
Microsoft AI
Microsoft's MAI-Image-2.5 Beats GPT-Image-1.5 With 75-Point Arena
Jump
Microsoft's MAI-Image-2.5 Beats GPT-Image-1.5 With 75-Point Arena
Jump
Microsoft AI
image
2 days ago
540
2D AGO
189
Artificial Analysis
Alibaba's Fun-Realtime-TTS Beats Google to Claim the #1 Speech
Throne
Alibaba's Fun-Realtime-TTS Beats Google to Claim the #1 Speech
Throne
Artificial Analysis
audio
2 days ago
189
2D AGO
953
Epoch AI
Epoch Finds Open-Weight AI Models Trail Closed Frontier by Four
Months
Epoch Finds Open-Weight AI Models Trail Closed Frontier by Four
Months
Epoch AI
benchmarks
6 days ago
953
6D AGO