Page Hero Background
Artificial Analysis profile image
#31

Artificial Analysis

Independent benchmarking platform for LLMs and inference API providers. Tracks quality, throughput, latency, and cost across proprietary and open-weight models. Publishes the Intelligence Index, a composite of evaluations including GPQA Diamond, Humanity's Last Exam, SciCode, and Terminal-Bench Hard. Covers reasoning vs. non-reasoning models, context window comparisons, and per-token pricing.
Subcategories
SMALL MODELSMIXTURE OF EXPERTSREALTIME VOICE
Links