Page Hero Background
Cerebras profile image
#43

Cerebras

Builds the WSE-3, a wafer-scale processor with 900,000 AI-optimized cores and 44 GB on-chip SRAM on a single die, eliminating GPU cluster interconnect overhead. Powers the CS-3 system and a cloud inference API serving open models like Llama, DeepSeek R1, and Qwen3 at thousands of tokens per second with OpenAI-compatible endpoints.
Categories
Subcategories
SMALL MODELS
Links