
Exa just closed a $250M Series C at a $2.2B valuation, led by Andreessen Horowitz. The timing was almost theatrical: Exa announced its raise on May 20, the morning after Google declared its own search box obsolete at I/O 2026. Whether that was coordinated or coincidence, the message landed clearly: the era of search built for humans is giving way to search built for machines.
What Exa actually is
The core difference from Google, Bing, or any traditional search engine is that Exa uses neural embeddings to understand query meaning, not keyword matching. The company was founded in 2021 by Will Bryk and Jeff Wang, two Harvard roommates who watched GPT-3 launch and saw a gap: GPT-3 could understand natural language at a level that Google's keyword algorithms could not match, but search was still built on term frequency and link graphs.
Bryk and Wang started as Metaphor Systems (YC W22), rebranded to Exa in January 2024, and raised $85M at a $700M valuation in September 2025. The Series C more than triples the five-year-old startup's valuation since that raise just last fall. The speed of that re-rating tells you everything about how hot this category has become.
The stack they built from scratch
Most search APIs you've heard of are wrappers. There's a reason there are more space programs than independent search engines , most other search providers actually wrap other search engines and therefore cannot compete on quality, latency, or cost. Exa went the hard route.
- Crawlers that track over 500 billion URLs, research teams that train special embedding models on a GPU cluster they assembled, and a new vector database built for the extremely high queries-per-second that agents need.
- The infrastructure is built in Rust, running a custom vector database with Matryoshka embeddings (a technique that stores embeddings at multiple resolutions for flexible precision tradeoffs), document clustering, binary compression, and assembly-level SIMD optimizations. The embedding model was trained for over a month on a 144-GPU H200 cluster.
- They built the fastest search API in the world at sub-200ms, and fast text extraction models that reduce LLM token counts by over 20x.
Don't miss what's next in AI
Join 300,000+ engineers and researchers who get the signal, not the noise.
- Full access to in-depth AI research breakdowns
- Be the first to know what's trending before it hits mainstream
- Daily curated papers, repos, and industry moves
