Exa Raises $250M as AI Agents Threaten to Replace Human Web Search

Exa

May 20, 2026

2 min read

BUSINESS

RETRIEVAL

embeddings rag vector_db

May 20, 2026

BUSINESS

RETRIEVAL

embeddings rag vector_db

2 min read

Exa just closed a $250M Series C at a $2.2B valuation, led by Andreessen Horowitz. The timing was almost theatrical: Exa announced its raise on May 20, the morning after Google declared its own search box obsolete at I/O 2026. Whether that was coordinated or coincidence, the message landed clearly: the era of search built for humans is giving way to search built for machines.

What Exa actually is

The core difference from Google, Bing, or any traditional search engine is that Exa uses neural embeddings to understand query meaning, not keyword matching. The company was founded in 2021 by Will Bryk and Jeff Wang, two Harvard roommates who watched GPT-3 launch and saw a gap: GPT-3 could understand natural language at a level that Google's keyword algorithms could not match, but search was still built on term frequency and link graphs.

Bryk and Wang started as Metaphor Systems (YC W22), rebranded to Exa in January 2024, and raised $85M at a $700M valuation in September 2025. The Series C more than triples the five-year-old startup's valuation since that raise just last fall. The speed of that re-rating tells you everything about how hot this category has become.

The stack they built from scratch

Most search APIs you've heard of are wrappers. There's a reason there are more space programs than independent search engines , most other search providers actually wrap other search engines and therefore cannot compete on quality, latency, or cost. Exa went the hard route.

Crawlers that track over 500 billion URLs, research teams that train special embedding models on a GPU cluster they assembled, and a new vector database built for the extremely high queries-per-second that agents need.
The infrastructure is built in Rust, running a custom vector database with Matryoshka embeddings (a technique that stores embeddings at multiple resolutions for flexible precision tradeoffs), document clustering, binary compression, and assembly-level SIMD optimizations. The embedding model was trained for over a month on a 144-GPU H200 cluster.
They built the fastest search API in the world at sub-200ms, and fast text extraction models that reduce LLM token counts by over 20x.

Don't miss what's next in AI

Join 300,000+ engineers and researchers who get the signal, not the noise.

Full access to in-depth AI research breakdowns
Be the first to know what's trending before it hits mainstream
Daily curated papers, repos, and industry moves

Exa Raises $250M as AI Agents Threaten to Replace Human Web Search

Takeaways

What Exa actually is

The stack they built from scratch

Don't miss what's next in AI