Sakana AI's Sheaf-ADMM Solves Multi-Agent Sudoku at 92.6% Where Rivals Hit 34%

Sakana AI

3H AGO

2 min read

3 hrs ago

2 min read

Most neural networks are monoliths: one model, one forward pass, one answer. But a growing number of real-world problems -- sensor grids, robot swarms, decentralized inference -- are fundamentally distributed. No single node sees the whole picture. Sheaf-ADMM, a new framework from Sakana AI accepted at ICML 2026, takes this constraint seriously and builds it into the architecture itself.

The problem with centralized thinking

AI systems are increasingly composed of many interacting agents rather than a single monolithic model. Yet in practice, most multi-agent systems still rely on a central orchestrator to delegate and aggregate. In many systems of interest, however, no such central coordinator exists -- the nodes of a sensor network, the ants of a colony, or the neurons of a nervous system each observe a small part of the environment and communicate only with their neighbors, and coherent global behavior arises from these local interactions alone.

The question Sheaf-ADMM asks is deceptively simple: how do you build a neural network where coordination is the computation, not an afterthought? The answer pulls from two mature fields that have studied this problem from very different angles.

Two old ideas, one new architecture

The first ingredient is ADMM (Alternating Direction Method of Multipliers), a classical algorithm from distributed optimization. ADMM solves convex optimization problems by breaking them into smaller pieces, each of which are then easier to handle. It was introduced in the 1970s but has seen a recent resurgence in the context of dramatic increases in computing power and the development of widely available distributed computing platforms. In ADMM, each agent solves its own local subproblem, then reconciles with neighbors, and repeats until the whole system converges.

The second ingredient is a network sheaf -- a concept from applied algebraic topology. Sheaf theory provides a systematic framework to track data locally on a topological space and determine how to "glue" or assemble this local data into global structures. In the context of multi-agent systems, sheaf theory provides a framework for consensus on complex data structures. Crucially, a sheaf lets you define

Don't miss what's next in AI

Join 300,000+ engineers and researchers who get the signal, not the noise.

Full access to in-depth AI research breakdowns
Be the first to know what's trending before it hits mainstream
Daily curated papers, repos, and industry moves

Sakana AI's Sheaf-ADMM Solves Multi-Agent Sudoku at 92.6% Where Rivals Hit 34%

Takeaways

The problem with centralized thinking

Two old ideas, one new architecture

Don't miss what's next in AI