
Microsoft just pulled the wraps off the Surface RTX Spark Dev Box at Build 2026 in San Francisco, and it is one of the most interesting pieces of developer hardware in years. The pitch is simple: a compact desktop that can run frontier-scale AI models entirely on your desk, no cloud API calls required. This is not a consumer PC with an AI sticker slapped on the box. It is a purpose-built local inference and fine-tuning machine.
The chip that makes it possible
Everything starts with NVIDIA's RTX Spark, the company's first-ever system-on-a-chip designed for Windows PCs. RTX Spark fuses a CPU, GPU, and memory controller onto a single package. Unlike traditional designs where the processor and graphics card operate separately with their own memory pools, RTX Spark uses a unified memory architecture similar to Apple Silicon, meaning the CPU and GPU share the same pool of fast LPDDR5X RAM, eliminating the bottleneck that occurs when data has to shuttle between separate chips.
The chip packs a 20-core Grace CPU built alongside MediaTek, a Blackwell RTX GPU with 6,144 CUDA cores, an NPU, and up to 128GB of unified LPDDR5X memory, all on a single TSMC 3nm package. Blackwell is the same GPU architecture powering NVIDIA's RTX 50 series desktop cards. This is reportedly equivalent to an RTX 5070 in terms of CUDA core count. The difference is that no consumer RTX card ships with anywhere near 128GB of VRAM -- and that memory headroom is the whole point.
Three numbers that define the machine
Three numbers define the device: a 100W sustained thermal envelope, up to 128GB of unified memory, and approximately 1 petaflop of AI compute from NVIDIA's RTX Spark chip. To put the memory figure in context: most consumer GPUs top out at 24-32GB of VRAM. Running a 70B-parameter model in full precision typically requires around 140GB. With quantization (compressing model weights to use less memory), 128GB of unified memory is enough to run much larger models comfortably.
Don't miss what's next in AI
Join 300,000+ engineers and researchers who get the signal, not the noise.
- Full access to in-depth AI research breakdowns
- Be the first to know what's trending before it hits mainstream
- Daily curated papers, repos, and industry moves
