Google's Gemma 4 12B Runs Text, Images and Audio on a 16GB Laptop

LM Studio

Google's Gemma 4 12B Runs Text, Images and Audio on a 16GB Laptop

Jun 03, 2026

1 min read

Jun 03, 2026

1 min read

Google just filled the most important gap in its open-model lineup. Gemma 4 12B is now available in LM Studio, and it's the first mid-sized model in the Gemma family to natively handle text, images, audio, and video , all in a single decoder-only transformer, no separate encoders required. The kicker: it runs on a standard 16GB laptop.

The gap it fills

The original Gemma 4 family, released in April, held four models: two tuned for phones (E2B and E4B) and two built for heavier work (a 26B Mixture of Experts and a 31B Dense). A wide gap sat in the middle , and the new 12-billion-parameter model drops right into it.

Gemma 4 12B delivers performance nearing the larger 26B MoE model on standard benchmarks, but at less than half the total memory footprint. Google says it clearly beats the older Gemma 3 27B across tests like GPQA Diamond, MMLU Pro, and DocVQA. Concretely, Gemma 4 12B scores 77.2% on MMLU Pro, beating last year's Gemma 3 27B (67.6%).

Don't miss what's next in AI

Join 300,000+ engineers and researchers who get the signal, not the noise.

Full access to in-depth AI research breakdowns
Be the first to know what's trending before it hits mainstream
Daily curated papers, repos, and industry moves

Google's Gemma 4 12B Runs Text, Images and Audio on a 16GB Laptop

Takeaways

The gap it fills

Don't miss what's next in AI