Suno Rebuilds Stem Separation From Scratch, Hitting 90% Accuracy

EDITORIAL LEADERBOARD

Suno

1D AGO

2 min read

1 day ago

2 min read

Stem separation has always been the awkward middle step in AI music production: you generate something great, then watch it get mangled when you try to pull it apart. Suno just shipped a significant overhaul to its stem separation system that attacks this problem at the root, and the approach is meaningfully different from what came before.

From isolating to regenerating

Traditional stem separation works by analyzing a finished mix and trying to reverse-engineer the individual parts. Tools like Meta's Demucs or iZotope RX use spectral masks to predict which frequencies belong to which instrument. A modern stem separation tool converts audio into a spectrogram, predicts spectral masks for each sound source, and uses those masks to isolate vocals, drums, bass, and other instruments. The problem: when frequencies overlap between instruments, you get bleed, warbling, and phase artifacts that make the stems hard to use in a professional mix.

Suno's new approach sidesteps this entirely. Unlike basic stem separation tools that use frequency-based splitting, Suno's stem system has access to the generation model's internal representation of each instrument, which means cleaner separation with less crosstalk between stems. In other words, because Suno generated the track in the first place, it can regenerate each stem from scratch rather than trying to guess what was there. The result is stems that are cleaner, crisper, and free from artifacts, ready to drop into your track.

Don't miss what's next in AI

Join 300,000+ engineers and researchers who get the signal, not the noise.

Full access to in-depth AI research breakdowns
Be the first to know what's trending before it hits mainstream
Daily curated papers, repos, and industry moves

Takeaways

From isolating to regenerating

Don't miss what's next in AI