Higgsfield just shipped Seed Audio 1.0, a new audio model that handles three distinct tasks in one place: text-to-speech narration, voice cloning from a short sample, and full video dubbing across 18 languages. It is available directly on the Higgsfield platform and , the more interesting part for developers , through Claude via the Higgsfield MCP server.

What Seed Audio 1.0 actually does

Seed Audio 1.0 is ByteDance Seed's all-in-one audio generation model for creating complete sound scenes. It accepts text, image, or audio context to guide multi-speaker dialogue, emotional delivery, native accents, ambience, background music, and foley-style effects. That last part matters: this is not just a TTS engine. It is positioned for complete audio scenes , multi-character dialogue, emotion, tone, accents, ambience beds, BGM, and foley in a single creative pass.

The three core modes are:

  • Text-to-speech (Voiceover): Paste a script, pick a voice preset or your own clone, and get a narrated audio file.
  • Voice swap: Replace the voice in an existing video with any preset or cloned voice.
  • Video dubbing: Translate a video into a target language with automatic lip-sync to the new audio track.

Higgsfield brings ElevenLabs, MiniMax, Seed Speech, and Vibe Voice into one subscription, with multilingual cloning, dubbing, video sync, and a multi-voice library. Seed Audio 1.0 is the new addition to that roster, sitting alongside ElevenLabs v3 as a primary engine. Claude can now generate audio via Higgsfield MCP , voiceovers, voice cloning, and dubbing in 50+ languages, powered by Seed Audio 1.0 and ElevenLabs v3, all inside Claude.

Alpha Signal

Don't miss what's next in AI

Join 300,000+ engineers and researchers who get the signal, not the noise.

  • Full access to in-depth AI research breakdowns
  • Be the first to know what's trending before it hits mainstream
  • Daily curated papers, repos, and industry moves