
Fish Audio's S2.1 Pro, the same model that powers its paid tier, is now available to any developer as a free text-to-speech API, with unlimited access under Fair Use. The move flips the standard industry script, in which production-quality voice has always sat behind a paywall.
The paywall problem it's solving
The models that actually sound good cost money. ElevenLabs' free tier gives you roughly 6–10 minutes of audio before the paywall kicks in. OpenAI TTS has no free tier at all. Google's latest Gemini TTS models have zero free usage: you pay from the first token.
The AI voice generator market is growing at nearly 20% annually, but the tooling to build voice-enabled products has stayed behind a paywall. You can't properly evaluate a model on 10,000 credits. Fish Audio is betting that removing the barrier entirely will accelerate adoption, and that its new inference stack makes doing so economically viable.
What S2.1 Pro actually is
S2.1 Pro is Fish Audio's current state-of-the-art voice model, designed for production-grade AI voice generation, with particular strengths in low-latency streaming, multilingual TTS, and voice cloning. It's an improved version of S2 Pro, which Fish Audio released with open weights earlier this year.
The headline improvements over the previous generation:
- 61% win rate against S2 Pro in head-to-head listening evaluations
- ~70ms Time-to-First-Audio (TTFA) for a single request, down from ~100ms in the prior generation
- 2x+ throughput improvement under high-concurrency load
- 83 languages supported, up from 80+ in S2 Pro
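For readers who want to check a latency figure like the TTFA number above on their own setup, here is a minimal Python sketch of how time-to-first-audio can be measured on any streaming response. It is not Fish Audio's client or benchmark method: `measure_ttfa` and `fake_stream` are hypothetical names, and `fake_stream` is only a stand-in that sleeps before yielding dummy bytes. With a real service you would pass in the iterator of audio chunks from your HTTP or SDK client, and the result would also include network and connection setup time.

```python
import time
from typing import Callable, Iterable, Iterator, Tuple


def measure_ttfa(
    chunks: Iterable[bytes],
    clock: Callable[[], float] = time.perf_counter,
) -> Tuple[float, int]:
    """Return (seconds until the first non-empty chunk, total bytes received).

    The clock starts when this function is called, so the iterable should be
    lazy: the request work must happen while we iterate, not before.
    """
    start = clock()
    ttfa = None
    total = 0
    for chunk in chunks:
        # Empty keep-alive chunks carry no audio, so they do not stop the timer.
        if chunk and ttfa is None:
            ttfa = clock() - start
        total += len(chunk)
    if ttfa is None:
        raise ValueError("stream produced no audio")
    return ttfa, total


def fake_stream(first_delay_s: float, n_chunks: int = 3) -> Iterator[bytes]:
    """Hypothetical stand-in for a streaming TTS response (assumption)."""
    time.sleep(first_delay_s)  # simulated wait before the first audio arrives
    for _ in range(n_chunks):
        yield b"\x00" * 1024   # dummy bytes, not real audio


if __name__ == "__main__":
    ttfa, total = measure_ttfa(fake_stream(0.07))
    print(f"TTFA: {ttfa * 1000:.0f} ms, {total} bytes received")
```

A single measurement is noisy. Repeating the call and reporting the median gives a fairer comparison, and the figures will only be comparable to the ones quoted here if the test conditions match, which the article does not specify.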
Under the hood: why it sounds this good
S2 Pro was trained on over 10 million hours of audio data covering more than 80 languages, combining a Dual-Autoregressive (Dual-AR) architecture with reinforcement learning alignment to generate speech that is exceptionally natural, realistic, and emotionally rich.
The Dual-AR architecture is the key structural innovation. Instead of one model doing everything, there are two working in tandem:
