Fish Audio Opens S2.1 Pro to Developers With Free Unlimited API Access

Fish Audio

5H AGO

2 min read

5 hrs ago

2 min read

Fish Audio's S2.1 Pro , the same model powering its paid tier , is now available to any developer for free via API. S2.1 Pro is now available as a free text-to-speech API with unlimited access under Fair Use. The move flips the standard industry script, where production-quality voice has always sat behind a paywall.

The paywall problem it's solving

The models that actually sound good cost money. ElevenLabs' free tier gives you roughly 6–10 minutes of audio before the paywall kicks in. OpenAI TTS has no free tier at all. Google's latest Gemini TTS models have zero free usage , you pay from the first token.

The AI voice generator market is growing at nearly 20% annually, but the tooling to build voice-enabled products has stayed behind a paywall. You can't properly evaluate a model on 10,000 credits. Fish Audio is betting that removing the barrier entirely will accelerate adoption , and that their new inference stack makes it economically viable.

What S2.1 Pro actually is

S2.1 Pro is Fish Audio's current state-of-the-art voice model, designed for production-grade AI voice generation, with particular strengths in low-latency streaming, multilingual TTS, and voice cloning. It's an improved version of S2 Pro, which Fish Audio released with open weights earlier this year.

The headline improvements over the previous generation:

61% win rate against S2 Pro in head-to-head listening evaluations
~70ms Time-to-First-Audio (TTFA) at single request , down from ~100ms in the prior generation
2x+ throughput improvement under high-concurrency load
83 languages supported, up from 80+ in S2-Pro

Under the hood: why it sounds this good

S2 Pro was trained on over 10 million hours of audio data covering more than 80 languages, combining a Dual-Autoregressive (Dual-AR) architecture with reinforcement learning alignment to generate speech that is exceptionally natural, realistic, and emotionally rich.

The Dual-AR architecture is the key structural innovation. Instead of one model doing everything, there are two working in tandem:

Don't miss what's next in AI

Join 300,000+ engineers and researchers who get the signal, not the noise.

Full access to in-depth AI research breakdowns
Be the first to know what's trending before it hits mainstream
Daily curated papers, repos, and industry moves

Fish Audio Opens S2.1 Pro to Developers With Free Unlimited API Access

Takeaways

The paywall problem it's solving

What S2.1 Pro actually is

Under the hood: why it sounds this good

Don't miss what's next in AI