/Tech2h ago

Gemini TTS Now Streams Audio Chunks For Instant Voice Output

5332102.7K

Original post

QoL for Speech Generation! You can now stream audio from Gemini TTS as it's generated. No more waiting. Build voice assistants, narration tools, and conversational apps that start talking instantly.

Set `stream: true` and receive chunks.

7:11 AM · Jun 17, 2026 · 1.9K Views

Sentiment

Users criticize Gemini TTS streaming audio as undermined by GCP's overwhelming complexity, which they say requires a PhD just to handle basic implementation.

Pos

0.0%

Neg

100.0%

1 comments with sentiment.

Cluster Engagement

Digg Deeper

No Digg Deeper questions have been answered for this story yet.

Related links

Gemini API | Google AI for Developers

GOOGLE AI FOR DEVELOPERSVia

#1404

Posts from X

Most Activity

VIEWS813LIKES1

Philipp Schmid@_philschmid

https://ai.google.dev/gemini-api/docs/interactions/speech-generation#streaming

Philipp Schmid@_philschmid

QoL for Speech Generation! You can now stream audio from Gemini TTS as it's generated. No more waiting. Build voice assistants, narration tools, and conversational apps that start talking instantly.

Set `stream: true` and receive chunks.

2h81310

REPLIES1

William 🇺🇦@WillimerTercero

@_philschmid Nice, but then one enters GCP web and it's a huge mess that requires a PhD to learn how to do the simplest things.

1h9

Philipp Schmid@_philschmid

@WillimerTercero Thats why we build @GoogleAIStudio

1h6

Nils Ingwersen@nilsingwersen

@_philschmid This is only for the tts model right? Or would this also speedup gemini-3.1-flash-live-preview?

1h4