In this video, I look at the Gemini TTS that was released at Google I/O last week and show you how you can use it to do various things with speech and dialogue.
Blog: blog.google/technology/ai/io-2025-keynote/
blog.google/technology/google-deepmind/google-gemi…
Colab: dripl.ink/Dq3gy
For more tutorials on using LLMs and building agents, check out my Patreon
Patreon: www.patreon.com/SamWitteveen
Twitter: x.com/Sam_Witteveen
🕵️ Interested in building LLM Agents? Fill out the form below
Building LLM Agents Form: drp.li/dIMes
👨💻Github:
github.com/samwit/llm-tutorials
⏱️Time Stamps:
00:00 Intro
00:28 New Gemini 2.5 Speech Generation Text-to-Speech
01:44 Google AI Studio: Native Speech Generation
02:37 Colab Demo: Single Speaker
08:51 Colab Demo: Multi-Speaker Podc
コメント