Loading...
「ツール」は右上に移動しました。
利用したサーバー: wtserver3
797いいね 34835回再生

Gemini TTS - Native Audio Out

In this video, I look at the Gemini TTS that was released at Google I/O last week and show you how you can use it to do various things with speech and dialogue.

Blog: blog.google/technology/ai/io-2025-keynote/
blog.google/technology/google-deepmind/google-gemi…
Colab: dripl.ink/Dq3gy

For more tutorials on using LLMs and building agents, check out my Patreon
Patreon: www.patreon.com/SamWitteveen
Twitter: x.com/Sam_Witteveen

🕵️ Interested in building LLM Agents? Fill out the form below
Building LLM Agents Form: drp.li/dIMes

👨‍💻Github:
github.com/samwit/llm-tutorials

⏱️Time Stamps:
00:00 Intro
00:28 New Gemini 2.5 Speech Generation Text-to-Speech
01:44 Google AI Studio: Native Speech Generation
02:37 Colab Demo: Single Speaker
08:51 Colab Demo: Multi-Speaker Podc

コメント