Google Cloud Text-to-Speech
Google Cloud Text-to-Speech enables developers to synthesize natural-sounding speech with 100+ voices, available in multiple languages and variants. It applies DeepMindโs groundbreaking research in WaveNet and Googleโs powerful neural networks to deliver the highest fidelity possible.
This notebook shows how to interact with the
Google Cloud Text-to-Speech API
to achieve speech synthesis
capabilities.
First, you need to set up an Google Cloud project. You can follow the instructions here.
%pip install --upgrade --quiet google-cloud-text-to-speech
Usageโ
from langchain_community.tools import GoogleCloudTextToSpeechTool
text_to_speak = "Hello world!"
tts = GoogleCloudTextToSpeechTool()
tts.name
API Reference:
We can generate audio, save it to the temporary file and then play it.
speech_file = tts.run(text_to_speak)