Generate Speech turns text into spoken audio using a speech synthesis model. The node runs on a selectable model and consumes credits each time it executes.

When to use

  • Voiceovers
  • Narration
  • Audio ads

Inputs

Text
text
required
The text to speak. Supports loop mode.

Outputs

Audio
audio
The generated speech.

Configuration

voice
string
default:""
The voice to use for synthesis.
language
string
default:"en"
Language code for the spoken output.
stability
number
Voice stability, from 0 to 1. Higher values produce a more consistent delivery.
similarity_boost
number
How closely the output matches the chosen voice, from 0 to 1.
style
number
default:0
Style exaggeration, from 0 to 1.
speed
number
default:1
Playback speed, from 0.5 to 2.
See Audio models for supported models and Model settings for available options.
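
The numeric configuration fields above each have a documented range, so it can help to check values before running the node and spending credits. The following is a minimal sketch of such a check in Python. The `SpeechConfig` class and its `validate` method are hypothetical names introduced for illustration and are not part of the product. Only the field names, defaults, and ranges come from the Configuration section. `stability` and `similarity_boost` have no documented default, so the sketch leaves them unset.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class SpeechConfig:
    """Hypothetical container mirroring the documented configuration fields."""
    voice: str = ""                           # documented default: ""
    language: str = "en"                      # documented default: "en"
    stability: Optional[float] = None         # no default documented
    similarity_boost: Optional[float] = None  # no default documented
    style: float = 0.0                        # documented default: 0
    speed: float = 1.0                        # documented default: 1

    def validate(self) -> List[str]:
        """Return a list of range violations (empty if every value is in range)."""
        # Ranges as stated in the Configuration section.
        ranges = {
            "stability": (0.0, 1.0),
            "similarity_boost": (0.0, 1.0),
            "style": (0.0, 1.0),
            "speed": (0.5, 2.0),
        }
        errors = []
        for name, (low, high) in ranges.items():
            value = getattr(self, name)
            # Unset optional fields are skipped rather than treated as errors.
            if value is not None and not (low <= value <= high):
                errors.append(f"{name} must be between {low} and {high}, got {value}")
        return errors
```

A config with only defaults passes the check, while something like `SpeechConfig(speed=3).validate()` returns one message naming `speed`. How the node itself handles out-of-range values is not stated here, so this check is a precaution rather than a description of its behavior.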