Audio models - Pleyor

Audio models are selected on the audio-generating nodes. They cover three jobs: turning text into speech, composing music from a description, and adding sound effects to a video.

Families

Model	Provider	Used by	Job
ElevenLabs Multilingual v2	ElevenLabs	Generate Speech	Text-to-speech
ElevenLabs v3	ElevenLabs	Generate Speech	Text-to-speech
ElevenLabs Music v1	ElevenLabs	Generate Music	Music generation
Mirelo 1.5	FAL	Add Sound Effects	Video sound effects

Settings

Each audio node exposes the settings relevant to its model. Availability depends on the model you pick.

Speech

Setting	What it does
`stability`	Voice consistency, from 0 to 1. Higher values give a steadier delivery.
`similarity_boost`	How closely the output matches the chosen voice, from 0 to 1.
`style`	Style exaggeration, from 0 to 1.
`speed`	Playback speed, from 0.5 to 2.

Music

Setting	What it does
`duration`	Track length in seconds, from 3 to 600.
`force_instrumental`	Generate an instrumental track with no vocals.

See Model settings for the full reference.

Generate Speech

Convert text to spoken audio with an ElevenLabs voice.

Generate Music

Compose a music track from a text description.

Add Sound Effects

Generate and attach sound effects to a video.

Text models Model settings

​Families

​Settings

​Speech

​Music

​Related nodes