When to use
- Generating subtitles or captions
- Creating searchable transcripts from recordings
- Feeding spoken content into downstream text nodes
Inputs
Audio: The audio to transcribe. Required. Supports loop mode.
Outputs
Transcript: The transcribed text.
Configuration
ISO 639-1 language code (e.g. en, fr). Leave blank to auto-detect.
When enabled, the transcript includes speaker labels (Speaker 1, Speaker 2, …).
When enabled, non-speech sounds such as laughter or applause are tagged in the transcript.
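The language setting above expects a two-letter ISO 639-1 code or an empty value. If the code comes from user input or another system, a small pre-check can catch values such as "eng" or "en-US" before they reach the node. The sketch below is illustrative only: `normalize_language_code` is a hypothetical helper, not part of this product, and it checks only the two-letter shape, not whether the code is actually assigned in ISO 639-1.

```python
import re

# ISO 639-1 codes are exactly two Latin letters; this checks the shape only.
_ISO_639_1_SHAPE = re.compile(r"[a-z]{2}")


def normalize_language_code(raw: str) -> str:
    """Return a lowercase two-letter code, or "" to mean auto-detect.

    Hypothetical helper for illustration; not part of the node itself.
    """
    code = raw.strip().lower()
    if code == "":
        # A blank value leaves language detection to the node.
        return ""
    if not _ISO_639_1_SHAPE.fullmatch(code):
        raise ValueError(f"expected a two-letter ISO 639-1 code, got {raw!r}")
    return code
```

For example, `normalize_language_code(" FR ")` yields `fr`, an empty string passes through unchanged, and `normalize_language_code("eng")` raises `ValueError`.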
Example
Connect an Upload Audio node (or a Generate Speech node) to the Audio input, then wire the Transcript output to a Generate Text node to summarise or reformat the spoken content.
This node runs on a selectable model. See Audio models for supported models and Model settings for available options.