Add streaming speech generation #784

@davidmezzetti

Description

Currently, the TextToSpeech pipeline generates output speech as a single batch. To support real-time use cases, this pipeline should be modified to yield chunks of audio as they are generated, similar to streaming LLM generation.
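To make the request concrete, here is a rough sketch of the generator pattern being asked for. It is not the pipeline's actual code: `synthesize`, `split_sentences`, `stream_speech`, `batch_speech` and `SAMPLE_RATE` are all hypothetical names, and `synthesize` just emits a sine tone as a stand-in for a real model call. The assumption is that text can be split into sentences and each sentence synthesized independently.

```python
import math
import re
from typing import Iterator, List

SAMPLE_RATE = 22050  # assumed value for illustration only


def synthesize(segment: str) -> List[float]:
    """Hypothetical stand-in for a TTS model call.

    Produces 10 ms of a 440 Hz tone per input character so the
    example runs without any model or third-party dependency.
    """
    count = len(segment) * SAMPLE_RATE // 100
    return [math.sin(2 * math.pi * 440 * i / SAMPLE_RATE) for i in range(count)]


def split_sentences(text: str) -> List[str]:
    """Naive sentence splitter: break on whitespace that follows ., ! or ?"""
    parts = re.split(r"(?<=[.!?])\s+", text.strip())
    return [part for part in parts if part]


def stream_speech(text: str) -> Iterator[List[float]]:
    """Yield one audio chunk per sentence instead of a single batch."""
    for sentence in split_sentences(text):
        yield synthesize(sentence)


def batch_speech(text: str) -> List[float]:
    """Existing-style behavior, expressed as a join over the stream."""
    return [sample for chunk in stream_speech(text) for sample in chunk]
```

With this shape, a caller could start playback after the first `next()` on `stream_speech(...)` rather than waiting for the full text, while `batch_speech` keeps the current single-output behavior available.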
