Voicebox is a state-of-the-art generative speech model that supports six languages, removes transient noise, edits content, transfers audio style, and generates diverse speech samples.
Synthesizes speech across six languages: English, French, German, Spanish, Polish, and Portuguese.
Removes transient noise by re-generating noise-corrupted speech.
Corrects misspoken words without requiring the speaker to re-record the audio.
Transfers audio style within and across languages.
Creates varied, expressive audio styles by sampling from the model without conditioning on any reference audio.
Removing transient noise from speech recordings.
Editing content in speech recordings without re-recording.
Generating speech in different languages and styles.
Transferring audio style across languages.
Access the Voicebox website for demos and examples.
Explore the different features and capabilities of Voicebox.
Contact the developers for more information on potential use cases.