"Voicebox": difference between revisions


We’ve developed Voicebox, a state-of-the-art AI model that, through in-context learning, can perform speech generation tasks it wasn’t specifically trained to do, such as editing, sampling and stylizing.
Voicebox can produce high-quality audio clips and edit pre-recorded audio, for example by removing car horns or a dog barking, all while preserving the content and style of the audio. The model is also multilingual and can produce speech in six languages.
   
   



Revision as of 25 June 2023, 14:58

Under construction

Definition

XXXXXXXXX

French

Voicebox

English

Voicebox


Meta AI Research published a paper unveiling Voicebox, a generative text-to-speech model that achieves state-of-the-art performance on tasks it was not originally trained on. The model can synthesize speech in six languages and can also perform noise removal, content editing, style conversion, and diverse sample generation.


Source: about.fb

Contributors: Maya Pentsch, wiki