DART: Disentanglement of Accent and Speaker Representation in Multispeaker Text-to-Speech
Title | DART: Disentanglement of Accent and Speaker Representation in Multispeaker Text-to-Speech |
Publication Type | Conference Paper |
Year of Publication | 2024 |
Authors | Melechovsky J., Mehrish A., Sisman B., Herremans D. |
Conference Name | Audio Imagination: NeurIPS 2024 Workshop |
Conference Location | Vancouver |
Abstract | Recent advancements in Text-to-Speech (TTS) systems have enabled the generation of natural and expressive speech from textual input. Accented TTS aims to enhance user experience by making the synthesized speech more relatable to minority group listeners, and useful across various applications and context. |
URL | https://arxiv.org/abs/2410.13342 |