Adaptive End-to-End Text-to-Speech Synthesis Based on Error Correction Feedback from Humans

被引：0

作者：

Fujii, Kazuki ^{[1
]}

Saito, Yuki ^{[1
]}

Saruwatari, Hiroshi ^{[1
]}

机构：

[1] Graduate School of Information Science and Technology, The University of Tokyo, 7-3-1 Hongo Bunkyo-ku, Tokyo,133-8656, Japan

来源：

Proceedings of 2022 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2022 | 2022年

关键词：

Engineering Village;

D O I：

2022 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2022

中图分类号：

学科分类号：

摘要：

Correct error - Embeddings - End to end - Errors correction - Human listeners - Human-in-the-loop - State of the art - Synthetic speech - Text to speech - Text-to-speech system

引用

页码：1702 / 1707

共 50 条

[21] Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
Kim, Jaehyeon
Kong, Jungil
Son, Juhee
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
[22] On the Training and Testing Data Preparation for End-to-End Text-to-Speech Application
Duc Chung Tran
Khan, M. K. A. Ahamed
Sridevi, S.
2020 11TH IEEE CONTROL AND SYSTEM GRADUATE RESEARCH COLLOQUIUM (ICSGRC), 2020, : 73 - 75
[23] SEMI-SUPERVISED END-TO-END SPEECH RECOGNITION USING TEXT-TO-SPEECH AND AUTOENCODERS
Karita, Shigeki
Watanabe, Shinji
Iwata, Tomoharu
Delcroix, Marc
Ogawa, Atsunori
Nakatani, Tomohiro
2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 6166 - 6170
[24] End-to-End Text-to-Speech Based on Latent Representation of Speaking Styles Using Spontaneous Dialogue
Mitsui, Kentaro
Zhao, Tianyu
Sawada, Kei
Hono, Yukiya
Nankaku, Yoshihiko
Tokuda, Keiichi
INTERSPEECH 2022, 2022, : 2328 - 2332
[25] Generic Indic Text-to-speech Synthesisers with Rapid Adaptation in an End-to-end Framework
Prakash, Anusha
Murthy, Hema A.
INTERSPEECH 2020, 2020, : 2962 - 2966
[26] Optimization for Low-Resource Speaker Adaptation in End-to-End Text-to-Speech
Hong, Changi
Lee, Jung Hyuk
Jeon, Moongu
Kim, Hong Kook
2024 IEEE 21ST CONSUMER COMMUNICATIONS & NETWORKING CONFERENCE, CCNC, 2024, : 1060 - 1061
[27] SANE-TTS: Stable And Natural End-to-End Multilingual Text-to-Speech
Cho, Hyunjae
Jung, Wonbin
Lee, Junhyeok
Woo, Sang Hoon
INTERSPEECH 2022, 2022, : 1 - 5
[28] Phonetic and Prosodic Information Estimation from Texts for Genuine Japanese End-to-End Text-to-Speech
Kakegawa, Naoto
Hara, Sunao
Abe, Masanobu
Ijima, Yusuke
INTERSPEECH 2021, 2021, : 126 - 130
[29] End-to-End Speech Synthesis for Bangla with Text Normalization
Pial, Tanzir Islam
Aunti, Shahreen Salim
Ahmed, Shabbir
Heickal, Hasnain
2018 5TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE/ INTELLIGENCE AND APPLIED INFORMATICS (CSII 2018), 2018, : 66 - 71
[30] End-to-End Automatic Speech Recognition with a Reconstruction Criterion Using Speech-to-Text and Text-to-Speech Encoder-Decoders
Masumura, Ryo
Sato, Hiroshi
Tanaka, Tomohiro
Moriya, Takafumi
Ijima, Yusuke
Oba, Takanobu
INTERSPEECH 2019, 2019, : 1606 - 1610

← 1 2 3 4 5 →