Adaptive End-to-End Text-to-Speech Synthesis Based on Error Correction Feedback from Humans

Cited by: 0

Authors:

Fujii, Kazuki [1]

Saito, Yuki [1]

Saruwatari, Hiroshi [1]

Affiliations:

[1] Graduate School of Information Science and Technology, The University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo 133-8656, Japan

Source:

Proceedings of 2022 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2022 | 2022

Keywords:

Engineering Village;

DOI:

2022 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2022

Abstract:

Correct error - Embeddings - End to end - Errors correction - Human listeners - Human-in-the-loop - State of the art - Synthetic speech - Text to speech - Text-to-speech system


Pages: 1702-1707

50 items in total

[1] Adaptive End-to-End Text-to-Speech Synthesis Based on Error Correction Feedback from Humans
Fujii, Kazuki
Saito, Yuki
Saruwatari, Hiroshi
PROCEEDINGS OF 2022 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2022, : 1702 - 1707
[2] EXPLORING END-TO-END NEURAL TEXT-TO-SPEECH SYNTHESIS FOR ROMANIAN
Dumitrache, Marius
Rebedea, Traian
PROCEEDINGS OF THE 15TH INTERNATIONAL CONFERENCE LINGUISTIC RESOURCES AND TOOLS FOR NATURAL LANGUAGE PROCESSING, 2020, : 93 - 102
[3] Myanmar Text-to-Speech Synthesis Using End-to-End Model
Qin, Qinglai
Yang, Jian
Li, Peiying
2020 4TH INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND INFORMATION RETRIEVAL, NLPIR 2020, 2020, : 6 - 11
[4] End-to-End Mongolian Text-to-Speech System
Li, Jingdong
Zhang, Hui
Liu, Rui
Zhang, Xueliang
Bao, Feilong
2018 11TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2018, : 483 - 487
[5] Improving transfer of expressivity for end-to-end multispeaker text-to-speech synthesis
Kulkarni, Ajinkya
Colotte, Vincent
Jouvet, Denis
29TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2021), 2021, : 31 - 35
[6] End-to-end text-to-speech synthesis with unaligned multiple language units based on attention
Aso, Masashi
Takamichi, Shinnosuke
Saruwatari, Hiroshi
INTERSPEECH 2020, 2020, : 4009 - 4013
[7] Knowledge-based Linguistic Encoding for End-to-End Mandarin Text-to-Speech Synthesis
Li, Jingbei
Wu, Zhiyong
Li, Runnan
Zhi, Pengpeng
Yang, Song
Meng, Helen
INTERSPEECH 2019, 2019, : 4494 - 4498
[8] End-to-End Thai Text-to-Speech with Linguistic Unit
Wisetpaitoon, Kontawat
Singkul, Sattaya
Sakdejayont, Theerat
Chalothorn, Tawunrat
PROCEEDINGS OF THE 4TH ANNUAL ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, ICMR 2024, 2024, : 951 - 959
[9] NaturalSpeech: End-to-End Text-to-Speech Synthesis With Human-Level Quality
Tan, Xu
Chen, Jiawei
Liu, Haohe
Cong, Jian
Zhang, Chen
Liu, Yanqing
Wang, Xi
Leng, Yichong
Yi, Yuanhao
He, Lei
Zhao, Sheng
Qin, Tao
Soong, Frank
Liu, Tie-Yan
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (06) : 4234 - 4245
[10] End-to-End Text-To-Speech synthesis for under resourced South African languages
Nthite, Thapelo
Tsoeu, Mohohlo
2020 INTERNATIONAL SAUPEC/ROBMECH/PRASA CONFERENCE, 2020, : 684 - 689
