50 entries in total
- [21] TDASS: Target Domain Adaptation Speech Synthesis Framework for Multi-speaker Low-Resource TTS [J]. 2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022
- [22] Keyword-based speaker localization: Localizing a target speaker in a multi-speaker environment [J]. 19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 2703 - 2707
- [23] Phoneme Dependent Speaker Embedding and Model Factorization for Multi-Speaker Speech Synthesis and Adaptation [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 6930 - 6934
- [24] Improving multi-speaker TTS prosody variance with a residual encoder and normalizing flows [J]. INTERSPEECH 2021, 2021, : 3131 - 3135
- [25] Sparse DNN-based speaker segmentation using side information [J]. ELECTRONICS LETTERS, 2015, 51 (08) : 651 - 653
- [26] An Investigation of DNN-Based Speech Synthesis Using Speaker Codes [J]. 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 2278 - 2282
- [27] Improving Multi-Speaker Tacotron with Speaker Gating Mechanisms [J]. PROCEEDINGS OF THE 39TH CHINESE CONTROL CONFERENCE, 2020, : 7498 - 7503
- [28] Zero-shot multi-speaker accent TTS with limited accent data [J]. 2023 ASIA PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE, APSIPA ASC, 2023, : 1931 - 1936
- [29] A hybrid approach to speaker recognition in multi-speaker environment [J]. PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PROCEEDINGS, 2005, 3776 : 272 - 275
- [30] Can We Use Common Voice to Train a Multi-Speaker TTS System? [J]. 2022 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP, SLT, 2022, : 900 - 905