共 50 条
- [1] PHONEME DEPENDENT SPEAKER EMBEDDING AND MODEL FACTORIZATION FOR MULTI-SPEAKER SPEECH SYNTHESIS AND ADAPTATION [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 6930 - 6934
- [2] An Unsupervised Method to Select a Speaker Subset from Large Multi-Speaker Speech Synthesis Datasets [J]. INTERSPEECH 2020, 2020, : 1758 - 1762
- [3] TOWARDS MULTI-SPEAKER UNSUPERVISED SPEECH PATTERN DISCOVERY [J]. 2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4366 - 4369
- [4] Unsupervised Discovery of Phoneme Boundaries in Multi-Speaker Continuous Speech [J]. 2011 IEEE INTERNATIONAL CONFERENCE ON DEVELOPMENT AND LEARNING (ICDL), 2011,
- [5] MULTI-SPEAKER MODELING AND SPEAKER ADAPTATION FOR DNN-BASED TTS SYNTHESIS [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 4475 - 4479
- [6] Improving Multi-Speaker Tacotron with Speaker Gating Mechanisms [J]. PROCEEDINGS OF THE 39TH CHINESE CONTROL CONFERENCE, 2020, : 7498 - 7503
- [7] A hybrid approach to speaker recognition in multi-speaker environment [J]. PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PROCEEDINGS, 2005, 3776 : 272 - 275
- [8] Speaker Clustering with Penalty Distance for Speaker Verification with Multi-Speaker Speech [J]. 2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2019, : 1630 - 1635
- [9] Automatic speaker clustering from multi-speaker utterances [J]. ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI, 1999, : 817 - 820
- [10] Speaker conditioned acoustic modeling for multi-speaker conversational ASR [J]. INTERSPEECH 2022, 2022, : 3834 - 3838