共 50 条
- [31] Tandem Multitask Training of Speaker Diarisation and Speech Recognition for Meeting Transcription [J]. INTERSPEECH 2022, 2022, : 3844 - 3848
- [32] Two-way cluster voting to improve speaker diarisation performance [J]. 2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 753 - 756
- [33] SPEAKER DIARISATION AND LONGITUDINAL LINKING IN MULTI-GENRE BROADCAST DATA [J]. 2015 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING (ASRU), 2015, : 660 - 666
- [34] JOINT SPEAKER DIARISATION AND TRACKING IN SWITCHING STATE-SPACE MODEL [J]. 2022 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP, SLT, 2022, : 605 - 612
- [35] AUDIO ENHANCING WITH DNN AUTOENCODER FOR SPEAKER RECOGNITION [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 5090 - 5094
- [36] Sparse DNN-based speaker segmentation using side information [J]. ELECTRONICS LETTERS, 2015, 51 (08) : 651 - 653
- [37] Usage of DNN in Speaker Recognition: Advantages and Problems [J]. ADVANCES IN NEURAL NETWORKS - ISNN 2016, 2016, 9719 : 82 - 91
- [38] An Investigation of DNN-Based Speech Synthesis Using Speaker Codes [J]. 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 2278 - 2282
- [39] Human-in-the-loop Speaker Adaptation for DNN-based Multi-speaker TTS [J]. INTERSPEECH 2022, 2022, : 2968 - 2972
- [40] AN INVESTIGATION OF AUGMENTING SPEAKER REPRESENTATIONS TO IMPROVE SPEAKER NORMALISATION FOR DNN-BASED SPEECH RECOGNITION [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 4610 - 4613