共 50 条
- [1] On maximum mutual information speaker-adapted training [J]. COMPUTER SPEECH AND LANGUAGE, 2008, 22 (02): : 130 - 147
- [2] Speaker-adapted training on the Switchboard Corpus [J]. 1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 1059 - 1062
- [3] Maximum mutual information speaker adapted training with semi-tied covariance matrices [J]. 2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING I, 2003, : 128 - 131
- [4] Speech separation using speaker-adapted eigenvoice speech models [J]. COMPUTER SPEECH AND LANGUAGE, 2010, 24 (01): : 16 - 29
- [5] ASR Confidence Estimation with Speaker-Adapted Recurrent Neural Networks [J]. 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 3464 - 3468
- [6] Mutual Information Enhanced Training for Speaker Embedding [J]. INTERSPEECH 2021, 2021, : 91 - 95
- [7] Speaker-adapted confidence measures for speech recognition of video lectures [J]. COMPUTER SPEECH AND LANGUAGE, 2016, 37 : 11 - 23
- [8] Speaker-adapted neural-network-based fusion for multimodal reference resolution [J]. 20TH ANNUAL MEETING OF THE SPECIAL INTEREST GROUP ON DISCOURSE AND DIALOGUE (SIGDIAL 2019), 2019, : 210 - 214