共 50 条
- [41] END-TO-END MULTI-SPEAKER SPEECH RECOGNITION WITH TRANSFORMER [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 6134 - 6138
- [42] INVESTIGATING ON INCORPORATING PRETRAINED AND LEARNABLE SPEAKER REPRESENTATIONS FOR MULTI-SPEAKER MULTI-STYLE TEXT-TO-SPEECH [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 8588 - 8592
- [43] Sparse Component Analysis for Speech Recognition in Multi-Speaker Environment [J]. 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 1704 - 1707
- [44] Speaker-Attributed Training for Multi-Speaker Speech Recognition Using Multi-Stage Encoders and Attention-Weighted Speaker Embedding [J]. Applied Sciences (Switzerland), 2024, 14 (18):
- [45] Multi-Speaker Modeling with Shared Prior Distributions and Model Structures for Bayesian Speech Synthesis [J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 120 - 123
- [46] Cross-lingual multi-speaker speech synthesis with limited bilingual training data [J]. COMPUTER SPEECH AND LANGUAGE, 2023, 77
- [48] Silent versus modal multi-speaker speech recognition from ultrasound and video [J]. INTERSPEECH 2021, 2021, : 641 - 645
- [50] An emotional speech synthesis markup language processor for multi-speaker and emotional text-to-speech applications [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2021, 40 (05): : 523 - 529