50 entries in total
- [31] Emotional sounds of crowds: spectrogram-based analysis using deep learning [J]. Multimedia Tools and Applications, 2020, 79 : 36063 - 36075
- [34] IMPROVING GAN-BASED VOCODER FOR FAST AND HIGH-QUALITY SPEECH SYNTHESIS [J]. INTERSPEECH 2022, 2022 : 1601 - 1605
- [35] A COMPACT FRAMEWORK FOR VOICE CONVERSION USING WAVENET CONDITIONED ON PHONETIC POSTERIORGRAMS [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019 : 6810 - 6814
- [36] SPECTROGRAM-BASED CLASSIFICATION OF SPOKEN FOUL LANGUAGE USING DEEP CNN [J]. 2020 IEEE 22ND INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2020
- [37] Continuous vocoder applied in deep neural network based voice conversion [J]. Multimedia Tools and Applications, 2019, 78 : 33549 - 33572
- [39] A ANN BASED HIGH QUALITY METHOD FOR VOICE CONVERSION [J]. 2010 6TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS NETWORKING AND MOBILE COMPUTING (WICOM), 2010