28 entries in total
- [1] SISMAN B, YAMAGISHI J, KING S, et al., An overview of voice conversion and its challenges: from statistical modeling to deep learning, IEEE/ACM Transactions on Audio, Speech, and Language Processing, 29, pp. 132-157, (2021)
- [2] MOUCHTARIS A, AGIOMYRGIANNAKIS Y, STYLIANOU Y., Conditional vector quantization for voice conversion, Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 505-508, (2007)
- [3] AIHARA R, TAKASHIMA R, TAKIGUCHI T, et al., GMM-based emotional voice conversion using spectrum and prosody features, American Journal of Signal Processing, 2, 5, pp. 134-138, (2012)
- [4] HELANDER E, SILEN H, VIRTANEN T, et al., Voice conversion using dynamic kernel partial least squares regression, IEEE Transactions on Audio, Speech, and Language Processing, 20, 3, pp. 806-817, (2012)
- [5] WU Z Z, VIRTANEN T, CHNG E S, et al., Exemplar-based sparse representation with residual compensation for voice conversion, IEEE/ACM Transactions on Audio, Speech, and Language Processing, 22, 10, pp. 1506-1521, (2014)
- [6] SUN L F, LI K, WANG H, et al., Phonetic posteriorgrams for many-to-one voice conversion without parallel data training, Proceedings of IEEE International Conference on Multimedia and Expo (ICME), pp. 1-6, (2016)
- [7] MURAKAMI H, HARA S, ABE M., DNN-based voice conversion with auxiliary phonemic information to improve intelligibility of glossectomy patients' speech, Proceedings of Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), pp. 138-142, (2019)
- [8] ALAA Y, ALFONSE M, AREF M M., A survey on generative adversarial networks based models for many-to-many non-parallel voice conversion, Proceedings of 5th International Conference on Computing and Informatics (ICCI), pp. 221-226, (2022)
- [9] KANEKO T, KAMEOKA H, TANAKA K, et al., CycleGAN-VC2: improved CycleGAN-based non-parallel voice conversion, Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 6820-6824, (2019)
- [10] KAMEOKA H, KANEKO T, TANAKA K, et al., StarGAN-VC: non-parallel many-to-many voice conversion using star generative adversarial networks, Proceedings of IEEE Spoken Language Technology Workshop (SLT), pp. 266-273, (2018)