LOCAL LINEAR TRANSFORMATION FOR VOICE CONVERSION

被引:0
|
作者
Popa, Victor [1 ]
Silen, Hanna [1 ]
Nurminen, Jani [2 ]
Gabbouj, Moncef [1 ]
机构
[1] Tampere Univ Technol, Dept Signal Proc, FIN-33101 Tampere, Finland
[2] Nokia, Tampere, Finland
基金
芬兰科学院;
关键词
Gaussian Mixture Model (GMM); Line Spectral Frequencies (LSF); Local Linear Transformation (LLT);
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Many popular approaches to spectral conversion involve linear transformations determined for particular acoustic classes and compute the converted result as a linear combination between different local transformations in an attempt to ensure a continuous conversion. These methods often produce over-smoothed spectra and parameter tracks. The proposed method computes an individual linear transformation for every feature vector based on a small neighborhood in the acoustic space thus preserving local details. The method effectively reduces the over-smoothing by eliminating undesired contributions from acoustically remote regions. The method is evaluated in listening tests against the well-known Gaussin Mixture Model based conversion, representative of the class of methods involving linear transformations. Perceptual results indicate a clear preference for the proposed scheme.
引用
收藏
页码:4517 / 4520
页数:4
相关论文
共 50 条
  • [1] The Linear Transformation of LF Glottal Waveforms for Voice Conversion
    del Pozo, Arantza
    Young, Steve
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1457 - 1460
  • [2] Transformation of Prosody in Voice Conversion
    Sisman, Berrak
    Li, Haizhou
    Tan, Kay Chen
    2017 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC 2017), 2017, : 1588 - 1597
  • [3] On the transformation of the speech spectrum for voice conversion
    Baudoin, G
    Stylianou, Y
    ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1405 - 1408
  • [4] Transformation of speaker characteristics for voice conversion
    Rentzos, D
    Vaseghi, S
    Turajlic, E
    Yan, Q
    Ho, CH
    ASRU'03: 2003 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING ASRU '03, 2003, : 706 - 711
  • [5] Parametric Formant Modelling and Transformation in Voice Conversion
    Rentzos, Dimitrios
    Vaseghi, Saeed
    Yan, Qin
    Ho, Ching-Hsiang
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2005, 8 (03) : 227 - 245
  • [6] Efficient fundamental frequency transformation for voice conversion
    Song, Peng
    Jin, Yun
    Bao, Yongqiang
    Zhao, Li
    Zou, Cairong
    Journal of Southeast University (English Edition), 2012, 28 (02) : 140 - 144
  • [7] Voice Conversion Based on Locally Linear Embedding
    Hwang, Hsin-Te
    Wu, Yi-Chiao
    Peng, Yu-Huai
    Hsu, Chin-Cheng
    Tsao, Yu
    Wang, Hsin-Min
    Wang, Yih-Ru
    Chen, Sin-Horng
    JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2018, 34 (06) : 1493 - 1516
  • [8] Voice conversion with linear prediction residual estimaton
    Percybrooks, Winston S.
    Moore, Elliot, II
    2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 4673 - +
  • [9] Evaluation of methods for parameteric formant transformation in voice conversion
    Turajlic, E
    Rentzos, D
    Vaseghi, S
    Ho, CH
    2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING I, 2003, : 724 - 727
  • [10] Pitch Transformation in Neural Network based Voice Conversion
    Xie, Feng-Long
    Qian, Yao
    Soong, Frank K.
    Li, Haifeng
    2014 9TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2014, : 197 - +