LOCAL LINEAR TRANSFORMATION FOR VOICE CONVERSION

Cited by: 0

Authors:

Popa, Victor [1]

Silen, Hanna [1]

Nurminen, Jani [2]

Gabbouj, Moncef [1]

Affiliations:

[1] Tampere Univ Technol, Dept Signal Proc, FIN-33101 Tampere, Finland

[2] Nokia, Tampere, Finland

Source:

2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2012

Funding:

Academy of Finland;

Keywords:

Gaussian Mixture Model (GMM); Line Spectral Frequencies (LSF); Local Linear Transformation (LLT);

DOI:

Not available

Chinese Library Classification (CLC):

O42 [Acoustics];

Discipline classification codes:

070206 ; 082403 ;

Abstract:

Many popular approaches to spectral conversion use linear transformations determined for particular acoustic classes and compute the converted result as a linear combination of different local transformations in an attempt to ensure a continuous conversion. These methods often produce over-smoothed spectra and parameter tracks. The proposed method computes an individual linear transformation for every feature vector based on a small neighborhood in the acoustic space, thus preserving local details. The method effectively reduces over-smoothing by eliminating undesired contributions from acoustically remote regions. The method is evaluated in listening tests against the well-known Gaussian Mixture Model (GMM) based conversion, representative of the class of methods involving linear transformations. Perceptual results indicate a clear preference for the proposed scheme.
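
The abstract does not give the details of the method, so the following is only a minimal sketch of the general idea of a per-vector local linear transformation, not the authors' implementation. It assumes time-aligned source and target training frames, a Euclidean k-nearest-neighbor search to define the "small neighborhood", and a ridge-regularized least-squares fit of an affine map. The function name `local_linear_convert` and the parameters `k` and `ridge` are hypothetical.

```python
import numpy as np

def local_linear_convert(x, src_train, tgt_train, k=8, ridge=1e-6):
    """Convert one source vector with an affine map fitted on its k nearest
    aligned training pairs (illustrative assumption, not the paper's recipe)."""
    # Distance from x to every source training frame.
    dists = np.linalg.norm(src_train - x, axis=1)
    idx = np.argsort(dists)[:k]  # indices of the local neighborhood
    # Append a constant column so the fitted map includes a bias term.
    X = np.hstack([src_train[idx], np.ones((len(idx), 1))])
    Y = tgt_train[idx]
    # Ridge-regularized least squares: W = (X^T X + ridge * I)^-1 X^T Y.
    W = np.linalg.solve(X.T @ X + ridge * np.eye(X.shape[1]), X.T @ Y)
    # Apply the locally fitted map to x itself.
    return np.append(x, 1.0) @ W

# Synthetic check: when the true mapping is globally affine,
# the local fit should reproduce it closely.
rng = np.random.default_rng(0)
src = rng.uniform(-1.0, 1.0, size=(200, 3))
A = rng.normal(size=(3, 3))
b = rng.normal(size=3)
tgt = src @ A + b
query = np.array([0.1, -0.2, 0.3])
converted = local_linear_convert(query, src, tgt)
```

Because each map is fitted only on nearby frames, training pairs from acoustically remote regions make no contribution to the result, which is the property the abstract credits for reducing over-smoothing.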

Pages: 4517-4520

Page count: 4
