HIGH QUALITY VOICE MANIPULATION METHOD BASED ON THE VOCAL TRACT AREA FUNCTION OBTAINED FROM SUB-BAND LSP OF STRAIGHT SPECTRUM

被引:2
|
作者
Arakawa, Ayanori
Uchimura, Yoshinori
Banno, Hideki
Itakura, Fumitada
Kawahara, Hideki
机构
关键词
Speech synthesis; Speech analysis; Vocoders; Vocal system;
D O I
10.1109/ICASSP.2010.5495142
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper describes a high-quality manipulation method of voice quality base on the vocal tract area function (VTAF) obtained from sub-band LSP of STRAIGHT spectrum. Our research group had developed the manipulation technique of voice quality based on VTAF that can generate natural formant transition. However, it is observed that the generated sound sometimes results in degradation when the input signal has a high sampling frequency. Therefore, we develop a new method that extracts VTAF properly from such input signal. This method firstly divides the input spectral envelope represented by STRAIGHT spectrum into lower and higher frequency bands, secondly extracts the Line spectrum pair (LSP) in each frequency band after spectral flattening that is appropriate for the frequency band, thirdly concatenates a pair of the sub-band LSP, and finally obtains VTAF from PARCOR coefficients converted from the con-catenated LSP. A subjective experiment proved that the proposed method is high quality enough.
引用
收藏
页码:4834 / 4837
页数:4
相关论文
共 2 条
  • [1] Study on Manipulation Method of Voice Quality Based on the Vocal Tract Area Function
    Uchimura, Yoshinori
    Banno, Hideki
    Itakura, Fumitada
    Kawahara, Hideki
    [J]. INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1084 - 1087
  • [2] A preliminary study of voice quality transformation based on modifications to the neutral vocal tract area function
    Story, BH
    Titze, IR
    [J]. JOURNAL OF PHONETICS, 2002, 30 (03) : 485 - 509