A Precise Estimation of Vocal Tract Parameters for High Quality Voice Morphing

被引:2
|
作者
Xu, Ning [1 ]
Yang, Zhen [1 ]
机构
[1] Nanjing Univ Post & Telecommun, Inst Signal Proc & Transmiss, Nanjing, Peoples R China
关键词
D O I
10.1109/ICOSP.2008.4697223
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
One of the most recent models for voice conversion is the classical LPC analysis-synthesis model combined with GMM, which aims to separate information from excitation and vocal tract and to learn the transformation rules with statistical methods. However it does not work well as it is supposed to be due to the inaccuracy of the extracted feature information as well as the overly-smoothed spectral converted by traditional GMM. In this paper we propose a novel method to solve the problem which is based on the technique of the separation of glottal waveforms and the prediction of the excitations. The final result shows that not only are the transformed vocal tract parameters matching the target one better, but also is the high quality of the synthesized speech preserved.
引用
下载
收藏
页码:684 / 687
页数:4
相关论文
共 50 条
  • [21] Vocal tract resonances in singing: The soprano voice
    Joliveau, E
    Smith, J
    Wolfe, J
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2004, 116 (04): : 2434 - 2439
  • [22] Glottal and Vocal Tract Characteristics of Voice Impersonators
    Bin Amin, Talal
    Marziliano, Pina
    German, James Sneed
    IEEE TRANSACTIONS ON MULTIMEDIA, 2014, 16 (03) : 668 - 678
  • [23] Voice conversion by prosody and vocal tract modification
    Rao, K. Sreenivasa
    Yegnanarayana, B.
    ICIT 2006: 9TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY, PROCEEDINGS, 2006, : 111 - +
  • [24] Vocal tract resonances in singing: The soprano voice
    Joliveau, Elodie
    Smith, John
    Wolfe, Joe
    1600, Acoustical Society of America (116):
  • [25] Estimation of Acoustic Microphone Vocal Tract Parameters from Throat Microphone Recordings
    Akarguen, Uelkue Cagri
    Erzin, Engin
    IN-VEHICLE CORPUS AND SIGNAL PROCESSING FOR DRIVER BEHAVIOR, 2009, : 161 - +
  • [26] Estimation of acoustic microphone vocal tract parameters from throat microphone recordings
    Akarguen, Uelkue Cagra
    Erzin, Engin
    2007 IEEE 15TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS, VOLS 1-3, 2007, : 747 - 750
  • [27] Immediate effect of semioccluded vocal tract therapy on acoustic parameters in the processes of masculinization and feminization of the voice
    Fuenzalida Cabezas, Rodrigo
    Sandoval Zuniga, Maria Soledad
    Sandoval, Eugenia Diaz
    Perez Zurita, Tanya
    Quiroz Bustamante, Fernanda
    Rosales Orellana, Marcela
    REVISTA DE INVESTIGACION EN LOGOPEDIA, 2021, 11 (01): : 23 - 35
  • [28] Inverse estimation of the vocal tract shape based on a vocal tract mapping interface
    Ogata, Kohichi
    Kodama, Tayuto
    Hayakawa, Tomohiro
    Aoki, Riku
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2019, 145 (04): : 1961 - 1974
  • [29] Vocal fold vibration and voice quality
    Niimi, S
    Miyaji, M
    FOLIA PHONIATRICA ET LOGOPAEDICA, 2000, 52 (1-3) : 32 - 38
  • [30] Acoustic interactions of the voice source with the lower vocal tract
    Titze, IR
    Story, BH
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1997, 101 (04): : 2234 - 2243