A Precise Estimation of Vocal Tract Parameters for High Quality Voice Morphing

被引:2
|
作者
Xu, Ning [1 ]
Yang, Zhen [1 ]
机构
[1] Nanjing Univ Post & Telecommun, Inst Signal Proc & Transmiss, Nanjing, Peoples R China
关键词
D O I
10.1109/ICOSP.2008.4697223
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
One of the most recent models for voice conversion is the classical LPC analysis-synthesis model combined with GMM, which aims to separate information from excitation and vocal tract and to learn the transformation rules with statistical methods. However it does not work well as it is supposed to be due to the inaccuracy of the extracted feature information as well as the overly-smoothed spectral converted by traditional GMM. In this paper we propose a novel method to solve the problem which is based on the technique of the separation of glottal waveforms and the prediction of the excitations. The final result shows that not only are the transformed vocal tract parameters matching the target one better, but also is the high quality of the synthesized speech preserved.
引用
收藏
页码:684 / 687
页数:4
相关论文
共 50 条
  • [1] Fast and robust joint estimation of vocal tract and voice source parameters
    Ding, W
    Campbell, N
    Higuchi, N
    Kasuya, H
    [J]. 1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 1291 - 1294
  • [2] High quality voice morphing
    Ye, H
    Young, S
    [J]. 2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING, 2004, : 9 - 12
  • [3] Voice quality enhancement for vocal tract rehabilitation
    Sutcliffe, Bianca
    Wiggins, Lindzi
    Rubin, David
    Aharonson, Vered
    [J]. 2018 3RD BIENNIAL SOUTH AFRICAN BIOMEDICAL ENGINEERING CONFERENCE (SAIBMEC), 2018,
  • [4] A novel approach to the estimation of voice source and vocal tract parameters from speech signals
    Ding, W
    Kasuya, H
    [J]. ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1257 - 1260
  • [5] SIMULTANEOUS ESTIMATION OF VOCAL-TRACT AND VOICE SOURCE PARAMETERS BASED ON AN ARX MODEL
    DING, W
    KASUYA, H
    ADACHI, S
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 1995, E78D (06) : 738 - 743
  • [6] Enhancing Voice Quality in Vocal Tract Rehabilitation Device
    Sutcliffe, Bianca
    Wiggins, Lindzi
    Rubin, David M.
    Aharonson, Vered
    [J]. ADVANCES IN USABILITY, USER EXPERIENCE AND ASSISTIVE TECHNOLOGY, 2019, 794 : 1006 - 1013
  • [7] FLEXIBLE VOICE MORPHING BASED ON LINEAR COMBINATION OF MULTI-SPEAKERS' VOCAL TRACT AREA FUNCTIONS
    Nambu, Yoshiki
    Mikawa, Masahiko
    Tanaka, Kazuyo
    [J]. 18TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO-2010), 2010, : 790 - 794
  • [8] ESTIMATION OF VOCAL TRACT PARAMETERS FOR THE CLASSIFICATION OF SPEECH UNDER STRESS
    Yao, Xiao
    Jitsuhiro, Takatoshi
    Miyajima, Chiyomi
    Kitaoka, Norihide
    Takeda, Kazuya
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 7532 - 7536
  • [9] Mapping Articulatory-Features to Vocal-Tract Parameters for Voice Conversion
    Ariwardhani, Narpendyah Wisjnu
    Kimura, Masashi
    Iribe, Yurie
    Katsurada, Kouichi
    Nitta, Tsuneo
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2014, E97D (04): : 911 - 918
  • [10] Evaluation of voice pathology based on the estimation of vocal fold biomechanical parameters
    Gomez-Vilda, P.
    Fernandez-Baillo, R.
    Nieto, A.
    Diaz, F.
    Fernandez-Camacho, F. J.
    Rodellar, V.
    Alvarez, A.
    Martinez, R.
    [J]. JOURNAL OF VOICE, 2007, 21 (04) : 450 - 476