Iterative MMSE Estimation of Vocal Tract Length Normalization Factors for Voice Transformation

被引:0
|
作者
Erro, Daniel [1 ]
Navas, Eva [1 ]
Hernaez, Inma [1 ]
机构
[1] Univ Basque Country UPV EHU, AHOLAB, Bilbao, Spain
关键词
vocal tract length normalization; voice conversion; frequency warping plus amplitude scaling; speech synthesis;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a method that determines the optimal configuration of a bilinear vocal tract length normalization function to transform the frequency axis of one voice according to a specific target voice. Given a number of parallel utterances of the involved speakers, the single parameter of this function can be calculated through an iterative procedure by minimizing an objective error measure defined in the cepstral domain. This method is also applicable when multiple warping classes are considered, and it can be complemented with amplitude correction filters. The resulting physically motivated cepstral transformation results in highly satisfactory conversion accuracy and improved quality with respect to standard satistical systems.
引用
收藏
页码:86 / 89
页数:4
相关论文
共 50 条
  • [1] A novel feature transformation for vocal tract length normalization in automatic speech recognition
    Claes, T
    Dologlou, I
    ten Bosch, L
    Van Compernolle, D
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1998, 6 (06): : 549 - 557
  • [2] Time domain vocal tract length normalization
    Sündermann, D
    Bonafonte, A
    Ney, H
    Hoge, H
    [J]. Proceedings of the Fourth IEEE International Symposium on Signal Processing and Information Technology, 2004, : 191 - 194
  • [3] A parametric approach to vocal tract length normalization
    Eide, E
    Gish, H
    [J]. 1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 346 - 348
  • [4] Parameter optimization for Vocal Tract Length Normalization
    Dognin, P
    El-Jaroudi, A
    Billa, J
    [J]. 2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 1767 - 1770
  • [5] The ΔF method of vocal tract length normalization for vowels
    Johnson, Keith
    [J]. LABORATORY PHONOLOGY, 2020, 11 (01):
  • [6] A bilinear transform approach for vocal tract length normalization
    Xu, W
    Wang, BX
    Ding, Q
    [J]. 2004 8TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION, VOLS 1-3, 2004, : 547 - 551
  • [7] Vocal Tract Length Normalization Features for Audio Search
    Madhavi, Maulik C.
    Sharma, Shubham
    Patil, Hemant A.
    [J]. TEXT, SPEECH, AND DIALOGUE (TSD 2015), 2015, 9302 : 387 - 395
  • [8] An Approach to Vocal Tract Length Normalization by Robust Formant
    Kabir, A.
    Barker, J.
    Giurgiu, M.
    [J]. RECENT ADVANCES IN CIRCUITS, SYSTEMS AND SIGNALS, 2010, : 345 - +
  • [9] A frequency warping approach for vocal tract length normalization
    Ding, Q
    Xu, W
    Wang, BX
    [J]. 2004 7TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS 1-3, 2004, : 691 - 694
  • [10] Vocal-tract length estimation
    V. N. Sorokin
    I. V. Geras’kin
    [J]. Journal of Communications Technology and Electronics, 2013, 58 : 1292 - 1301