A bilinear transform approach for vocal tract length normalization

Cited by: 0
Authors
Xu, W [1]
Wang, BX [1]
Ding, Q [1]
Affiliations
[1] Informat Engn Univ, Zhengzhou 450002, Henan, Peoples R China
Keywords
speech recognition; frequency warping; bilinear transformation; vocal tract length normalization;
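DOI
Not available
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract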
We have developed and evaluated a set of speaker normalization procedures derived from the bilinear transform (BLT) to compensate for variations in vocal tract length across different classes of speakers. The warping factors are estimated from the average third formants and their bandwidths, which avoids an exhaustive search. The MFCCs of the test data are computed with warped Mel filterbanks so that they match the models built from the training data. The effectiveness of these procedures is examined in an experimental study on an isolated-digit database of men, women, and children, in comparison with other standard speaker normalization methods. The experimental results show recognition accuracy increases of up to 19.5% and 16.5%.
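The frequency warping mentioned in the abstract can be illustrated with a small sketch. The abstract does not give the paper's exact formula, so the code below assumes the commonly used first-order all-pass form of the bilinear warp, ω' = ω + 2·arctan(α·sin ω / (1 − α·cos ω)) with |α| < 1. The function names `bilinear_warp` and `warp_hz`, the 8 kHz sample rate, the α values, and the centre frequencies are all hypothetical and chosen only for illustration.

```python
import math


def bilinear_warp(omega, alpha):
    """Warp a normalized angular frequency (radians, 0..pi).

    Assumed form: first-order all-pass (bilinear) map with |alpha| < 1.
    alpha = 0 leaves the frequency unchanged; alpha > 0 shifts it upward,
    alpha < 0 shifts it downward. The endpoints 0 and pi stay fixed.
    """
    return omega + 2.0 * math.atan2(alpha * math.sin(omega),
                                    1.0 - alpha * math.cos(omega))


def warp_hz(freq_hz, alpha, sample_rate):
    """Apply the same warp to a frequency in Hz (0..Nyquist)."""
    nyquist = sample_rate / 2.0
    omega = math.pi * freq_hz / nyquist
    return bilinear_warp(omega, alpha) * nyquist / math.pi


if __name__ == "__main__":
    # Hypothetical filterbank centre frequencies, for illustration only.
    centers = [300.0, 1000.0, 2500.0, 3500.0]
    for alpha in (-0.05, 0.0, 0.05):
        warped = [round(warp_hz(f, alpha, 8000), 1) for f in centers]
        print(alpha, warped)
```

In a setup like the one the abstract describes, the warped centre frequencies would be used to place the Mel filters before computing MFCCs. How α is derived from the third formant and its bandwidth is not specified in the abstract and is not modeled here.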
Pages: 547 - 551
Page count: 5
Related papers
50 in total
  • [1] A parametric approach to vocal tract length normalization
    Eide, E
    Gish, H
    [J]. 1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 346 - 348
  • [2] An Approach to Vocal Tract Length Normalization by Robust Formant
    Kabir, A.
    Barker, J.
    Giurgiu, M.
    [J]. RECENT ADVANCES IN CIRCUITS, SYSTEMS AND SIGNALS, 2010, : 345 - +
  • [3] A frequency warping approach for vocal tract length normalization
    Ding, Q
    Xu, W
    Wang, BX
    [J]. 2004 7TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS 1-3, 2004, : 691 - 694
  • [4] Frequency warping approach for vocal tract length normalization in speech recognition
    Xu, W
    Wang, BX
    Ding, Q
    [J]. PROCEEDINGS OF THE THIRD INTERNATIONAL SYMPOSIUM ON INSTRUMENTATION SCIENCE AND TECHNOLOGY, VOL 2, 2004, : 494 - 499
  • [5] Time domain vocal tract length normalization
    Sündermann, D
    Bonafonte, A
    Ney, H
    Hoge, H
    [J]. Proceedings of the Fourth IEEE International Symposium on Signal Processing and Information Technology, 2004, : 191 - 194
  • [6] Parameter optimization for Vocal Tract Length Normalization
    Dognin, P
    El-Jaroudi, A
    Billa, J
    [J]. 2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 1767 - 1770
  • [7] Vocal Tract Length Normalization Features for Audio Search
    Madhavi, Maulik C.
    Sharma, Shubham
    Patil, Hemant A.
    [J]. TEXT, SPEECH, AND DIALOGUE (TSD 2015), 2015, 9302 : 387 - 395
  • [8] The ΔF method of vocal tract length normalization for vowels
    Johnson, Keith
    [J]. LABORATORY PHONOLOGY, 2020, 11 (01):
  • [9] Region-Based Vocal Tract Length Normalization for ASR
    Maragakis, Michail G.
    Potamianos, Alexandros
    [J]. INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1365 - 1368
  • [10] Combining Vocal Tract Length Normalization With Hierarchical Linear Transformations
    Saheer, Lakshmi
    Yamagishi, Junichi
    Garner, Philip N.
    Dines, John
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2014, 8 (02) : 262 - 272