An amplitude warping approach to intra-speaker normalization for speech recognition

被引:0
|
作者
Hong, KS [1 ]
机构
[1] Sungkyunkwan Univ, Sch Informat & Commun Engn, Suwon, South Korea
[2] SITRC, Suwon, South Korea
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we present an intra-speaker warping factor estimation based on pitch alteration utterance. The feature space distributions of untransformed speech from the pitch alteration utterance of intra-speaker would vary due to the acoustic differences of speech produced by glottis and vocal tract. Therefore, it may be possible to determine the amplitude warping factor by calculating the inverse ratio of input to reference pitch. As the recognition results, the error rate is reduced from 0.4% to 2.3% for digit and word decoding.
引用
收藏
页码:639 / 645
页数:7
相关论文
共 50 条
  • [1] On the issues of intra-speaker variability and realism in speech, speaker, and language recognition tasks
    Hansen, John H. L.
    Boril, Hynek
    [J]. SPEECH COMMUNICATION, 2018, 101 : 94 - 108
  • [2] Study of relationships between intra-speaker's speech variability and speech recognition performance
    Tsuge, Satoru
    Fukumi, Minoru
    Shishibori, Masami
    Ren, Fuji
    Kita, Kenji
    Kuroiwa, Shingo
    [J]. 2006 INTERNATIONAL SYMPOSIUM ON INTELLIGENT SIGNAL PROCESSING AND COMMUNICATIONS, VOLS 1 AND 2, 2006, : 33 - +
  • [3] A frequency warping approach to speaker normalization
    Lee, L
    Rose, R
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1998, 6 (01): : 49 - 60
  • [4] Study of intra-speaker's speech variability over long and short time periods for speech recognition
    Tsuge, Satoru
    Shishibori, Masami
    Kita, Kenji
    Ren, Fuji
    Kuroiwa, Shingo
    [J]. 2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13, 2006, : 397 - 400
  • [5] INTRA-SPEAKER VARIABILITY OF THE LONG-TERM SPEECH SPECTRUM
    HARMEGNIES, B
    LANDERCY, A
    [J]. SPEECH COMMUNICATION, 1988, 7 (01) : 81 - 86
  • [6] Frequency warping approach for vocal tract length normalization in speech recognition
    Xu, W
    Wang, BX
    Ding, Q
    [J]. PROCEEDINGS OF THE THIRD INTERNATIONAL SYMPOSIUM ON INSTRUMENTATION SCIENCE AND TECHNOLOGY, VOL 2, 2004, : 494 - 499
  • [7] AN INVESTIGATION OF INTRA-SPEAKER RELIABILITY
    MARINE, DR
    [J]. SPEECH TEACHER, 1965, 14 (02): : 128 - 131
  • [8] Variable pronunciations reveal dynamic intra-speaker variation in speech planning
    Oriana Kilbourn-Ceron
    Matthew Goldrick
    [J]. Psychonomic Bulletin & Review, 2021, 28 : 1365 - 1380
  • [9] Variable pronunciations reveal dynamic intra-speaker variation in speech planning
    Kilbourn-Ceron, Oriana
    Goldrick, Matthew
    [J]. PSYCHONOMIC BULLETIN & REVIEW, 2021, 28 (04) : 1365 - 1380
  • [10] On the sources of inter- & intra-speaker variability in the acoustic dynamics of speech
    Yang, X
    Millar, JB
    Macleod, I
    [J]. ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1792 - 1795