Mandarin Connected Digits Recognition for Whispered Speech

被引:0
|
作者
Ru Tingting [1 ]
Xie Xiang [1 ]
Yin Hui [1 ]
Kuang Jingming [1 ]
机构
[1] Beijing Inst Technol, Dept Elect Engn, Sch Informat Sci & Technol, Beijing 100081, Peoples R China
关键词
whispered speech; connected digits; speech recognition; confusion matrix;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, the acoustic characteristics and recognition of whispered speech are discussed. A Mandarin digits database is built both in normal speech and whispered speech. The collected speech materials of normal and whispered speech are analyzed to verify the characteristics and differences for the two kinds of speech. Cross recognition is carried out using normal and whispered speech as training data and testing data respectively, and the detailed recognition results are analyzed by using the confusion matrices. The results show that it's not suitable to recognize whispered speech using models trained by normal speech, and the word correct rate of the whispered speech is in close relation with its acoustic characteristics. Some possible solutions are also suggested.
引用
下载
收藏
页码:1141 / 1144
页数:4
相关论文
共 50 条
  • [31] STATISTICAL DECISION APPROACH TO RECOGNITION OF CONNECTED DIGITS
    SAMBUR, MR
    RABINER, LR
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1976, 60 : S12 - S12
  • [32] STATISTICAL DECISION APPROACH TO RECOGNITION OF CONNECTED DIGITS
    SAMBUR, MR
    RABINER, LR
    IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1976, 24 (06): : 550 - 558
  • [33] Discriminative utterance verification for connected digits recognition
    Rahim, MG
    Lee, CH
    Juang, BH
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1997, 5 (03): : 266 - 277
  • [34] SOME PRELIMINARY EXPERIMENTS IN RECOGNITION OF CONNECTED DIGITS
    RABINER, LR
    SAMBUR, MR
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1975, 58 : S105 - S106
  • [35] SOME PRELIMINARY EXPERIMENTS IN RECOGNITION OF CONNECTED DIGITS
    RABINER, LR
    SAMBUR, MR
    IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1976, 24 (02): : 170 - 182
  • [36] Group Delay based Methods for Detection and Recognition of Whispered Speech
    Vedvyasan, Kishore
    Nathwani, Karan
    Hegde, Rajesh M.
    2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 499 - 505
  • [37] A STUDY ON ROBUSTNESS OF ARTICULATORY FEATURES FOR AUTOMATIC SPEECH RECOGNITION OF NEUTRAL AND WHISPERED SPEECH
    Srinivasan, Gokul
    Illa, Aravind
    Ghosh, Prasanta Kumar
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 5936 - 5940
  • [38] Advances in Mandarin Broadcast Speech Recognition
    Hwang, Mei-Yuh
    Wang, Wen
    Lei, Xin
    Zheng, Jing
    Cetin, Ozgur
    Peng, Gang
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 2876 - +
  • [39] Prosody Dependent Mandarin Speech Recognition
    Ni, Chong-Jia
    Liu, Wen-Ju
    Xu, Bo
    2011 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2011, : 197 - 201
  • [40] Connected Mandarin Digit Speech Recognition Using Two-layer Acoustic Universal Structure
    Rui, Xianyi
    Yu, Yibiao
    Jiang, Ying
    ADVANCES IN MECHATRONICS, AUTOMATION AND APPLIED INFORMATION TECHNOLOGIES, PTS 1 AND 2, 2014, 846-847 : 1380 - 1383