Mandarin Connected Digits Recognition for Whispered Speech

被引:0
|
作者
Ru Tingting [1 ]
Xie Xiang [1 ]
Yin Hui [1 ]
Kuang Jingming [1 ]
机构
[1] Beijing Inst Technol, Dept Elect Engn, Sch Informat Sci & Technol, Beijing 100081, Peoples R China
关键词
whispered speech; connected digits; speech recognition; confusion matrix;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, the acoustic characteristics and recognition of whispered speech are discussed. A Mandarin digits database is built both in normal speech and whispered speech. The collected speech materials of normal and whispered speech are analyzed to verify the characteristics and differences for the two kinds of speech. Cross recognition is carried out using normal and whispered speech as training data and testing data respectively, and the detailed recognition results are analyzed by using the confusion matrices. The results show that it's not suitable to recognize whispered speech using models trained by normal speech, and the word correct rate of the whispered speech is in close relation with its acoustic characteristics. Some possible solutions are also suggested.
引用
收藏
页码:1141 / 1144
页数:4
相关论文
共 50 条
  • [1] Neighboring digits pattern training method in quickly-spoken connected mandarin digits speech recognition
    Guo C.
    Li R.
    Fan M.
    Liu K.
    [J]. Journal of Multimedia, 2011, 6 (03): : 300 - 307
  • [2] Distributed speech recognition of mandarin digits string
    Wang, Yih-Ru
    Lu, Bo-Xuan
    Liao, Yuan-Fu
    Chen, Sin-Horng
    [J]. CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, 2006, 4274 : 370 - +
  • [3] Performance Analysis of Mandarin Whispered Speech Recognition Based on Normal Speech Training Model
    Chen Xueqin
    Zhao Heming
    Fan Xiaohe
    [J]. 2016 SIXTH INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND TECHNOLOGY (ICIST), 2016, : 548 - 551
  • [4] Mandarin Digits Speech Recognition Using Support Vector Machines
    谢湘
    匡镜明
    [J]. Journal of Beijing Institute of Technology, 2005, (01) : 9 - 12
  • [5] Performance Improvement of Mandarin Digital Whispered Speech Recognition Based on Multistage Classification
    Chen Xueqin
    Sha Jun
    Yu Yibiao
    Zhao Heming
    [J]. 2016 SIXTH INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND TECHNOLOGY (ICIST), 2016, : 544 - 547
  • [6] Audio-Visual Automatic Speech Recognition for Connected Digits
    Wang, Xiaoping
    Hao, Yufeng
    Fu, Degang
    Yuan, Chunwei
    [J]. 2008 INTERNATIONAL SYMPOSIUM ON INTELLIGENT INFORMATION TECHNOLOGY APPLICATION, VOL III, PROCEEDINGS, 2008, : 328 - +
  • [7] Analysis and recognition of whispered speech
    Ito, T
    Takeda, K
    Itakura, F
    [J]. SPEECH COMMUNICATION, 2005, 45 (02) : 139 - 152
  • [8] Efficient decoding algorithms for Mandarin Connected Digit Speech Recognition
    Zhu, X
    Li, HS
    Lu, J
    Liu, RS
    [J]. PROCEEDINGS OF 2001 INTERNATIONAL SYMPOSIUM ON INTELLIGENT MULTIMEDIA, VIDEO AND SPEECH PROCESSING, 2001, : 555 - 558
  • [9] RODIGITS - A ROMANIAN CONNECTED-DIGITS SPEECH CORPUS FOR AUTOMATIC SPEECH AND SPEAKER RECOGNITION
    Georgescu, Alexandru Lucian
    Caranica, Alexandru
    Cucu, Horia
    Burileanu, Corneliu
    [J]. UNIVERSITY POLITEHNICA OF BUCHAREST SCIENTIFIC BULLETIN SERIES C-ELECTRICAL ENGINEERING AND COMPUTER SCIENCE, 2018, 80 (03): : 45 - 62
  • [10] Acoustic analysis and recognition of whispered speech
    Itoh, T
    Takeda, K
    Itakura, F
    [J]. 2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 389 - 392