An Integrated Approach to Robust Speaker Identification and Speech Recognition

被引:1
|
作者
Kwan, C. [1 ]
Yin, J. [1 ]
Ayhan, B. [1 ]
Chu, S. [1 ]
Liu, X. [1 ]
Puckett, K. [1 ]
Zhao, Y.
Ho, K. C.
Kruger, M.
Sityar, I.
机构
[1] Signal Proc Inc, Rockville, MD 20850 USA
关键词
D O I
10.1109/IJCNN.2008.4634016
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Conventional speaker identification and speech recognition algorithms cannot deal with noisy and multiple speaker environments. For example, HIM via Voice has low recognition rates if dictation is done in a noisy environment. In order to achieve high performance in speaker identification and speech recognition, we propose an integrated approach that takes every facet of the process into account. Here we summarize some preliminary results from the application of this integrated approach to robust speaker identification and speech recognition. A real-time stand-alone software prototype has been developed to evaluate the effectiveness of the approach.
引用
收藏
页码:1635 / +
页数:3
相关论文
共 50 条
  • [1] Robust analysis and weighting on MFCC components for speech recognition and speaker identification
    Zhou, Xi
    Fu, Yun
    Liu, Ming
    Hasegawa-Johnson, Mark
    Huang, Thomas S.
    [J]. 2007 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-5, 2007, : 188 - 191
  • [2] Robust several-speaker speech recognition with highly dependable online speaker adaptation and identification
    Shih, Po-Yi
    Lin, Po-Chuan
    Wang, Jhing-Fa
    Lin, Yuan-Ning
    [J]. JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2011, 34 (05) : 1459 - 1467
  • [3] An Ensemble Speaker and Speaking Environment Modeling Approach to Robust Speech Recognition
    Tsao, Yu
    Lee, Chin-Hui
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2009, 17 (05): : 1025 - 1037
  • [4] SPEAKER IDENTIFICATION AND MESSAGE IDENTIFICATION IN SPEECH RECOGNITION
    GARVIN, PL
    LADEFOGED, P
    [J]. PHONETICA, 1963, 9 (04) : 193 - 199
  • [5] An integrated study of speaker normalisation and HMM adaptation for noise robust speaker-independent speech recognition
    Hariharan, R
    Viikki, O
    [J]. SPEECH COMMUNICATION, 2002, 37 (3-4) : 349 - 361
  • [6] MULTILEVEL SPEECH INTELLIGIBILITY FOR ROBUST SPEAKER RECOGNITION
    Nemala, Sridhar Krishna
    Elhilali, Mounya
    [J]. 2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4393 - 4396
  • [7] Speaker and Noise Factorization for Robust Speech Recognition
    Wang, Yongqiang
    Gales, Mark J. F.
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2012, 20 (07): : 2149 - 2158
  • [8] Continuous Speech Recognition and Identification of the Speaker System
    Guffanti, Diego
    Martinez, Danilo
    Paladines, Jose
    Sarmiento, Andrea
    [J]. PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY & SYSTEMS (ICITS 2018), 2018, 721 : 767 - 776
  • [9] Robust Digital Speech Watermarking For Online Speaker Recognition
    Nematollahi, Mohammad Ali
    Gamboa-Rosales, Hamurabi
    Akhaee, Mohammad Ali
    Al-Haddad, S. A. R.
    [J]. MATHEMATICAL PROBLEMS IN ENGINEERING, 2015, 2015
  • [10] Noise robust estimate of speech dynamics for speaker recognition
    Openshaw, JP
    Mason, JS
    [J]. ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 925 - 928