An Integrated Approach to Robust Speaker Identification and Speech Recognition

被引：1

作者：

Kwan, C. ^{[1
]}

Yin, J. ^{[1
]}

Ayhan, B. ^{[1
]}

Chu, S. ^{[1
]}

Liu, X. ^{[1
]}

Puckett, K. ^{[1
]}

Zhao, Y.

Ho, K. C.

Kruger, M.

Sityar, I.

机构：

[1] Signal Proc Inc, Rockville, MD 20850 USA

来源：

2008 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-8 | 2008年

关键词：

D O I：

10.1109/IJCNN.2008.4634016

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Conventional speaker identification and speech recognition algorithms cannot deal with noisy and multiple speaker environments. For example, HIM via Voice has low recognition rates if dictation is done in a noisy environment. In order to achieve high performance in speaker identification and speech recognition, we propose an integrated approach that takes every facet of the process into account. Here we summarize some preliminary results from the application of this integrated approach to robust speaker identification and speech recognition. A real-time stand-alone software prototype has been developed to evaluate the effectiveness of the approach.

引用

页码：1635 / +

页数：3

共 50 条

[1] Robust analysis and weighting on MFCC components for speech recognition and speaker identification
Zhou, Xi
Fu, Yun
Liu, Ming
Hasegawa-Johnson, Mark
Huang, Thomas S.
[J]. 2007 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-5, 2007, : 188 - 191
[2] Robust several-speaker speech recognition with highly dependable online speaker adaptation and identification
Shih, Po-Yi
Lin, Po-Chuan
Wang, Jhing-Fa
Lin, Yuan-Ning
[J]. JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2011, 34 (05) : 1459 - 1467
[3] An Ensemble Speaker and Speaking Environment Modeling Approach to Robust Speech Recognition
Tsao, Yu
Lee, Chin-Hui
[J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2009, 17 (05): : 1025 - 1037
[4] SPEAKER IDENTIFICATION AND MESSAGE IDENTIFICATION IN SPEECH RECOGNITION
GARVIN, PL
LADEFOGED, P
[J]. PHONETICA, 1963, 9 (04) : 193 - 199
[5] An integrated study of speaker normalisation and HMM adaptation for noise robust speaker-independent speech recognition
Hariharan, R
Viikki, O
[J]. SPEECH COMMUNICATION, 2002, 37 (3-4) : 349 - 361
[6] MULTILEVEL SPEECH INTELLIGIBILITY FOR ROBUST SPEAKER RECOGNITION
Nemala, Sridhar Krishna
Elhilali, Mounya
[J]. 2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4393 - 4396
[7] Speaker and Noise Factorization for Robust Speech Recognition
Wang, Yongqiang
Gales, Mark J. F.
[J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2012, 20 (07): : 2149 - 2158
[8] Continuous Speech Recognition and Identification of the Speaker System
Guffanti, Diego
Martinez, Danilo
Paladines, Jose
Sarmiento, Andrea
[J]. PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY & SYSTEMS (ICITS 2018), 2018, 721 : 767 - 776
[9] Robust Digital Speech Watermarking For Online Speaker Recognition
Nematollahi, Mohammad Ali
Gamboa-Rosales, Hamurabi
Akhaee, Mohammad Ali
Al-Haddad, S. A. R.
[J]. MATHEMATICAL PROBLEMS IN ENGINEERING, 2015, 2015
[10] Noise robust estimate of speech dynamics for speaker recognition
Openshaw, JP
Mason, JS
[J]. ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 925 - 928

← 1 2 3 4 5 →