A Comparative Study of Spatial Speech Separation Techniques to Improve Speech Recognition

被引：0

作者：

Zhou, Xinhui ^{[1
]}

Kwan, Chiman ^{[1
]}

Ayhan, Bulent ^{[1
]}

Kim, Chanwoo ^{[2
]}

Kumar, K. ^{[2
]}

Stern, Richard ^{[2
]}

机构：

[1] Signal Proc Inc, Rockville, MD 20850 USA

[2] Carnegie Mellon Univ, Pittsburgh, PA USA

来源：

ADVANCES IN NEURAL NETWORKS - ISNN 2018 | 2018年 / 10878卷

基金：

美国国家科学基金会;

关键词：

Speech recognition; Spatial speech separation; Reverberant;

D O I：

10.1007/978-3-319-92537-0_57

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

Robust speech recognition in noisy and reverberant conditions is an important research area in recent years. Here we present a comparative study of several spatial speech separation methods. The main performance metric is word error rate (WER) under different signal-to-noise ratio (SNR) and reverberant conditions. Extensive simulations showed that one technique known as polyaural processing stood out as the best one.

引用

页码：494 / 502

页数：9

共 50 条

[1] Comparative study of automatic speech recognition techniques
Cutajar, Michelle
Gatt, Edward
Grech, Ivan
Casha, Owen
Micallef, Joseph
[J]. IET SIGNAL PROCESSING, 2013, 7 (01) : 25 - 46
[2] Comparative study of automatic speech recognition techniques
Faculty of Information and Communication Technology, Department of Microelectronics and Nanoelectronics, University of Malta, Tal-Qroqq, Msida
MSD 2080, Malta
[J]. IET Signal Proc., 2013, 1 (25-46):
[3] SPEECH SEPARATION FOR SPEECH RECOGNITION
DECHEVEIGNE, A
HAWAHARA, H
AIKAWA, K
LEA, A
[J]. JOURNAL DE PHYSIQUE IV, 1994, 4 (C5): : 545 - 548
[4] A comparative study of linear feature transformation techniques for automatic speech recognition
Eisele, T
HaebUmbach, R
Langmann, D
[J]. ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 252 - 255
[5] A Comparative Study of Noise Reduction Techniques for Automatic Speech Recognition Systems
Garg, Kanika
Jain, Goonjan
[J]. 2016 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2016, : 2098 - 2103
[6] Improve Multichannel Speech Recognition with Temporal and Spatial Information
Zhang, Yu
Zhang, Pengyuan
Zhao, Qingwei
[J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2018, E101D (07) : 1963 - 1967
[7] A Comparative Study of Arabic Speech Recognition
Ali, Onsy Abdel Alim
Moselhy, Mohamed M.
Bzeih, Aya
[J]. 2012 16TH IEEE MEDITERRANEAN ELECTROTECHNICAL CONFERENCE (MELECON), 2012, : 884 - 887
[8] Predicting speech-in-speech recognition: Short-term audibility and spatial separation
Wasiuk, Peter A.
Calandruccio, Lauren
Oleson, Jacob J.
Buss, Emily
[J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2023, 154 (03): : 1827 - 1837
[9] Using blind source separation techniques to improve speech recognition in bilateral cochlear implant patients
Kokkinakis, Kostas
Loizou, Philipos C.
[J]. 1600, Acoustical Society of America, 2 Huntington Quadrangle, Ste 1NO1, Melville, NY 11747-4502, United States (123):
[10] Using blind source separation techniques to improve speech recognition in bilateral cochlear implant patients
Kokkinakis, Kostas
Loizou, Philipos C.
[J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2008, 123 (04): : 2379 - 2390

← 1 2 3 4 5 →