A Comparative Study of Spatial Speech Separation Techniques to Improve Speech Recognition

被引:0
|
作者
Zhou, Xinhui [1 ]
Kwan, Chiman [1 ]
Ayhan, Bulent [1 ]
Kim, Chanwoo [2 ]
Kumar, K. [2 ]
Stern, Richard [2 ]
机构
[1] Signal Proc Inc, Rockville, MD 20850 USA
[2] Carnegie Mellon Univ, Pittsburgh, PA USA
来源
基金
美国国家科学基金会;
关键词
Speech recognition; Spatial speech separation; Reverberant;
D O I
10.1007/978-3-319-92537-0_57
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Robust speech recognition in noisy and reverberant conditions is an important research area in recent years. Here we present a comparative study of several spatial speech separation methods. The main performance metric is word error rate (WER) under different signal-to-noise ratio (SNR) and reverberant conditions. Extensive simulations showed that one technique known as polyaural processing stood out as the best one.
引用
收藏
页码:494 / 502
页数:9
相关论文
共 50 条
  • [1] Comparative study of automatic speech recognition techniques
    Cutajar, Michelle
    Gatt, Edward
    Grech, Ivan
    Casha, Owen
    Micallef, Joseph
    [J]. IET SIGNAL PROCESSING, 2013, 7 (01) : 25 - 46
  • [2] Comparative study of automatic speech recognition techniques
    Faculty of Information and Communication Technology, Department of Microelectronics and Nanoelectronics, University of Malta, Tal-Qroqq, Msida
    MSD 2080, Malta
    [J]. IET Signal Proc., 2013, 1 (25-46):
  • [3] SPEECH SEPARATION FOR SPEECH RECOGNITION
    DECHEVEIGNE, A
    HAWAHARA, H
    AIKAWA, K
    LEA, A
    [J]. JOURNAL DE PHYSIQUE IV, 1994, 4 (C5): : 545 - 548
  • [4] A comparative study of linear feature transformation techniques for automatic speech recognition
    Eisele, T
    HaebUmbach, R
    Langmann, D
    [J]. ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 252 - 255
  • [5] A Comparative Study of Noise Reduction Techniques for Automatic Speech Recognition Systems
    Garg, Kanika
    Jain, Goonjan
    [J]. 2016 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2016, : 2098 - 2103
  • [6] Improve Multichannel Speech Recognition with Temporal and Spatial Information
    Zhang, Yu
    Zhang, Pengyuan
    Zhao, Qingwei
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2018, E101D (07) : 1963 - 1967
  • [7] A Comparative Study of Arabic Speech Recognition
    Ali, Onsy Abdel Alim
    Moselhy, Mohamed M.
    Bzeih, Aya
    [J]. 2012 16TH IEEE MEDITERRANEAN ELECTROTECHNICAL CONFERENCE (MELECON), 2012, : 884 - 887
  • [8] Predicting speech-in-speech recognition: Short-term audibility and spatial separation
    Wasiuk, Peter A.
    Calandruccio, Lauren
    Oleson, Jacob J.
    Buss, Emily
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2023, 154 (03): : 1827 - 1837
  • [9] Using blind source separation techniques to improve speech recognition in bilateral cochlear implant patients
    Kokkinakis, Kostas
    Loizou, Philipos C.
    [J]. 1600, Acoustical Society of America, 2 Huntington Quadrangle, Ste 1NO1, Melville, NY 11747-4502, United States (123):
  • [10] Using blind source separation techniques to improve speech recognition in bilateral cochlear implant patients
    Kokkinakis, Kostas
    Loizou, Philipos C.
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2008, 123 (04): : 2379 - 2390