Robust speech recognition with speaker localization by a microphone array

被引:0
|
作者
Yamada, T
Nakamura, S
Shikano, K
机构
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper proposes robust speech recognition with Speaker Localization by a Arrayed Microphone (SLAM) to realize hands-free speech interface in noisy environments, In order to localize a speaker direction accurately in low SNR conditions, a speaker localization algorithm based on extracting a pitch harmonics is introduced. To evaluate the performance of the proposed system, speech recognition experiments art carried out both in computer simulation and real environments. These results show that the proposed system attains the much higher speech recognition performance than that of a single microphone not only in computer simulation bat also in real environments.
引用
收藏
页码:1317 / 1320
页数:4
相关论文
共 50 条
  • [1] Robust Speaker Recognition with Combined Use of Acoustic and Throat Microphone Speech
    Sahidullah, Md
    Hautamaki, Rosa Gonzalez
    Thomsen, Dennis Alexander Lehmann
    Kinntinenl, Tomi
    Tang, Zheng-Hua
    Hautamaki, Ville
    Parts, Robert
    Pitkanen, Martti
    [J]. 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 1720 - 1724
  • [2] Robust continuous speech recognition system based on a microphone array
    Lleida, E
    Fernandez, J
    Masgrau, E
    [J]. PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 241 - 244
  • [3] Performance of speaker localization using microphone array
    Visalakshi, R.
    Dhanalakshmi, P.
    Palanivel, S.
    [J]. INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2016, 19 (03) : 467 - 483
  • [4] Microphone array system for speech recognition
    Kiyohara, K
    Kaneda, Y
    Takahashi, S
    Nomura, H
    Kojima, J
    [J]. 1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS, 1997, : 215 - 218
  • [5] Speaker localization using microphone array in a reverberant room
    Zou, QY
    Rahardja, S
    Cai, ZB
    [J]. 2002 6TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS I AND II, 2002, : 354 - 357
  • [6] Robust Feature Combination for Speech Recognition Using Linear Microphone Array in a Car
    Obuchi, Yasunari
    Hataoka, Nobuo
    [J]. IN-VEHICLE CORPUS AND SIGNAL PROCESSING FOR DRIVER BEHAVIOR, 2009, : 187 - +
  • [7] Microphone array driven speech recognition:: Influence of localization on the word error rate
    Wölfel, M
    Nickel, K
    McDonough, J
    [J]. MACHINE LEARNING FOR MULTIMODAL INTERACTION, 2005, 3869 : 320 - 331
  • [8] Kinect microphone array-based speech and speaker recognition for the exhibition control of humanoid robots
    Ding, Ing-Jr
    Shi, Jia-Yi
    [J]. COMPUTERS & ELECTRICAL ENGINEERING, 2017, 62 : 719 - 729
  • [9] Microphone Array Processing for Distant Speech Recognition
    Kumatani, Kenichi
    McDonough, John
    Raj, Bhiksha
    [J]. IEEE SIGNAL PROCESSING MAGAZINE, 2012, 29 (06) : 127 - 140
  • [10] A DIGITAL MICROPHONE ARRAY FOR DISTANT SPEECH RECOGNITION
    Zwyssig, Erich
    Lincoln, Mike
    Renals, Steve
    [J]. 2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 5106 - 5109