Distant Speech Recognition Using a Microphone Array Network

被引:3
|
作者
Nakano, Alberto Yoshihiro [1 ]
Nakagawa, Seiichi [1 ]
Yamamoto, Kazumasa [1 ]
机构
[1] Toyohashi Univ Technol, Dept Informat & Comp Sci, Toyohashi, Aichi 4418580, Japan
来源
关键词
distant speech recognition; microphone array network; GMM-based CMN; speaker's position and orientation estimation; POSITION;
D O I
10.1587/transinf.E93.D.2451
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this work, spatial information consisting of the position and orientation angle of an acoustic source is estimated by an artificial neural network (ANN). The estimated position of a speaker in an enclosed space is used to refine the estimated time delays for a delay-and-sum beamformer, thus enhancing the output signal. On the other hand, the orientation angle is used to restrict the lexicon used in the recognition phase, assuming that the speaker faces a particular direction while speaking. To compensate the effect of the transmission channel inside a short frame analysis window, a new cepstral mean normalization (CMN) method based on a Gaussian mixture model (GMM) is investigated and shows better performance than the conventional CMN for short utterances. The performance of the proposed method is evaluated through Japanese digit/command recognition experiments.
引用
收藏
页码:2451 / 2462
页数:12
相关论文
共 50 条
  • [1] A DIGITAL MICROPHONE ARRAY FOR DISTANT SPEECH RECOGNITION
    Zwyssig, Erich
    Lincoln, Mike
    Renals, Steve
    [J]. 2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 5106 - 5109
  • [2] Microphone Array Processing for Distant Speech Recognition
    Kumatani, Kenichi
    McDonough, John
    Raj, Bhiksha
    [J]. IEEE SIGNAL PROCESSING MAGAZINE, 2012, 29 (06) : 127 - 140
  • [3] Microphone Array Processing for Distant Speech Recognition: Spherical Arrays
    McDonough, John
    Kumatani, Kenichi
    Raj, Bhiksha
    [J]. 2012 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2012,
  • [4] HMM adaptation and microphone array processing for distant speech recognition
    Kleban, J
    Gong, YF
    [J]. 2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 1411 - 1414
  • [5] DISTANT SPEECH RECOGNITION IN REVERBERANT NOISY CONDITIONS EMPLOYING A MICROPHONE ARRAY
    Morales-Cordovilla, Juan A.
    Hagmueller, Martin
    Pessentheiner, Hannes
    Kubin, Gernot
    [J]. 2014 PROCEEDINGS OF THE 22ND EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2014, : 2380 - 2384
  • [6] Speech Enhancement Using Compact Microphone Array and Applications in Distant Speech Acquisition
    Zhang Heng
    Fu Qiang
    Yan Yonghong
    [J]. CHINESE JOURNAL OF ELECTRONICS, 2009, 18 (03) : 481 - 486
  • [7] Microphone Array Processing Strategies for Distant-Based Automatic Speech Recognition
    Khoubrouy, Soudeh A.
    Hansen, John H. L.
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2016, 23 (10) : 1344 - 1348
  • [8] Microphone Array Processing for Distant Speech Recognition: Towards Real-World Deployment
    Kumatani, Kenichi
    Arakawa, Takayuki
    Yamamoto, Kazumasa
    McDonough, John
    Raj, Bhiksha
    Singh, Rita
    Tashev, Ivan
    [J]. 2012 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2012,
  • [9] Microphone array system for speech recognition
    Kiyohara, K
    Kaneda, Y
    Takahashi, S
    Nomura, H
    Kojima, J
    [J]. 1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS, 1997, : 215 - 218
  • [10] Distant-talking speech recognition based on a 3-D Viterbi search using a microphone array
    Yamada, T
    Nakamura, S
    Shikano, K
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2002, 10 (02): : 48 - 56