Noise robust hands-free speech recognition using microphone array and Kalman filter as front-end system of conversational TV

被引:0
|
作者
Fujimoto, M [1 ]
Ariki, Y [1 ]
机构
[1] Ryukoku Univ, Dept Elect & Informat, Otsu, Shiga 5202194, Japan
关键词
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we investigate hands-free speech recognition as front-end system of conversational TV. The conversational TV is one of machine conversation systems to retrieve the interesting information by inquiring it to the TV. To realize the natural machine conversation without consciousness of microphone, hands-free speech recognition is required. In the hands-free speech recognition system, the directions of the arriving signal are estimated by using a microphone array and the desired signal is enhanced by beam forming. Then, the user utterance section is detected automatically from continuously observed signal. Furthermore, by applying the noise reduction and noise adaptation, the enhanced speech signal is recognized accurately.
引用
收藏
页码:268 / 271
页数:4
相关论文
共 38 条
  • [21] A noise-robust ASR front-end using Wiener filter constructed from MMSE estimation of clean speech and noise
    Wu, J
    Droppo, J
    Deng, L
    Acero, A
    ASRU'03: 2003 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING ASRU '03, 2003, : 321 - 326
  • [22] A noise robust front-end with low computational cost for embedded in-car speech recognition
    Ding, Pei
    He, Lei
    Yan, Xiang
    Zhao, Rui
    Hao, Jie
    2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 1045 - +
  • [23] A noise robust front-end using Wiener filter, probability model and CMS for ASR
    Xu, W
    Guo, YH
    Wang, BX
    Wang, XB
    Mai, ZF
    Proceedings of the 2005 IEEE International Conference on Natural Language Processing and Knowledge Engineering (IEEE NLP-KE'05), 2005, : 102 - 105
  • [24] Robust connected digit recognition using speech enhancement and an auditory model front-end
    Flynn, Ronan
    Jones, Edward
    2007 6TH INTERNATIONAL CONFERENCE ON INFORMATION, COMMUNICATIONS & SIGNAL PROCESSING, VOLS 1-4, 2007, : 410 - +
  • [25] Incorporating a Generative Front-end Layer to Deep Neural Network for Noise Robust Automatic Speech Recognition
    Kundu, Souvik
    Sim, Khe Chai
    Gales, Mark
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 2359 - 2363
  • [26] A study of mutual front-end processing method based on statistical model for noise robust speech recognition
    Fujimoto, Masakiyo
    Ishizuka, Kentaro
    Nakatani, Tomohiro
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 1251 - 1254
  • [27] A Multichannel Noise Reduction Front-end based on psychoacoustics for robust speech recognition in highly noisy environments
    Cifani, Simone
    Principi, Emanuele
    Rocchi, Cesare
    Squartini, Stefano
    Piazza, Francesco
    2008 HANDS-FREE SPEECH COMMUNICATION AND MICROPHONE ARRAYS, 2008, : 173 - 176
  • [28] Robust automatic speech recognition using a multi-channel signal separation front-end
    Yen, KC
    Zhao, YX
    ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1337 - 1340
  • [29] OPTIMIZED JOINT NOISE SUPPRESSION AND DEREVERBERATION BASED ON BLIND SIGNAL EXTRACTION FOR HANDS-FREE SPEECH RECOGNITION SYSTEM
    Aprilyanti, Fine D.
    Saruwatari, Hiroshi
    Nakamura, Satoshi
    Takatani, Tomoya
    2014 4TH JOINT WORKSHOP ON HANDS-FREE SPEECH COMMUNICATION AND MICROPHONE ARRAYS (HSCMA), 2014, : 182 - 186
  • [30] Robust speech recognition system using multi-parameter bidirectional Kalman filter
    Goh Y.-H.
    Goh Y.-L.
    Lee Y.-K.
    Ko Y.-H.
    Goh, Yeh-Huann (gohyh@acd.tarc.edu.my), 1600, Springer Science and Business Media, LLC (20): : 455 - 463