Noise robust hands-free speech recognition using microphone array and Kalman filter as front-end system of conversational TV

被引:0
|
作者
Fujimoto, M [1 ]
Ariki, Y [1 ]
机构
[1] Ryukoku Univ, Dept Elect & Informat, Otsu, Shiga 5202194, Japan
关键词
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we investigate hands-free speech recognition as front-end system of conversational TV. The conversational TV is one of machine conversation systems to retrieve the interesting information by inquiring it to the TV. To realize the natural machine conversation without consciousness of microphone, hands-free speech recognition is required. In the hands-free speech recognition system, the directions of the arriving signal are estimated by using a microphone array and the desired signal is enhanced by beam forming. Then, the user utterance section is detected automatically from continuously observed signal. Furthermore, by applying the noise reduction and noise adaptation, the enhanced speech signal is recognized accurately.
引用
收藏
页码:268 / 271
页数:4
相关论文
共 38 条
  • [31] NOISE ADAPTIVE FRONT-END NORMALIZATION BASED ON VECTOR TAYLOR SERIES FOR DEEP NEURAL NETWORKS IN ROBUST SPEECH RECOGNITION
    Bo Li
    Chai, Khe Sim
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 7408 - 7412
  • [32] Hands-free continuous speech recognition in noise using a speaker beam-former based on spectrum-entropy
    George, N
    Evangelos, D
    [J]. 2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 889 - 892
  • [33] A novel noise robust front-end using first order VTS in construction of mel-warped wiener filter
    Su, Mu
    Li, Peng
    Wang, Zhuo
    Ding, Peng
    Xu, Bo
    [J]. 2006 IEEE International Conference on Acoustics, Speech and Signal Processing, Vols 1-13, 2006, : 777 - 780
  • [34] Using Twin-HMM-Based Audio-Visual Speech Enhancement as a Front-End for Robust Audio-Visual Speech Recognition
    Abdelaziz, Ahmed Hussen
    Zeiler, Steffen
    Kolossa, Dorothea
    [J]. 14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 867 - 871
  • [35] A portable analog front-end system for label-free sensing of proteins using nanowell array impedance sensors
    Muhammad Tayyab
    Pengfei Xie
    Muhammad Ahsan Sami
    Hassan Raji
    Zhongtian Lin
    Zhuolun Meng
    Seyed Reza Mahmoodi
    Mehdi Javanmard
    [J]. Scientific Reports, 12
  • [36] A portable analog front-end system for label-free sensing of proteins using nanowell array impedance sensors
    Tayyab, Muhammad
    Xie, Pengfei
    Sami, Muhammad Ahsan
    Raji, Hassan
    Lin, Zhongtian
    Meng, Zhuolun
    Mahmoodi, Seyed Reza
    Javanmard, Mehdi
    [J]. SCIENTIFIC REPORTS, 2022, 12 (01)
  • [37] Robust Front-End based on MVA and HEQ post-processing for Arabic Speech Recognition Using Hidden Markov Model Toolkit(HTK)
    Techini, Elhem
    Sakka, Zied
    Bouhlel, MedSalim
    [J]. 2017 IEEE/ACS 14TH INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS (AICCSA), 2017, : 815 - 820
  • [38] Microphone Array Post-filter in Frequency Domain for Speech Recognition Using Short-Time Log-Spectral Amplitude Estimator and Spectral Harmonic/Noise Classifier
    Salishev, Sergey
    Klotchkov, Ilya
    Barabanov, Andrey
    [J]. SPEECH AND COMPUTER, SPECOM 2017, 2017, 10458 : 525 - 534