Noise robust hands-free speech recognition using microphone array and Kalman filter as front-end system of conversational TV

被引：0

作者：

Fujimoto, M ^{[1
]}

Ariki, Y ^{[1
]}

机构：

[1] Ryukoku Univ, Dept Elect & Informat, Otsu, Shiga 5202194, Japan

来源：

PROCEEDINGS OF THE 2002 IEEE WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING | 2002年

关键词：

D O I：

暂无

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

In this paper, we investigate hands-free speech recognition as front-end system of conversational TV. The conversational TV is one of machine conversation systems to retrieve the interesting information by inquiring it to the TV. To realize the natural machine conversation without consciousness of microphone, hands-free speech recognition is required. In the hands-free speech recognition system, the directions of the arriving signal are estimated by using a microphone array and the desired signal is enhanced by beam forming. Then, the user utterance section is detected automatically from continuously observed signal. Furthermore, by applying the noise reduction and noise adaptation, the enhanced speech signal is recognized accurately.

引用

页码：268 / 271

页数：4

共 38 条

[31] NOISE ADAPTIVE FRONT-END NORMALIZATION BASED ON VECTOR TAYLOR SERIES FOR DEEP NEURAL NETWORKS IN ROBUST SPEECH RECOGNITION
Bo Li
Chai, Khe Sim
2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 7408 - 7412
[32] Hands-free continuous speech recognition in noise using a speaker beam-former based on spectrum-entropy
George, N
Evangelos, D
2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 889 - 892
[33] A novel noise robust front-end using first order VTS in construction of mel-warped wiener filter
Su, Mu
Li, Peng
Wang, Zhuo
Ding, Peng
Xu, Bo
2006 IEEE International Conference on Acoustics, Speech and Signal Processing, Vols 1-13, 2006, : 777 - 780
[34] Using Twin-HMM-Based Audio-Visual Speech Enhancement as a Front-End for Robust Audio-Visual Speech Recognition
Abdelaziz, Ahmed Hussen
Zeiler, Steffen
Kolossa, Dorothea
14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 867 - 871
[35] A portable analog front-end system for label-free sensing of proteins using nanowell array impedance sensors
Muhammad Tayyab
Pengfei Xie
Muhammad Ahsan Sami
Hassan Raji
Zhongtian Lin
Zhuolun Meng
Seyed Reza Mahmoodi
Mehdi Javanmard
Scientific Reports, 12
[36] A portable analog front-end system for label-free sensing of proteins using nanowell array impedance sensors
Tayyab, Muhammad
Xie, Pengfei
Sami, Muhammad Ahsan
Raji, Hassan
Lin, Zhongtian
Meng, Zhuolun
Mahmoodi, Seyed Reza
Javanmard, Mehdi
SCIENTIFIC REPORTS, 2022, 12 (01)
[37] Robust Front-End based on MVA and HEQ post-processing for Arabic Speech Recognition Using Hidden Markov Model Toolkit(HTK)
Techini, Elhem
Sakka, Zied
Bouhlel, MedSalim
2017 IEEE/ACS 14TH INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS (AICCSA), 2017, : 815 - 820
[38] Microphone Array Post-filter in Frequency Domain for Speech Recognition Using Short-Time Log-Spectral Amplitude Estimator and Spectral Harmonic/Noise Classifier
Salishev, Sergey
Klotchkov, Ilya
Barabanov, Andrey
SPEECH AND COMPUTER, SPECOM 2017, 2017, 10458 : 525 - 534

← 1 2 3 4 →