Real-time Prototype for Integration of Blind Source Extraction and Robust Automatic Speech Recognition

被引：0

作者：

Nesta, Francesco ^{[1
]}

Matassoni, Marco ^{[1
]}

Maganti, HariKrishna ^{[1
]}

机构：

[1] Fdn Bruno Kessler Irst, I-38123 Trento, Italy

来源：

12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5 | 2011年

关键词：

blind source separation; speech enhancement; robust speech recognition;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This demo presents a real-time prototype for automatic blind source extraction and speech recognition in presence of multiple interfering noise sources. Binaural recorded mixtures are processed by a combined Blind/Semi-Blind Source Separation algorithm in order to obtain an estimation of the target signal. The recovered target signal is segmented and used as input to a real-time automatic speech recognition (ASR) system. Further, to improve the recognition performance, noise robust features based on Gammatone filters are used. The demo utilizes the data provided for the CHiME Pascal speech separation and recognition challenge and also real-time mixtures recorded onsite. Users will be able to listen to the recovered target signal and compare it with the original mixture and ASR output.

引用

页码：3350 / 3351

页数：2

共 50 条

[1] Real-Time Integration of Dynamic Context Information for Improving Automatic Speech Recognition
Oualil, Youssef
Schulder, Marc
Helmke, Hartmut
Schmidt, Anna
Klakow, Dietrich
[J]. 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2107 - 2111
[2] Real-time blind source separation system with applications to distant speech recognition
Ferreira, Alberto E. A.
Alarcao, Diogo
[J]. APPLIED ACOUSTICS, 2016, 113 : 170 - 184
[3] Real-Time Robust Automatic Speech Recognition Using Compact Support Vector Machines
Solera-Urena, Ruben
Isabel Garcia-Moral, Ana
Pelaez-Moreno, Carmen
Martinez-Ramon, Manel
Diaz-de-Maria, Fernando
[J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2012, 20 (04): : 1347 - 1361
[4] Blind source extraction for robust speech recognition in multisource noisy environments
Nesta, Francesco
Matassoni, Marco
[J]. COMPUTER SPEECH AND LANGUAGE, 2013, 27 (03): : 703 - 725
[5] SPEAKER REINFORCEMENT USING TARGET SOURCE EXTRACTION FOR ROBUST AUTOMATIC SPEECH RECOGNITION
Zorila, Catalin
Doddipatla, Rama
[J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 6297 - 6301
[6] Robust speech recognition in a high interference real room environment using Blind Speech Extraction
Koutras, A
Dermatas, E
[J]. DSP 2002: 14TH INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING PROCEEDINGS, VOLS 1 AND 2, 2002, : 167 - 171
[7] A ROBUST AND REAL-TIME VISUAL SPEECH RECOGNITION FOR SMARTPHONE APPLICATION
Song, Min Gyu
Tariquzzamani, Md
Kim, Jin Young
Hwang, Seong Taek
Chi, Seung Ho
[J]. INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2012, 8 (04): : 2837 - 2853
[8] Lightweight Real-Time Recurrent Models for Speech Enhancement and Automatic Speech Recognition
Dhahbi, Sami
Saleem, Nasir
Gunawan, Teddy Surya
Bourouis, Sami
Ali, Imad
Trigui, Aymen
Algarni, Abeer D.
[J]. INTERNATIONAL JOURNAL OF INTERACTIVE MULTIMEDIA AND ARTIFICIAL INTELLIGENCE, 2024, 8 (06):
[9] REAL-TIME SPEECH RECOGNITION
CAELEN, J
CASTAN, S
PERENNOU, G
[J]. AUTOMATISME, 1972, 17 (03): : 87 - &
[10] A Robust Feature Extraction Method for Real-Time Speech Recognition System on a Raspberry Pi 3 Board
Mnassri, Aymen
Bennasr, Mohamed
Adnane, Cherif
[J]. ENGINEERING TECHNOLOGY & APPLIED SCIENCE RESEARCH, 2019, 9 (02) : 4066 - 4070

← 1 2 3 4 5 →