Real-time Prototype for Integration of Blind Source Extraction and Robust Automatic Speech Recognition

被引:0
|
作者
Nesta, Francesco [1 ]
Matassoni, Marco [1 ]
Maganti, HariKrishna [1 ]
机构
[1] Fdn Bruno Kessler Irst, I-38123 Trento, Italy
关键词
blind source separation; speech enhancement; robust speech recognition;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This demo presents a real-time prototype for automatic blind source extraction and speech recognition in presence of multiple interfering noise sources. Binaural recorded mixtures are processed by a combined Blind/Semi-Blind Source Separation algorithm in order to obtain an estimation of the target signal. The recovered target signal is segmented and used as input to a real-time automatic speech recognition (ASR) system. Further, to improve the recognition performance, noise robust features based on Gammatone filters are used. The demo utilizes the data provided for the CHiME Pascal speech separation and recognition challenge and also real-time mixtures recorded onsite. Users will be able to listen to the recovered target signal and compare it with the original mixture and ASR output.
引用
收藏
页码:3350 / 3351
页数:2
相关论文
共 50 条
  • [1] Real-Time Integration of Dynamic Context Information for Improving Automatic Speech Recognition
    Oualil, Youssef
    Schulder, Marc
    Helmke, Hartmut
    Schmidt, Anna
    Klakow, Dietrich
    [J]. 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2107 - 2111
  • [2] Real-time blind source separation system with applications to distant speech recognition
    Ferreira, Alberto E. A.
    Alarcao, Diogo
    [J]. APPLIED ACOUSTICS, 2016, 113 : 170 - 184
  • [3] Real-Time Robust Automatic Speech Recognition Using Compact Support Vector Machines
    Solera-Urena, Ruben
    Isabel Garcia-Moral, Ana
    Pelaez-Moreno, Carmen
    Martinez-Ramon, Manel
    Diaz-de-Maria, Fernando
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2012, 20 (04): : 1347 - 1361
  • [4] Blind source extraction for robust speech recognition in multisource noisy environments
    Nesta, Francesco
    Matassoni, Marco
    [J]. COMPUTER SPEECH AND LANGUAGE, 2013, 27 (03): : 703 - 725
  • [5] SPEAKER REINFORCEMENT USING TARGET SOURCE EXTRACTION FOR ROBUST AUTOMATIC SPEECH RECOGNITION
    Zorila, Catalin
    Doddipatla, Rama
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 6297 - 6301
  • [6] Robust speech recognition in a high interference real room environment using Blind Speech Extraction
    Koutras, A
    Dermatas, E
    [J]. DSP 2002: 14TH INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING PROCEEDINGS, VOLS 1 AND 2, 2002, : 167 - 171
  • [7] A ROBUST AND REAL-TIME VISUAL SPEECH RECOGNITION FOR SMARTPHONE APPLICATION
    Song, Min Gyu
    Tariquzzamani, Md
    Kim, Jin Young
    Hwang, Seong Taek
    Chi, Seung Ho
    [J]. INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2012, 8 (04): : 2837 - 2853
  • [8] Lightweight Real-Time Recurrent Models for Speech Enhancement and Automatic Speech Recognition
    Dhahbi, Sami
    Saleem, Nasir
    Gunawan, Teddy Surya
    Bourouis, Sami
    Ali, Imad
    Trigui, Aymen
    Algarni, Abeer D.
    [J]. INTERNATIONAL JOURNAL OF INTERACTIVE MULTIMEDIA AND ARTIFICIAL INTELLIGENCE, 2024, 8 (06):
  • [9] REAL-TIME SPEECH RECOGNITION
    CAELEN, J
    CASTAN, S
    PERENNOU, G
    [J]. AUTOMATISME, 1972, 17 (03): : 87 - &
  • [10] A Robust Feature Extraction Method for Real-Time Speech Recognition System on a Raspberry Pi 3 Board
    Mnassri, Aymen
    Bennasr, Mohamed
    Adnane, Cherif
    [J]. ENGINEERING TECHNOLOGY & APPLIED SCIENCE RESEARCH, 2019, 9 (02) : 4066 - 4070