A Multichannel Noise Reduction Front-end based on psychoacoustics for robust speech recognition in highly noisy environments

被引:0
|
作者
Cifani, Simone [1 ]
Principi, Emanuele [1 ]
Rocchi, Cesare [1 ]
Squartini, Stefano [1 ]
Piazza, Francesco [1 ]
机构
[1] Univ Politecn Marche, DEIT, MediaLabs 3, I-60131 Ancona, Italy
关键词
Multichannel Noise Reduction Front-end; psychoacoustics; Automatic Speech Recognition; Sphinx-4 open source ASR;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Microphone array systems, due to their spatial filtering capability, usually overcome the traditional mono approaches in noise reduction. Moreover, the employment of psychoacoustically motivated speech enhancement schemes typically allows to achieve a good balance between noise reduction and speech distortion. This drove some of the authors to merge the two advantageous aspects into a unique solution, allowing to achieve relevant performances in terms of enhanced speech quality in a wide range of operating conditions. Now, in this paper, the objective is assessing the effectiveness of the approach when applied as Noise Reduction Front-end to an Automatic Speech Recognition system working in adverse acoustic environments. Some computer simulations have been carried out and they show that a significant improvement of recognition rate is registered when such front-end is used, also w.r.t. the performances achievable when another Multichannel Noise Reduction architecture, not based on psychoacoustics concepts, is adopted on purpose.
引用
收藏
页码:173 / 176
页数:4
相关论文
共 50 条
  • [1] ROBUST FRONT-END PROCESSING FOR SPEECH RECOGNITION IN NOISY CONDITIONS
    Das, Biswajit
    Panda, Ashish
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 5235 - 5239
  • [2] Robust Front-End Processing For Emotion Recognition In Noisy Speech
    Pandharipande, Meghna
    Chakraborty, Rupayan
    Panda, Ashish
    Kopparapu, Sunil Kumar
    [J]. 2018 11TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2018, : 324 - 328
  • [3] Investigation of Speech Separation as a Front-End for Noise Robust Speech Recognition
    Narayanan, Arun
    Wang, DeLiang
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2014, 22 (04) : 826 - 835
  • [4] Robust front-end for speech recognition by human and machine in noisy reverberant environments: the effect of phase information
    Liu, Yang
    Nower, Naushin
    Morita, Shota
    Unoki, Masashi
    [J]. 2016 10TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2016,
  • [5] Advanced Front-end for Robust Speech Recognition in Extremely Adverse Environments
    Dimitriadis, Dimitrios
    Segura, Jose C.
    Garcia, Luz
    Potamianos, Alexandros
    Maragos, Petros
    Pitsikalis, Vassilis
    [J]. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 2221 - +
  • [6] Front-End Feature Compensation for Noise Robust Speech Emotion Recognition
    Pandharipande, Meghna
    Chakraborty, Rupayan
    Panda, Ashish
    Das, Biswajit
    Kopparapu, Sunil Kumar
    [J]. 2019 27TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2019,
  • [7] Efficient Noise-Robust Speech Recognition Front-End Based on the ETSI Standard
    Neves, Claudio
    Veiga, Arlindo
    Sa, Luis
    Perdigao, Fernando
    [J]. ICSP: 2008 9TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, VOLS 1-5, PROCEEDINGS, 2008, : 609 - 612
  • [8] A Front-End Technique for Automatic Noisy Speech Recognition
    Naing, Hay Mar Soe
    Hidayat, Risanuri
    Hartanto, Rudy
    Miyanaga, Yoshikazu
    [J]. PROCEEDINGS OF 2020 23RD CONFERENCE OF THE ORIENTAL COCOSDA INTERNATIONAL COMMITTEE FOR THE CO-ORDINATION AND STANDARDISATION OF SPEECH DATABASES AND ASSESSMENT TECHNIQUES (ORIENTAL-COCOSDA 2020), 2020, : 49 - 54
  • [9] A robust front-end for telephone speech recognition
    Cho, HY
    Chi, SM
    Oh, YH
    [J]. PRICAI'98: TOPICS IN ARTIFICIAL INTELLIGENCE, 1998, 1531 : 636 - 644
  • [10] A Speech Enhancement Front-End for Intent Classification in Noisy Environments
    Ali, Mohamed Nabih
    Schmalz, Veronica Juliana
    Brutti, Alessio
    Falavigna, Daniele
    [J]. 29TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2021), 2021, : 471 - 475