Advanced Front-end for Robust Speech Recognition in Extremely Adverse Environments

Cited by: 0

Authors:

Dimitriadis, Dimitrios [1]

Segura, Jose C. [2]

Garcia, Luz [2]

Potamianos, Alexandros [3]

Maragos, Petros [1]

Pitsikalis, Vassilis [1]

Affiliations:

[1] Natl Tech Univ Athens, Sch ECE, GR-15773 Athens, Greece

[2] Univ Granada, Dept Teoria Senal Telemat & Comunicac TSTC, Granada, Spain

[3] Tech Univ Crete, Dept ECE, Khania 73100, Greece

Source:

INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4 | 2007

Keywords:

Speech Recognition; Nonlinear Features; Parameter Equalization; Noise Suppression; Noise Invariant Features;

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial Intelligence Theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

In this paper, a unified approach to speech enhancement, feature extraction and feature normalization for speech recognition in adverse recording conditions is presented. The proposed front-end system consists of several different, independent, processing modules. Each of the algorithms contained in these modules has been independently applied to the problem of speech recognition in noise, significantly improving the recognition rates. In this work, these algorithms are merged in a single front-end and their combined performance is demonstrated. Specifically, the proposed advanced front-end extracts noise-invariant features via the following modules: Wiener filtering, voice-activity detection, robust feature extraction (nonlinear modulation or fractal features), parameter equalization and frame-dropping. The advanced front-end is applied to extremely adverse environments where most feature extraction schemes fail. We show that by combining speech enhancement, robust feature extraction and feature normalization up to a fivefold error rate reduction can be achieved for certain tasks.
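To make the module chain in the abstract concrete, here is a minimal Python sketch of a toy front-end. It is not the authors' implementation. Everything in it is an assumption made for illustration: the function names, the frame sizes, the energy-threshold voice-activity detector, the two-band log-energy features (a placeholder for the nonlinear modulation or fractal features), and per-utterance mean/variance normalization as a simple stand-in for parameter equalization. Wiener filtering is omitted, and the ordering simply follows the order in which the abstract lists the modules.

```python
import numpy as np

def frame_signal(x, frame_len=200, hop=80):
    """Split a 1-D signal into overlapping frames, one frame per row."""
    n_frames = 1 + (len(x) - frame_len) // hop
    idx = np.arange(frame_len)[None, :] + hop * np.arange(n_frames)[:, None]
    return x[idx]

def energy_vad(frames, threshold_db=-30.0):
    """Hypothetical VAD: a frame counts as speech if its energy is
    within threshold_db of the loudest frame in the utterance."""
    energy_db = 10.0 * np.log10(np.mean(frames ** 2, axis=1) + 1e-12)
    return energy_db > (energy_db.max() + threshold_db)

def toy_features(frames):
    """Placeholder features: log energy in the lower and upper half
    of each frame's power spectrum (not the paper's features)."""
    spec = np.abs(np.fft.rfft(frames, axis=1)) ** 2
    half = spec.shape[1] // 2
    low = spec[:, :half].sum(axis=1)
    high = spec[:, half:].sum(axis=1)
    return np.log(np.stack([low, high], axis=1) + 1e-12)

def normalize(feats):
    """Per-utterance mean/variance normalization, used here as a
    simple stand-in for parameter equalization."""
    return (feats - feats.mean(axis=0)) / (feats.std(axis=0) + 1e-12)

def front_end(x):
    """Chain the toy modules: VAD, features, normalization, frame-dropping."""
    frames = frame_signal(x)
    speech = energy_vad(frames)
    feats = normalize(toy_features(frames))
    return feats[speech], speech  # frame-dropping: keep speech frames only

# Usage: low-level noise, then a louder segment, then low-level noise again.
rng = np.random.default_rng(0)
quiet = 0.001 * rng.standard_normal(1600)
loud = rng.standard_normal(1600)
feats, speech = front_end(np.concatenate([quiet, loud, quiet]))
```

In this toy setup the quiet frames sit far below the threshold and are dropped, so `feats` contains one row per retained frame. A real system would replace each stage with the corresponding module from the paper.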


Pages: 2221 / +

Page count: 2
