A hybrid genetic-neural front-end extension for robust speech recognition over telephone lines

被引：0

作者：

Selouani, Sid-Ahmed ^{[1
]}

Hamam, Habib ^{[2
]}

O'Shaughnessy, Douglas ^{[3
]}

机构：

[1] Univ Moncton, Campus Shippagan, Moncton, NB E8S1P6, Canada

[2] Univ Moncton, Moncton, NB E1A 3E9, Canada

[3] INRS Energie Materiaux telecommun, Montreal, PQ, Canada

来源：

ADVANCES IN NONLINEAR SPEECH PROCESSING | 2007年 / 4885卷

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper presents a hybrid technique combining the Karhonen-Loeve Transform (KLT), the Multilayer Perceptron (MLP) and Genetic Algorithms (GAs) to obtain less-variant Mel-frequency parameters. The advantages of such an approach are that the robustness can be reached without modifying the recognition system, and that neither assumption nor estimation of the noise are required. To evaluate the effectiveness of the proposed approach, an extensive set of continuous speech recognition experiments are carried out by using the NTIMIT telephone speech database. The results show that the proposed approach outperforms the baseline and conventional systems.

引用

页码：169 / +

页数：3

共 50 条

[1] A robust front-end for telephone speech recognition
Cho, HY
Chi, SM
Oh, YH
[J]. PRICAI'98: TOPICS IN ARTIFICIAL INTELLIGENCE, 1998, 1531 : 636 - 644
[2] A comparison of front-end configurations for robust speech recognition
Milner, B
[J]. 2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 797 - 800
[3] Investigation of Speech Separation as a Front-End for Noise Robust Speech Recognition
Narayanan, Arun
Wang, DeLiang
[J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2014, 22 (04) : 826 - 835
[4] A Front-End Speech Enhancement System for Robust Automotive Speech Recognition
Wang, Haikun
Ye, Zhongfu
Chen, Jingdong
[J]. 2018 11TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2018, : 1 - 5
[5] Robust Front-End Processing For Emotion Recognition In Noisy Speech
Pandharipande, Meghna
Chakraborty, Rupayan
Panda, Ashish
Kopparapu, Sunil Kumar
[J]. 2018 11TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2018, : 324 - 328
[6] Performance evaluation of front-end algorithms for robust speech recognition
Cheng, O
Abdulla, W
Salcic, Z
[J]. ISSPA 2005: THE 8TH INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND ITS APPLICATIONS, VOLS 1 AND 2, PROCEEDINGS, 2005, : 711 - 714
[7] ROBUST FRONT-END PROCESSING FOR SPEECH RECOGNITION IN NOISY CONDITIONS
Das, Biswajit
Panda, Ashish
[J]. 2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 5235 - 5239
[8] JOINT TRAINING OF FRONT-END AND BACK-END DEEP NEURAL NETWORKS FOR ROBUST SPEECH RECOGNITION
Gao, Tian
Du, Jun
Dai, Li-Rong
Lee, Chin-Hui
[J]. 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 4375 - 4379
[9] A Reassigned Front-End for Speech Recognition
Tryfou, Georgina
Omologo, Maurizio
[J]. 2017 25TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2017, : 553 - 557
[10] Enhanced Sparse Imputation Techniques for a Robust Speech Recognition Front-End
Tan, Qun Feng
Georgiou, Panayiotis G.
Narayanan, Shrikanth
[J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (08): : 2418 - 2429

← 1 2 3 4 5 →