A hybrid genetic-neural front-end extension for robust speech recognition over telephone lines

Cited by: 0

Authors:

Selouani, Sid-Ahmed [1]

Hamam, Habib [2]

O'Shaughnessy, Douglas [3]

Affiliations:

[1] Univ Moncton, Campus Shippagan, Moncton, NB E8S1P6, Canada

[2] Univ Moncton, Moncton, NB E1A 3E9, Canada

[3] INRS Energie Materiaux telecommun, Montreal, PQ, Canada

Source:

ADVANCES IN NONLINEAR SPEECH PROCESSING | 2007 / Vol. 4885

Keywords:

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

This paper presents a hybrid technique combining the Karhunen-Loève Transform (KLT), the Multilayer Perceptron (MLP) and Genetic Algorithms (GAs) to obtain less-variant Mel-frequency parameters. The advantages of this approach are that robustness can be achieved without modifying the recognition system, and that neither assumptions about the noise nor an estimate of it are required. To evaluate the effectiveness of the proposed approach, an extensive set of continuous speech recognition experiments is carried out using the NTIMIT telephone speech database. The results show that the proposed approach outperforms the baseline and conventional systems.


Pages: 169 / +

Page count: 3


