A Reassigned Front-End for Speech Recognition

被引：0

作者：

Tryfou, Georgina ^{[1
]}

Omologo, Maurizio ^{[1
]}

机构：

[1] Fdn Bruno Kessler, Via Sommarive 18, Trento, Italy

来源：

2017 25TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO) | 2017年

关键词：

TIME-FREQUENCY; REPRESENTATIONS; SCALE;

D O I：

暂无

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

This paper introduces the use of the TFRCC features, a time-frequency reassigned feature set, as a front-end for speech recognition. Compared to the power spectrogram, the time-frequency reassigned version is particularly helpful in describing simultaneously the temporal and spectral features of speech signals, as it offers an improved visualization of the various components. This powerful attribute is exploited from the cepstral reassigned features, which are incorporated in a state-of-the-art speech recognizer. Experimental activities investigate the proposed features in various scenarios, starting from recognition of close-talk signals and gradually increasing the complexity of the task. The results prove the superiority of these features compared to a MFCC baseline.

引用

页码：553 / 557

页数：5

共 50 条

[21] Advanced Front-end for Robust Speech Recognition in Extremely Adverse Environments
Dimitriadis, Dimitrios
Segura, Jose C.
Garcia, Luz
Potamianos, Alexandros
Maragos, Petros
Pitsikalis, Vassilis
[J]. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 2221 - +
[22] Combined Software/hardware implementation of a filterbank front-end for speech recognition
Mouchtaris, A
Cao, Y
Khan, S
Van der Spiegel, J
[J]. 2005 IEEE WORKSHOP ON SIGNAL PROCESSING SYSTEMS - DESIGN AND IMPLEMENTATION (SIPS), 2005, : 436 - 441
[23] Robust Front-End based on MVA processing for Arabic Speech Recognition
Techini, Elhem
Sakka, Zied
Bouhlel, MedSalim
[J]. 2017 INTERNATIONAL CONFERENCE ON ENGINEERING & MIS (ICEMIS), 2017,
[24] Front-End Feature Compensation for Noise Robust Speech Emotion Recognition
Pandharipande, Meghna
Chakraborty, Rupayan
Panda, Ashish
Das, Biswajit
Kopparapu, Sunil Kumar
[J]. 2019 27TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2019,
[25] Implementation of The MFCC Front-end for Low-cost Speech Recognition Systems
Vu, Ngoc-Vinh
Whittington, Jim
Ye, Hua
Devlin, John
[J]. 2010 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, 2010, : 2334 - 2337
[26] A noise-robust front-end for distributed speech recognition in mobile communications
Addou, Djamel
Selouani, Sid-Ahmed
Kifaya, Kaoukeb
Boudraa, Malika
Boudraa, Bachir
[J]. INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2007, 10 (04) : 167 - 173
[27] Feature enhancement for a bitstream-based front-end in wireless speech recognition
Kim, HK
Cox, RV
[J]. 2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING, 2001, : 241 - 244
[28] Investigation into a Mel subspace based front-end processing for robust speech recognition
Selouani, SA
O'Shaughnessy, D
[J]. Proceedings of the Fourth IEEE International Symposium on Signal Processing and Information Technology, 2004, : 187 - 190
[29] Front-end Feature Compensation and Denoising for Noise Robust Speech Emotion Recognition
Chakraborty, Rupayan
Panda, Ashish
Pandharipande, Meghna
Joshi, Sonal
Kopparapu, Sunil Kumar
[J]. INTERSPEECH 2019, 2019, : 3257 - 3261
[30] MULTICHANNEL AUDIO FRONT-END FOR FAR-FIELD AUTOMATIC SPEECH RECOGNITION
Chhetri, Amit
Hilmes, Philip
Kristjansson, Trausti
Chu, Wai
Mansour, Mohamed
Li, Xiaoxue
Zhang, Xianxian
[J]. 2018 26TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2018, : 1527 - 1531

← 1 2 3 4 5 →