A Reassigned Front-End for Speech Recognition

Cited by: 0

Authors:

Tryfou, Georgina [1]

Omologo, Maurizio [1]

Affiliations:

[1] Fdn Bruno Kessler, Via Sommarive 18, Trento, Italy

Source:

2017 25TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO) | 2017

Keywords:

TIME-FREQUENCY; REPRESENTATIONS; SCALE;

DOI:

Not available

Chinese Library Classification (CLC) number:

TM [Electrical engineering]; TN [Electronics and communication technology];

Discipline classification code:

0808 ; 0809 ;

Abstract:

This paper introduces the TFRCC features, a time-frequency reassigned feature set, as a front-end for speech recognition. Compared to the power spectrogram, the time-frequency reassigned version is particularly helpful for describing the temporal and spectral features of speech signals simultaneously, as it offers an improved visualization of the various components. This property is exploited by the cepstral reassigned features, which are incorporated in a state-of-the-art speech recognizer. The experiments investigate the proposed features in various scenarios, starting from recognition of close-talk signals and gradually increasing the complexity of the task. The results show that these features outperform an MFCC baseline.

Pages: 553-557

Number of pages: 5
