A Reassigned Front-End for Speech Recognition

被引:0
|
作者
Tryfou, Georgina [1 ]
Omologo, Maurizio [1 ]
机构
[1] Fdn Bruno Kessler, Via Sommarive 18, Trento, Italy
关键词
TIME-FREQUENCY; REPRESENTATIONS; SCALE;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper introduces the use of the TFRCC features, a time-frequency reassigned feature set, as a front-end for speech recognition. Compared to the power spectrogram, the time-frequency reassigned version is particularly helpful in describing simultaneously the temporal and spectral features of speech signals, as it offers an improved visualization of the various components. This powerful attribute is exploited from the cepstral reassigned features, which are incorporated in a state-of-the-art speech recognizer. Experimental activities investigate the proposed features in various scenarios, starting from recognition of close-talk signals and gradually increasing the complexity of the task. The results prove the superiority of these features compared to a MFCC baseline.
引用
收藏
页码:553 / 557
页数:5
相关论文
共 50 条
  • [31] Listen with Intent: Improving Speech Recognition with Audio-to-Intent Front-End
    Ray, Swayambhu Nath
    Wu, Minhua
    Raju, Anirudh
    Ghahremani, Pegah
    Bilgi, Raghavendra
    Rao, Milind
    Arsikere, Harish
    Rastrow, Ariya
    Stolcke, Andreas
    Droppo, Jasha
    [J]. INTERSPEECH 2021, 2021, : 3455 - 3459
  • [32] A Unified Front-end Anti-interference Approach for Robust Automatic Speech Recognition
    Liang, Yunming
    Zhou, Yi
    Ma, Yongbao
    Liu, Hongqing
    [J]. 2019 IEEE 19TH INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY (ISSPIT 2019), 2019,
  • [33] Developing the ETSI Aurora advanced distributed speech recognition front-end & What next?
    Pearce, D
    [J]. ASRU 2001: IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, CONFERENCE PROCEEDINGS, 2001, : 131 - 134
  • [34] Comparing Front-End Enhancement Techniques and Multiconditioned Training for Robust Automatic Speech Recognition
    Soni, Meet H.
    Joshi, Sonal
    Panda, Ashish
    [J]. TEXT, SPEECH, AND DIALOGUE (TSD 2019), 2019, 11697 : 329 - 340
  • [35] Robust connected digit recognition using speech enhancement and an auditory model front-end
    Flynn, Ronan
    Jones, Edward
    [J]. 2007 6TH INTERNATIONAL CONFERENCE ON INFORMATION, COMMUNICATIONS & SIGNAL PROCESSING, VOLS 1-4, 2007, : 410 - +
  • [36] Efficient Noise-Robust Speech Recognition Front-End Based on the ETSI Standard
    Neves, Claudio
    Veiga, Arlindo
    Sa, Luis
    Perdigao, Fernando
    [J]. ICSP: 2008 9TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, VOLS 1-5, PROCEEDINGS, 2008, : 609 - 612
  • [37] Java']Java (J2ME) front-end for distributed speech recognition
    Zaykovskiy, Dmitry
    Schmitt, Alexander
    [J]. 21ST INTERNATIONAL CONFERENCE ON ADVANCED NETWORKING AND APPLICATIONS WORKSHOPS/SYMPOSIA, VOL 2, PROCEEDINGS, 2007, : 353 - +
  • [38] ADAPTIVE DIFFERENTIAL MICROPHONE ARRAYS USED AS A FRONT-END FOR AN AUTOMATIC SPEECH RECOGNITION SYSTEM
    Messner, Elmar
    Pessentheiner, Hannes
    Morales-Cordovilla, Juan A.
    Hagmueller, Martin
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 2689 - 2693
  • [39] Data-Driven Design of Front-End Filter Bank for Lombard Speech Recognition
    Boril, Hynek
    Fousek, Petr
    Pollak, Petr
    [J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 381 - 384
  • [40] A new approach to variable frame rate front-end processing for robust speech recognition
    Epps, J
    [J]. ISSPA 2005: The 8th International Symposium on Signal Processing and its Applications, Vols 1 and 2, Proceedings, 2005, : 723 - 726