MLP Based Hierarchical System for Task Adaptation in ASR

被引:4
|
作者
Pinto, Joel [1 ]
Magimai-Doss, Mathew [1 ]
Bourlard, Herve [1 ]
机构
[1] Idiap Res Inst, Martigny, Switzerland
关键词
D O I
10.1109/ASRU.2009.5373383
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We investigate a multilayer perceptron (MLP) based hierarchical approach for task adaptation in automatic speech recognition. The system consists of two MLP classifiers in tandem. A well-trained MLP available off-the-shelf is used at the first stage of the hierarchy. A second MLP is trained on the posterior features estimated by the first, but with a long temporal context of around 130 ms. By using an MLP trained on 232 hours of conversational telephone speech, the hierarchical adaptation approach yields a word error rate of 1.8% on the 600-word Phonebook isolated word recognition task. This compares favorably to the error rate of 4% obtained by the conventional single MLP based system trained with the same amount of Phonebook data that is used for adaptation. The proposed adaptation scheme also benefits from the ability of the second MLP to model the temporal information in the posterior features.
引用
收藏
页码:365 / 370
页数:6
相关论文
共 50 条
  • [1] ENHANCE RNNLMS WITH HIERARCHICAL MULTI-TASK LEARNING FOR ASR
    Song, Minguang
    Zhao, Yunxin
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 6102 - 6106
  • [2] Generalized hierarchical search in the ISIP ASR system
    Jelinek, B
    Zheng, F
    Parihar, N
    Hamaker, J
    Picone, J
    CONFERENCE RECORD OF THE THIRTY-FIFTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS AND COMPUTERS, VOLS 1 AND 2, 2001, : 1553 - 1556
  • [3] An efficient robust ASR system based on the combination of speech enhancement and HMM adaptation
    Ding, Pei
    Cao, Zhigang
    2002, Chinese Institute of Electronics (11):
  • [4] Implementing PCA-based speaker adaptation methods in a Persian ASR system
    Ansari Z.
    Almasganj F.
    2010 5th International Symposium on Telecommunications, IST 2010, 2010, : 769 - 774
  • [5] An efficient robust ASR system based on the combination of speech enhancement and HMM adaptation
    Ding, P
    Cao, ZG
    CHINESE JOURNAL OF ELECTRONICS, 2002, 11 (03): : 422 - 425
  • [6] MULTILINGUAL ADAPTATION OF RNN BASED ASR SYSTEMS
    Mueller, Markus
    Stueker, Sebastian
    Waibel, Alex
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5219 - 5223
  • [7] Reactive Task Adaptation Based on Hierarchical Constraints Classification for Safe Industrial Robots
    Ceriani, Nicola Maria
    Zanchettin, Andrea Maria
    Rocco, Paolo
    Stolt, Andreas
    Robertsson, Anders
    IEEE-ASME TRANSACTIONS ON MECHATRONICS, 2015, 20 (06) : 2935 - 2949
  • [8] Reverse Correlation for Analyzing MLP Posterior Features in ASR
    Pinto, Joel
    Sivaram, Garimella S. V. S.
    Hermansky, Hynek
    TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2008, 5246 : 469 - 476
  • [9] Severity Based Adaptation for ASR to Aid Dysarthric Speakers
    Al-Qatab, Bassam Ali
    Mustafa, Mumtaz Begum
    Salim, Siti Salwah
    ASIA MODELLING SYMPOSIUM 2014 (AMS 2014), 2014, : 165 - 169
  • [10] Combat task-system function mapping method based on hierarchical task network
    Yi K.
    Zhang J.
    Jiao Z.
    Wang Z.
    Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics, 2023, 45 (10): : 3183 - 3191