AN INVESTIGATION ON DNN-DERIVED BOTTLENECK FEATURES FOR GMM-HMM BASED ROBUST SPEECH RECOGNITION

Cited by: 0
Authors
You, Yongbin [1]
Qian, Yanmin [1]
He, Tianxing [1]
Yu, Kai [1]
Affiliations
[1] Shanghai Jiao Tong Univ, Key Lab Shanghai Educ Commiss Intelligent Interac, Dept Comp Sci & Engn, Shanghai, Peoples R China
Keywords
deep neural network; bottleneck feature; node-pruning; robust speech recognition
DOI
Not available
Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronic technology, communication technology]
Discipline classification code
0808; 0809
Abstract
In recent years, deep neural networks (DNNs) have achieved great success as acoustic models in speech recognition. An important application of DNNs is deriving bottleneck features. In this paper, we first investigate the robustness of bottleneck features generated by three types of DNN structures on the Aurora 4 task without any explicit noise compensation. Second, we propose a node-pruning method to reconstruct the DNN, which generates a new type of deep bottleneck feature. Our experiments show that bottleneck features generated from the node-pruned DNN achieve a promising reduction in word error rate (WER) compared with bottleneck features produced by conventional DNN structures. In addition, the node-pruned DNN structure automatically yields the compact layer that generates the bottleneck feature.
Pages: 30-34
Page count: 5
Related papers
50 in total
  • [1] A Scalable Approach to Using DNN-Derived Features in GMM-HMM Based Acoustic Modeling For LVCSR
    Yan, Zhi-Jie
    Huo, Qiang
    Xu, Jian
    [J]. 14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013: 104-108
  • [2] Phoneme and Word Based Model for Tamil Speech Recognition using GMM-HMM
    Karpagavalli, S.
    Chandra, E.
    [J]. ICACCS 2015 PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING & COMMUNICATION SYSTEMS, 2015
  • [3] Comparison of acoustical models of GMM-HMM based for speech recognition in Hindi using PocketSphinx
    Manasa, Chadalavada Sai
    Priya, K. Jeeva
    Gupta, Deepa
    [J]. PROCEEDINGS OF THE 2019 3RD INTERNATIONAL CONFERENCE ON COMPUTING METHODOLOGIES AND COMMUNICATION (ICCMC 2019), 2019: 534-539
  • [4] Visual Speech Recognition Using PCA Networks and LSTMs in a Tandem GMM-HMM System
    Zimmermann, Marina
    Ghazi, Mostafa Mehdipour
    Ekenel, Hazim Kemal
    Thiran, Jean-Philippe
    [J]. COMPUTER VISION - ACCV 2016 WORKSHOPS, PT II, 2017, 10117: 264-276
  • [5] Motion intent recognition of intelligent lower limb prosthesis based on GMM-HMM
    Sheng, Min
    Liu, Shuangqing
    Wang, Jie
    Su, Benyue
    [J]. Yi Qi Yi Biao Xue Bao/Chinese Journal of Scientific Instrument, 2019, 40(05): 169-178
  • [6] CSI-Based Human Continuous Activity Recognition Using GMM-HMM
    Cheng, Xiaoyan
    Huang, Binke
    [J]. IEEE SENSORS JOURNAL, 2022, 22(19): 18709-18717
  • [7] Comparison and combination of features in a hybrid HMM/MLP and a HMM/GMM speech recognition system
    Pujol, P
    Pol, S
    Nadeu, C
    Hagen, A
    Bourlard, H
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2005, 13(01): 14-22
  • [8] Integration of Articulatory Knowledge and Voicing Features Based on DNN/HMM for Mandarin Speech Recognition
    Tan, Ying-Wei
    Liu, Wen-Ju
    Jiang, Wei
    Zheng, Hao
    [J]. 2015 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2015
  • [9] Research on Voiceprint recognition method of buried drainage pipe based on MFCC and GMM-HMM
    Yang, Jiarui
    Feng, Zao
    Wu, Jiande
    Fan, Yugang
    [J]. PROCEEDINGS OF THE 33RD CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2021), 2021: 645-650
  • [10] Contaminated speech training methods for robust DNN-HMM distant speech recognition
    Ravanelli, Mirco
    Omologo, Maurizio
    [J]. 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015: 756-760