Improved Bottleneck Features Using Pretrained Deep Neural Networks

被引:0
|
作者
Yu, Dong
Seltzer, Michael L.
机构
关键词
bottleneck features; pretraining; deep neural network; deep belief network; NECK FEATURES; LVCSR;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Bottleneck features have been shown to be effective, in improving the accuracy of automatic speech recognition (ASR) systems. Conventionally, bottleneck features are extracted from a multi-layer perceptron (MLP) trained to predict context-independent monophone states. The MLP typically has three hidden layers and is trained using the backpropagation algorithm. In this paper, we propose two improvements to the training of bottleneck features motivated by recent advances in the use of deep neural networks (DNNs) for speech recognition. First, we show how the use of unsupervised pretraining of a DNN enhances the network's discriminative power and improves the bottleneck features it generates. Second, we show that a neural network trained to predict context-dependent senone targets produces better bottleneck features than one trained to predict monophone states. Bottleneck features trained using the proposed methods produced a 16% relative reduction in sentence error rate over conventional bottleneck features on a large vocabulary business search task.
引用
收藏
页码:244 / 247
页数:4
相关论文
共 50 条
  • [31] EXTRACTING DEEP NEURAL NETWORK BOTTLENECK FEATURES USING LOW-RANK MATRIX FACTORIZATION
    Zhang, Yu
    Chuangsuwanich, Ekapol
    Glass, James
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [32] Speech Enhancement Based on Improved Deep Neural Networks with MMSE Pretreatment Features
    Han, Wei
    Wu, Congming
    Zhang, Xiongwei
    Sun, Meng
    Min, Gang
    [J]. PROCEEDINGS OF 2016 IEEE 13TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP 2016), 2016, : 1140 - 1145
  • [33] Improved Glioma Grading Using Deep Convolutional Neural Networks
    Gutta, S.
    Acharya, J.
    Shiroishi, M. S.
    Hwang, D.
    Nayak, K. S.
    [J]. AMERICAN JOURNAL OF NEURORADIOLOGY, 2021, 42 (02) : 233 - 239
  • [34] DEEP NEURAL NETWORK DERIVED BOTTLENECK FEATURES FOR ACCURATE AUDIO CLASSIFICATION
    Zhang, Bihong
    Xie, Lei
    Yuan, Yougen
    Ming, Huaiping
    Huang, Dongyan
    Song, Mingli
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW), 2016,
  • [35] IMPROVED LANGUAGE IDENTIFICATION USING DEEP BOTTLENECK NETWORK
    Song, Yan
    Cui, Ruilian
    Hong, Xinhai
    Mcloughlin, Ian
    Shi, Jiong
    Dai, Lirong
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 4200 - 4204
  • [36] Brain Tumor Classification Using Pretrained Convolutional Neural Networks
    Daniel, Mihalas Constantin
    Ruxandra, Lascu Mihaela
    [J]. 2021 16TH INTERNATIONAL CONFERENCE ON ENGINEERING OF MODERN ELECTRIC SYSTEMS (EMES), 2021, : 130 - 133
  • [37] Automated detection of leukemia by pretrained deep neural networks and transfer learning: A comparison
    Anilkumar, K. K.
    Manoj, V. J.
    Sagi, T. M.
    [J]. MEDICAL ENGINEERING & PHYSICS, 2021, 98 : 8 - 19
  • [38] Improved bottleneck feature using hierarchical deep belief networks for keyword spotting in continues speech
    [J]. 1600, Science and Engineering Research Support Society, 20 Virginia Court, Sandy Bay, Tasmania, Australia (06):
  • [39] An Image Quality Evaluation and Masking Algorithm Based On Pretrained Deep Neural Networks
    Jia, Peng
    Song, Yu
    Lv, Jiameng
    Ning, Runyu
    [J]. ASTRONOMICAL JOURNAL, 2024, 168 (01):
  • [40] Nonlinear Synthesis of Expression Variation Dynamics on Video Using Deep Dynamic Bottleneck Neural Networks
    Moghadam, Saeed Montazeri
    Seyyedsalehi, Seyyed Ali
    Amini, Nima
    [J]. 2017 24TH NATIONAL AND 2ND INTERNATIONAL IRANIAN CONFERENCE ON BIOMEDICAL ENGINEERING (ICBME), 2017, : 178 - 183