CONTEXT-DEPENDENT MODELLING OF DEEP NEURAL NETWORK USING LOGISTIC REGRESSION

被引:0
|
作者
Wang, Guangsen [1 ]
Sim, Khe Chai [1 ]
机构
[1] Natl Univ Singapore, Dept Comp Sci, Sch Comp, Singapore 117548, Singapore
关键词
Context-Dependent Modelling; Deep Neural Network; Logistic Regression; Canonical State Modelling; Articulatory Features; RECOGNITION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The data sparsity problem of context-dependent acoustic modelling in automatic speech recognition is addressed by using the decision tree state clusters as the training targets in the standard context-dependent (CD) deep neural network (DNN) systems. As a result, the CD states within a cluster cannot be distinguished during decoding. This problem, referred to as the clustering problem, is not explicitly addressed in the current literature. In this paper, we formulate the CD DNN as an instance of the canonical state modelling technique based on a set of broad phone classes to address both the data sparsity and the clustering problems. The triphone is clustered into multiple sets of shorter biphones using broad phone contexts to address the data sparsity issue. A DNN is trained to discriminate the biphones within each set. The canonical states are represented by the concatenated log posteriors of all the broad phone DNNs. Logistic regression is used to transform the canonical states into the triphone state output probability. Clustering of the regression parameters is used to reduce model complexity while still achieving unique acoustic scores for all possible triphones. The experimental results on a broadcast news transcription task reveal that the proposed regression-based CD DNN significantly outperforms the standard CD DNN. The best system provides a 2.7% absolute WER reduction compared to the best standard CD DNN system.
引用
收藏
页码:338 / 343
页数:6
相关论文
共 50 条
  • [41] Prediction of mortality of premature neonates using neural network and logistic regression
    Rezaeian, Aramesh
    Rezaeian, Marzieh
    Khatami, Seyede Fatemeh
    Khorashadizadeh, Fatemeh
    Moghaddam, Farshid Pouralizadeh
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2022, 13 (03) : 1269 - 1277
  • [42] Prediction of mortality of premature neonates using neural network and logistic regression
    Aramesh Rezaeian
    Marzieh Rezaeian
    Seyede Fatemeh Khatami
    Fatemeh Khorashadizadeh
    Farshid Pouralizadeh Moghaddam
    Journal of Ambient Intelligence and Humanized Computing, 2022, 13 : 1269 - 1277
  • [43] Context-Dependent Help for the DynaLearn Modelling and Simulation Workbench
    Beek, Wouter
    Bredeweg, Bert
    Latour, Sander
    ARTIFICIAL INTELLIGENCE IN EDUCATION, 2011, 6738 : 420 - 422
  • [44] Sentiment strength detection with a context-dependent lexicon-based convolutional neural network
    Huang, Minghui
    Xie, Haoran
    Rao, Yanghui
    Feng, Jingrong
    Wang, Fu Lee
    INFORMATION SCIENCES, 2020, 520 : 389 - 399
  • [45] Modelling the Scene Dependent Imaging in Cameras with a Deep Neural Network
    Nam, Seonghyeon
    Kim, Seon Joo
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 1726 - 1734
  • [46] Global context-dependent recurrent neural network language model with sparse feature learning
    Deng, Hongli
    Zhang, Lei
    Wang, Lituan
    NEURAL COMPUTING & APPLICATIONS, 2019, 31 (Suppl 2): : 999 - 1011
  • [47] VGG16-Based Diffractive Optical Neural Network and Context-Dependent Processing
    Zhao Xingya
    Yang Zhiwei
    Dai Jian
    Zhang Tian
    Xu Kun
    ACTA OPTICA SINICA, 2022, 42 (19)
  • [48] Low-Energy and Fast Spiking Neural Network For Context-Dependent Learning on FPGA
    Asgari, Hajar
    Maybodi, Babak Mazloom-Nezhad
    Payvand, Melika
    Azghadi, Mostafa Rahimi
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2020, 67 (11) : 2697 - 2701
  • [49] Global context-dependent recurrent neural network language model with sparse feature learning
    Hongli Deng
    Lei Zhang
    Lituan Wang
    Neural Computing and Applications, 2019, 31 : 999 - 1011
  • [50] Seismic features and automatic discrimination of deep and shallow induced-microearthquakes using neural network and logistic regression
    Mousavi, S. Mostafa
    Horton, Stephen P.
    Langston, Charles A.
    Samei, Borhan
    GEOPHYSICAL JOURNAL INTERNATIONAL, 2016, 207 (01) : 29 - 46