CONTEXT-DEPENDENT MODELLING OF DEEP NEURAL NETWORK USING LOGISTIC REGRESSION

被引:0
|
作者
Wang, Guangsen [1 ]
Sim, Khe Chai [1 ]
机构
[1] Natl Univ Singapore, Dept Comp Sci, Sch Comp, Singapore 117548, Singapore
关键词
Context-Dependent Modelling; Deep Neural Network; Logistic Regression; Canonical State Modelling; Articulatory Features; RECOGNITION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The data sparsity problem of context-dependent acoustic modelling in automatic speech recognition is addressed by using the decision tree state clusters as the training targets in the standard context-dependent (CD) deep neural network (DNN) systems. As a result, the CD states within a cluster cannot be distinguished during decoding. This problem, referred to as the clustering problem, is not explicitly addressed in the current literature. In this paper, we formulate the CD DNN as an instance of the canonical state modelling technique based on a set of broad phone classes to address both the data sparsity and the clustering problems. The triphone is clustered into multiple sets of shorter biphones using broad phone contexts to address the data sparsity issue. A DNN is trained to discriminate the biphones within each set. The canonical states are represented by the concatenated log posteriors of all the broad phone DNNs. Logistic regression is used to transform the canonical states into the triphone state output probability. Clustering of the regression parameters is used to reduce model complexity while still achieving unique acoustic scores for all possible triphones. The experimental results on a broadcast news transcription task reveal that the proposed regression-based CD DNN significantly outperforms the standard CD DNN. The best system provides a 2.7% absolute WER reduction compared to the best standard CD DNN system.
引用
收藏
页码:338 / 343
页数:6
相关论文
共 50 条
  • [31] On Context-Dependent Neural Networks and Speaker Adaptation
    Zelinka, Jan
    Trmal, Jan
    Mueller, Ludek
    PROCEEDINGS OF 2012 IEEE 11TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP) VOLS 1-3, 2012, : 515 - 518
  • [32] A Neural Model for Context-dependent Sequence Learning
    Luc Berthouze
    Adriaan Tijsseling
    Neural Processing Letters, 2006, 23 : 27 - 45
  • [33] The Computational and Neural Bases of Context-Dependent Learning
    Heald, James B.
    Wolpert, Daniel M.
    Lengyel, Mate
    ANNUAL REVIEW OF NEUROSCIENCE, 2023, 46 : 233 - 258
  • [34] A neural model for context-dependent sequence learning
    Berthouze, L
    Tijsseling, A
    NEURAL PROCESSING LETTERS, 2006, 23 (01) : 27 - 45
  • [35] Neural Evolution of Context-Dependent Fly Song
    Ding, Yun
    Lilivis, Joshua L.
    Cande, Jessica
    Berman, Gordon J.
    Arthur, Benjamin J.
    Long, Xi
    Xu, Min
    Dickson, Barry J.
    Stern, David L.
    CURRENT BIOLOGY, 2019, 29 (07) : 1089 - +
  • [36] Context-dependent neural nets - Structures and learning
    Ciskowski, P
    Rafajlowicz, E
    IEEE TRANSACTIONS ON NEURAL NETWORKS, 2004, 15 (06): : 1367 - 1377
  • [37] Context-Dependent Neural Modulations in the Perception of Duration
    Murai, Yuki
    Yotsumoto, Yuko
    FRONTIERS IN INTEGRATIVE NEUROSCIENCE, 2016, 10
  • [38] REGULARIZATION OF CONTEXT-DEPENDENT DEEP NEURAL NETWORKS WITH CONTEXT-INDEPENDENT MULTI-TASK TRAINING
    Bell, Peter
    Renals, Steve
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 4290 - 4294
  • [39] CONTEXT-DEPENDENT DEEP NEURAL NETWORKS FOR AUDIO INDEXING OF REAL-LIFE DATA
    Li, Gang
    Zhu, Huifeng
    Cheng, Gong
    Thambiratnam, Kit
    Chitsaz, Behrooz
    Yu, Dong
    Seide, Frank
    2012 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2012), 2012, : 143 - 148
  • [40] Context-dependent route generation scheme using Network Voronoi Diagrams
    Kambara, Tomoya
    Kibe, Hiroaki
    Nishide, Ryo
    Ohnishi, Masaaki
    Ueshima, Shinichi
    2007 IEEE INTERNATIONAL WORKSHOP ON DATABASES FOR NEXT GENERATION RESEARCHERS, 2007, : 111 - +