Restructuring Output Layers of Deep Neural Networks using Minimum Risk Parameter Clustering

被引:0
|
作者
Kubo, Yotaro [1 ]
Suzuki, Jun [1 ]
Hori, Takaaki [1 ]
Nakamura, Atsushi [1 ]
机构
[1] NTT Commun Sci Labs, Kyoto, Japan
关键词
Automatic speech recognition; context clustering; discrete constraint optimization;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper attempts to optimize a topology of hidden Markov models (HMMs) for automatic speech recognition. Current state-of-the-art acoustic models for ASR involve HMMs with deep neural network (DNN)-based emission density functions. Even though DNN parameters are typically trained by optimizing a discriminative criterion, topology optimization of HMMs is usually performed by optimizing a generative criterion. Several approaches have been studied to achieve a discriminative state clustering, these approaches typically assume underlying Gaussian distributions of the acoustic features, and do not compatible with DNN-based emission density functions. In this paper, we attempt to derive a discriminative restructuring method of an HMM topology by introducing discriminative optimization with discrete constraints on the parameters, which force the parameters to be tied with the parameters of the other states. By applying this constrained optimization to the clustering of parameters of DNN-based acoustic models, we derived a discriminative HMM restructuring method that maintains discriminative performance of the original HMMs with the large number of states.
引用
收藏
页码:1068 / 1072
页数:5
相关论文
共 50 条
  • [1] Deep Neural Networks with Cascaded Output Layers
    [J]. Bai, Jie (baijie@tongji.edu.cn), 1600, Science Press (45):
  • [2] SPEAKER ADAPTATION OF DEEP NEURAL NETWORKS USING A HIERARCHY OF OUTPUT LAYERS
    Price, Ryan
    Iso, Ken-ichi
    Shinoda, Koichi
    [J]. 2014 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY SLT 2014, 2014, : 153 - 158
  • [3] Credit Risk Assessment Algorithm using Deep Neural Networks with Clustering and Merging
    Li, Ying
    Lin, Xianghong
    Wang, Xiangwen
    Shen, Fanqi
    Gong, Zuzheng
    [J]. 2017 13TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND SECURITY (CIS), 2017, : 173 - 176
  • [4] Minimum Euclidean distance evaluation using deep neural networks
    Lee, Jong-Ho
    [J]. AEU-INTERNATIONAL JOURNAL OF ELECTRONICS AND COMMUNICATIONS, 2019, 112
  • [5] Non-Parametric Clustering Using Deep Neural Networks
    Avgerinos, Christos
    Solachidis, Vassilios
    Vretos, Nicholas
    Daras, Petros
    [J]. IEEE ACCESS, 2020, 8 : 153630 - 153640
  • [6] Controlling the performance of deep neural networks using a single parameter
    Luo, Yi
    Li, Sheng
    Zhang, Xinpeng
    Feng, Guorui
    [J]. EVOLUTIONARY INTELLIGENCE, 2024, 17 (5-6) : 4303 - 4313
  • [7] A Study on Layers of Deep Neural Networks
    Lim, Hyun-il
    [J]. 2020 THE 3RD INTERNATIONAL CONFERENCE ON INTELLIGENT AUTONOMOUS SYSTEMS (ICOIAS'2020), 2020, : 31 - 34
  • [8] Deep Residual Output Layers for Neural Language Generation
    Pappas, Nikolaos
    Henderson, James
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
  • [9] Behavior Prediction Scheme Using Hierarchical Clustering and Deep Neural Networks
    Altameem, Arwa A.
    Hafez, Alaaeldin M.
    [J]. JOURNAL OF NANOELECTRONICS AND OPTOELECTRONICS, 2022, 17 (05) : 861 - 872
  • [10] Randomnet: clustering time series using untrained deep neural networks
    Li, Xiaosheng
    Xi, Wenjie
    Lin, Jessica
    [J]. DATA MINING AND KNOWLEDGE DISCOVERY, 2024,