Performance analysis of ASR system in hybrid DNN-HMM framework using a PWL euclidean activation function

被引:3
|
作者
Dutta, Anirban [1 ]
Ashishkumar, Gudmalwar [1 ]
Rao, Ch V. Rama [1 ]
机构
[1] Natl Inst Technol Meghalaya, Dept Elect & Commun Engn, Shillong 793003, Meghalaya, India
关键词
deep learning; euclidean; piecewise linear; speech recognition; activation function; SPEECH RECOGNITION; NEURAL-NETWORKS; LANGUAGE;
D O I
10.1007/s11704-020-9419-z
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Automatic Speech Recognition (ASR) is the process of mapping an acoustic speech signal into a human readable text format. Traditional systems exploit the Acoustic Component of ASR using the Gaussian Mixture Model - Hidden Markov Model (GMM-HMM) approach. Deep Neural Network (DNN) opens up new possibilities to overcome the shortcomings of conventional statistical algorithms. Recent studies modeled the acoustic component of ASR system using DNN in the so called hybrid DNN-HMM approach. In the context of activation functions used to model the non-linearity in DNN, Rectified Linear Units (ReLU) and maxout units are mostly used in ASR systems. This paper concentrates on the acoustic component of a hybrid DNN-HMM system by proposing an efficient activation function for the DNN network. Inspired by previous works, euclidean norm activation function is proposed to model the non-linearity of the DNN network. Such non-linearity is shown to belong to the family of Piecewise Linear (PWL) functions having distinct features. These functions can capture deep hierarchical features of the pattern. The relevance of the proposal is examined in depth both theoretically and experimentally. The performance of the developed ASR system is evaluated in terms of Phone Error Rate (PER) using TIMIT database. Experimental results achieve a relative increase in performance by using the proposed function over conventional activation functions.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Performance analysis of ASR system in hybrid DNN-HMM framework using a PWL euclidean activation function
    Anirban DUTTA
    Gudmalwar ASHISHKUMAR
    Ch V Rama RAO
    [J]. Frontiers of Computer Science, 2021, (04) : 196 - 206
  • [2] Performance analysis of ASR system in hybrid DNN-HMM framework using a PWL euclidean activation function
    Anirban Dutta
    Gudmalwar Ashishkumar
    Ch V. Rama Rao
    [J]. Frontiers of Computer Science, 2021, 15
  • [3] On quantifying the quality of acoustic models in hybrid DNN-HMM ASR
    Dighe, Pranay
    Asaei, Afsaneh
    Bourlard, Herve
    [J]. SPEECH COMMUNICATION, 2020, 119 : 24 - 35
  • [4] Combining hybrid DNN-HMM ASR systems with attention-based models using lattice rescoring
    Li, Qiujia
    Zhang, Chao
    Woodland, Philip C.
    [J]. SPEECH COMMUNICATION, 2023, 147 : 12 - 21
  • [5] English to Japanese Spoken Lecture Translation System by Using DNN-HMM and Phrase-based SMT
    Goto, Norioki
    Yamamoto, Kazumasa
    Nakagawa, Seiichi
    [J]. 2015 2ND INTERNATIONAL CONFERENCE ON ADVANCED INFORMATICS: CONCEPTS, THEORY AND APPLICATIONS ICAICTA, 2015,
  • [6] A Joint End-to-End and DNN-HMM Hybrid Automatic Speech Recognition System with Transferring Sharable Knowledge
    Tanaka, Tomohiro
    Masumura, Ryo
    Moriya, Takafumi
    Oba, Takanobu
    Aono, Yushi
    [J]. INTERSPEECH 2019, 2019, : 2210 - 2214
  • [7] Lip-reading via a DNN-HMM Hybrid System Using Combination of The Image-based and Model-based Features
    Rahmani, Mohammad Hasan
    Almasganj, Farshad
    [J]. 2017 3RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION AND IMAGE ANALYSIS (IPRIA), 2017, : 195 - 199
  • [8] Performance analysis of a DNN classifier for power system events using an interpretability method
    Santos, Orlem L. D.
    Dotta, Daniel
    Wang, Meng
    Chow, Joe H.
    Decker, Ildemar C.
    [J]. INTERNATIONAL JOURNAL OF ELECTRICAL POWER & ENERGY SYSTEMS, 2022, 136
  • [9] A Framework for Detecting System Performance Anomalies Using Tracing Data Analysis
    Kohyarnejadfard, Iman
    Aloise, Daniel
    Dagenais, Michel R.
    Shakeri, Mahsa
    [J]. ENTROPY, 2021, 23 (08)
  • [10] A novel hybrid framework for Cloud Intrusion Detection System using system call sequence analysis
    Chaudhari, Ashish
    Gohil, Bhavesh
    Rao, Udai Pratap
    [J]. CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2024, 27 (03): : 3753 - 3769