Nonlinear dynamical system based acoustic modeling for ASR

被引:0
|
作者
Warakagoda, ND [1 ]
Johnsen, MH [1 ]
机构
[1] NTNU, Dept Telecommun, N-7034 Trondheim, Norway
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The work presented here is centered around a speech production model called Chained Dynamical System Model (CDSM) which is motivated by the fundamental limitations of the mainstream ASR approaches, The CDSM is essentially a smoothly time varying continuous state nonlinear dynamical system, consisting of two sub dynamical systems coupled as a chain so that one system controls the parameters of the next system. The speech recognition problem is posed as inverting the CDSM, for which we propose a solution based on the theory of Embedding. The resulting architecture, which we call Inverted CDSM (ICDSM) is evaluated in a set of experiments involving a speaker independent, continuous speech recognition task on the TIMIT database. Results of these experiments which can be compared with the corresponding results in the literature, confirm the feasibility and advantages of the approach.
引用
收藏
页码:525 / 528
页数:4
相关论文
共 50 条
  • [1] Agricultural Growth Modeling based on Nonlinear Dynamical System
    Sulaiman, A.
    Sadly, M.
    2012 INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER SCIENCE AND INFORMATION SYSTEMS, 2012, : 331 - 334
  • [2] Acoustic cavitation. A typical, nonlinear dynamical system
    Lauterborn, W.
    Acustica, 1991, 75 (02):
  • [3] Investigation of knowledge transfer approaches to improve the acoustic modeling of Vietnamese ASR system
    Liu, Danyang
    Xu, Ji
    Zhang, Pengyuan
    Yan, Yonghong
    IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2019, 6 (05) : 1187 - 1195
  • [4] Investigation of Knowledge Transfer Approaches to Improve the Acoustic Modeling of Vietnamese ASR System
    Danyang Liu
    Ji Xu
    Pengyuan Zhang
    Yonghong Yan
    IEEE/CAAJournalofAutomaticaSinica, 2019, 6 (05) : 1187 - 1195
  • [5] DENSENET BLSTM FOR ACOUSTIC MODELING IN ROBUST ASR
    Strake, Maximilian
    Behr, Pascal
    Lohrenz, Timo
    Fingscheidt, Tim
    2018 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2018), 2018, : 6 - 12
  • [6] ACOUSTIC MODELING BASED ON EARLY-TO-LATE REVERBERATION RATIO FOR ROBUST ASR
    Matassoni, Marco
    Brutti, Alessio
    Svaizer, Piergiorgio
    2014 14TH INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC), 2014, : 263 - 267
  • [7] A COMPREHENSIVE STUDY OF RESIDUAL CNNS FOR ACOUSTIC MODELING IN ASR
    Bozheniuk, Vitalii
    Zeyer, Albert
    Schlueter, Ralf
    Ney, Hermann
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 7674 - 7678
  • [8] A STUDY ON MULTILINGUAL ACOUSTIC MODELING FOR LARGE VOCABULARY ASR
    Lin, Hui
    Deng, Li
    Yu, Dong
    Gong, Yi-fan
    Acero, Alex
    Lee, Chin-Hui
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 4333 - +
  • [9] Comparison of Acoustic Modeling Techniques for Vietnamese and Khmer ASR
    Le, Viet Bac
    Besacier, Laurent
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 129 - 132
  • [10] Investigation on the combination of batch normalization and dropout in BLSTM-based acoustic modeling for ASR
    Li, Wenjie
    Cheng, Gaofeng
    Ge, Fengpei
    Zhang, Pengyuan
    Yan, Yonghong
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 2888 - 2892