HUMAN-LIKE DYNAMIC-PROGRAMMING NEURAL NETWORKS FOR DYNAMIC TIME WARPING SPEECH RECOGNITION

被引:2
|
作者
CHIU, CC
SHANBLATT, MA
机构
[1] SONY ELECTR INC, DIV FACTORY AUTOMAT, RES & DEV LABS, ORANGEBURG, NY 10962 USA
[2] MICHIGAN STATE UNIV, DEPT ELECT ENGN, E LANSING, MI 48824 USA
关键词
D O I
10.1142/S012906579500007X
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a human-like dynamic programming neural network method for speech recognition using dynamic time warping. The networks are configured, much like human's, such that the minimum states of the network's energy function represent the near-best correlation between test and reference patterns. The dynamics and properties of the neural networks are analytically explained. Simulations for classifying speaker-dependent isolated words, consisting of 0 to 9 and A to Z, show that the method is better than conventional methods. The hardware implementation of this method is also presented.
引用
收藏
页码:79 / 89
页数:11
相关论文
共 50 条
  • [1] A DYNAMIC-PROGRAMMING PROCESSOR FOR SPEECH RECOGNITION
    QUENOT, GM
    GAUVAIN, JL
    GANGOLF, JJ
    MARIANI, JJ
    IEEE JOURNAL OF SOLID-STATE CIRCUITS, 1989, 24 (02) : 349 - 357
  • [2] DYNAMIC-PROGRAMMING AND STATISTICAL MODELING IN AUTOMATIC SPEECH RECOGNITION
    RUSSELL, MJ
    MOORE, RK
    TOMLINSON, MJ
    JOURNAL OF THE OPERATIONAL RESEARCH SOCIETY, 1986, 37 (01) : 21 - 30
  • [3] Speech Recognition Using Dynamic Time Warping
    Amin, Talal Bin
    Mahmood, Iftekhar
    ICAST 2008: PROCEEDINGS OF 2ND INTERNATIONAL CONFERENCE ON ADVANCES IN SPACE TECHNOLOGIES, 2008, : 72 - 77
  • [4] Speech recognition using dynamic programming of Bayesian neural networks
    Huang, CC
    Wang, JF
    Wu, CH
    Lee, JY
    CENTRAL AUDITORY PROCESSING AND NEURAL MODELING, 1998, : 71 - 76
  • [5] An HMM-Like Dynamic Time Warping Scheme for Automatic Speech Recognition
    Ding, Ing-Jr
    Hsu, Yen-Ming
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2014, 2014
  • [7] On using dynamic programming for time warping in pattern recognition
    Mizutani, Eiji
    Dreyfus, Stuart
    INFORMATION SCIENCES, 2021, 580 : 684 - 704
  • [8] Speech recognition using Dynamic Time Warping (DTW)
    Permanasari, Yurika
    Harahap, Erwin H.
    Ali, Erwin Prayoga
    2ND INTERNATIONAL CONFERENCE ON APPLIED & INDUSTRIAL MATHEMATICS AND STATISTICS, 2019, 1366
  • [9] BROAD PHONETIC CLASSIFICATION AND SEGMENTATION OF CONTINUOUS SPEECH BY MEANS OF NEURAL NETWORKS AND DYNAMIC-PROGRAMMING
    MARTENS, JP
    DEPUYDT, L
    SPEECH COMMUNICATION, 1991, 10 (01) : 81 - 90
  • [10] Successes and critical failures of neural networks in capturing human-like speech recognition
    Adolfi, Federico
    Bowers, Jeffrey S.
    Poeppel, David
    NEURAL NETWORKS, 2023, 162 : 199 - 211