Effect of Articulatory Δ and ΔΔ Parameters on Multilayer Neural Network based Speech Recognition

被引:0
|
作者
Banik, Manoj [1 ]
Kotwal, Mohammed Rokibul Alam [2 ]
Hassan, Foyzul [3 ]
Islam, Gazi Md. Moshfiqul [2 ]
Rahman, Sharif Mohammad Musfiqur [2 ]
Hasan, Mohammad Mahedi [2 ,4 ]
Muhammad, Ghulam [5 ]
Huda, Mohammad Nurul [2 ]
机构
[1] Ahsanullah Univ Sci & Technol, Dept CSE, Dhaka, Bangladesh
[2] United Int Univ, Dept CSE, Dhaka, Bangladesh
[3] Enosis Solut, Dhaka, Bangladesh
[4] Blueliner Bangladesh, Dhaka, Bangladesh
[5] King Saud Univ, Coll CIS, Dept CE, Riyadh, Saudi Arabia
关键词
Distinctive Phonetic Features; Multi-Layer Neural Network; Local Features; Dynamic Parameters; Hidden Markov Models;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper describes an effect of articulatory dynamic parameters (Delta and Delta Delta) on neural network based automatic speech recognition(ASR). Articulatory features (AFs) or distinctive phonetic features (DPFs)-based system shows its superiority in performances over acoustic features- based in ASR. These performances can be further improved by incorporating articulatory dynamic parameters into it. In this paper, we have proposed such a phoneme recognition system that comprises three stages: (i) DPFs extraction using a multilayer neural network (MLN) from acoustic features, (ii) incorporation of dynamic parameters into another MLN for reducing DPF context, and (iii) addition of an Inhibition/Enhancement (In/En) network for categorizing the DPF movement more accurately and Gram-Schmidt (GS) orthogonalization procedure for decorrelating the inhibited/enhanced data vector before connecting with hidden Markov model (HMMs)-based classifier. From the experiments on Japanese Newspaper Article Sentences (JNAS), it is observed that the proposed method provides a higher phoneme correct rate over the method that does not incorporate dynamic articulatory parameters. Moreover, it reduces mixture components in HMM for obtaining a higher recognition performance.
引用
收藏
页码:624 / 627
页数:4
相关论文
共 50 条
  • [1] Speech recognition method based on multilayer chaotic neural network
    Ren, XL
    Hu, GR
    [J]. CHINESE JOURNAL OF ELECTRONICS, 2001, 10 (01) : 110 - 114
  • [2] Articulatory feature extraction for speech recognition using neural network
    Huda, Mohammad Nurul
    Hasan, Mohammad Mahedi
    Hassan, Foyzul
    Kotwal, Mohammed Rokibul Alam
    Muhammad, Ghulam
    Rahman, Chowdhury Mofizur
    [J]. International Review on Computers and Software, 2011, 6 (01) : 25 - 31
  • [3] Recurrent Neural Network Based Phoneme Recognition Incorporating Articulatory Dynamic Parameters
    Kotwal, Mohammed Rokibul Alam
    Hassan, Foyzul
    Alam, Md. Mahabubul
    Jehad, Abdur Rahman Khan
    Arifuzzaman, Md.
    Huda, Mohammad Nurul
    [J]. ADVANCES IN COMPUTING AND COMMUNICATIONS, PT III, 2011, 192 : 349 - +
  • [4] Multilayer Neural Network Based Speech Emotion Recognition for Smart Assistance
    Kumar, Sandeep
    Haq, MohdAnul
    Jain, Arpit
    Jason, C. Andy
    Moparthi, Nageswara Rao
    Mittal, Nitin
    Alzamil, Zamil S.
    [J]. CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 74 (01): : 1523 - 1540
  • [5] A Study on Speech Recognition by a Neural Network Based on English Speech Feature Parameters
    Mao, Congmin
    Liu, Sujing
    [J]. JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS, 2024, 28 (03) : 679 - 684
  • [6] Optimization of Multilayer Neural Network Parameters for Speaker Recognition
    Tovarek, Jaromir
    Partila, Pavol
    Rozhon, Jan
    Voznak, Miroslav
    Skapa, Jan
    Uhrin, Dominik
    Chmelikova, Zdenka
    [J]. MACHINE INTELLIGENCE AND BIO-INSPIRED COMPUTATION: THEORY AND APPLICATIONS X, 2016, 9850
  • [7] Phone-based speech synthesis with neural network and articulatory control
    Lo, WK
    Ching, PC
    [J]. ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 2227 - 2230
  • [8] A NEW NEURAL-NETWORK FOR ARTICULATORY SPEECH RECOGNITION AND ITS APPLICATION TO VOWEL IDENTIFICATION
    ZACKS, J
    THOMAS, TR
    [J]. COMPUTER SPEECH AND LANGUAGE, 1994, 8 (03): : 189 - 209
  • [9] Hybrid convolutional neural networks for articulatory and acoustic information based speech recognition
    Mitra, Vikramjit
    Sivaraman, Ganesh
    Nam, Hosung
    Espy-Wilson, Carol
    Saltzman, Elliot
    Tiede, Mark
    [J]. SPEECH COMMUNICATION, 2017, 89 : 103 - 112
  • [10] Medical image recognition based on multilayer neural network
    Ma, Nan
    Hou, Yaxin
    [J]. MCB Molecular and Cellular Biomechanics, 2024, 21 (02):