Exploring the Effect of Tones for Myanmar Language Speech Recognition Using Convolutional Neural Network (CNN)

被引:1
|
作者
Mon, Aye Nyein [1 ]
Pa, Win Pa [1 ]
Thu, Ye Kyaw [2 ]
机构
[1] Univ Comp Studies, Nat Language Proc Lab, Yangon, Myanmar
[2] Okayama Prefectural Univ, Artificial Intelligence Lab, Okayama, Japan
来源
关键词
Tone information; Automatic Speech Recognition (ASR); Tonal language; Deep Neural Network (DNN); Convolutional Neural Network (CNN);
D O I
10.1007/978-981-10-8438-6_25
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Tone information is very helpful to improve automatic speech recognition (ASR) performance in tonal languages such as Mandarin, Thai, Vietnamese, etc. Since Myanmar language is being considered as a tonal language, the effect of tones on both syllable and word-based ASR performance has been explored. In this work, experiments are done based on the modeling of tones by integrating them into the phoneme set and incorporating them into the Convolutional Neural Network (CNN), state-of-the-art acoustic model. Moreover, to be more effective tone modeling, tonal questions are used to build the phonetic decision tree. With tone information, experiments show that compared with Deep Neural Network (DNN) baseline, the performance of CNN model achieves nearly 2% for word-based ASR or more than 2% for syllable-based ASR improvement over DNN model. As a result, the CNN model with tone information gets 2.43% word error rate (WER) or 2.26% syllable error rate (SER) reductions than without using it.
引用
收藏
页码:314 / 326
页数:13
相关论文
共 50 条
  • [31] Staircase Recognition and Localization Using Convolutional Neural Network (CNN) for Cleaning Robot Application
    Ilyas, Muhammad
    Lakshmanan, Anirudh Krishna
    Le, Anh Vu
    Elara, Mohan Rajesh
    [J]. MATHEMATICS, 2023, 11 (18)
  • [32] Multimodal speech emotion recognition and classification using convolutional neural network techniques
    A. Christy
    S. Vaithyasubramanian
    A. Jesudoss
    M. D. Anto Praveena
    [J]. International Journal of Speech Technology, 2020, 23 : 381 - 388
  • [33] Multimodal speech emotion recognition and classification using convolutional neural network techniques
    Christy, A.
    Vaithyasubramanian, S.
    Jesudoss, A.
    Praveena, M. D. Anto
    [J]. INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2020, 23 (02) : 381 - 388
  • [34] Convolutional Neural Network applied in mime speech recognition using sEMG data
    Ai, Qing
    Zhang, Wei
    Zhang, Bixuan
    Li, Guang
    Yang, Meng
    [J]. 2019 CHINESE AUTOMATION CONGRESS (CAC2019), 2019, : 3347 - 3352
  • [35] Developing a Speech Recognition System for Recognizing Tonal Speech Signals Using a Convolutional Neural Network
    Dua, Sakshi
    Kumar, Sethuraman Sambath
    Albagory, Yasser
    Ramalingam, Rajakumar
    Dumka, Ankur
    Singh, Rajesh
    Rashid, Mamoon
    Gehlot, Anita
    Alshamrani, Sultan S.
    AlGhamdi, Ahmed Saeed
    [J]. APPLIED SCIENCES-BASEL, 2022, 12 (12):
  • [36] Speech Recognition Model for Assamese Language Using Deep Neural Network
    Singh, Moirangthem Tiken
    Barman, Partha Pratim
    Gogoi, Rupjyoti
    [J]. 2018 INTERNATIONAL CONFERENCE ON RECENT INNOVATIONS IN ELECTRICAL, ELECTRONICS & COMMUNICATION ENGINEERING (ICRIEECE 2018), 2018, : 2722 - 2727
  • [37] A Speech Recognition System for Bengali Language using Recurrent Neural Network
    Islam, Jahirul
    Mubassira, Masiath
    Islam, Md. Rakibul
    Das, Amit Kumar
    [J]. 2019 IEEE 4TH INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATION SYSTEMS (ICCCS 2019), 2019, : 73 - 76
  • [38] Gesture Recognition for American Sign Language Using Pytorch and Convolutional Neural Network
    Sethia, Devashsih
    Singh, Pallavi
    Mohapatra, B.
    [J]. INTELLIGENT SYSTEMS AND APPLICATIONS, ICISA 2022, 2023, 959 : 307 - 317
  • [39] Convolutional Neural Network Array for Sign Language Recognition using Wearable IMUs
    Suri, Karush
    Gupta, Rinki
    [J]. 2019 6TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND INTEGRATED NETWORKS (SPIN), 2019, : 483 - 488
  • [40] Indian Sign Language Gesture Recognition Using Deep Convolutional Neural Network
    Varsha, M.
    Nair, Chitra S.
    [J]. 2021 8TH INTERNATIONAL CONFERENCE ON SMART COMPUTING AND COMMUNICATIONS (ICSCC), 2021, : 193 - 197