An acoustic model and linguistic analysis for Malayalam disyllabic words: a low resource language

被引:3
|
作者
Lekshmi, K. R. [1 ]
Sherly, Elizabeth [2 ]
机构
[1] Bharathiar Univ, Coimbatore, Tamil Nadu, India
[2] Indian Inst Informat Technol & Management Kerala, Trivandrum, Kerala, India
关键词
Convolutional neural network; Voicegram or spectrogram; Automatic speech recognition; Malayalam; Voice onset time; Formant analysis; Velar; Palatal; CONVOLUTIONAL NEURAL-NETWORKS; SPEECH; CONSONANTS; FEATURES;
D O I
10.1007/s10772-021-09807-1
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Automatic Speech Recognition (ASR) has reaped a lot of attention in recent years. Despite the recent advancements in ASR, the potential for extracting the raw features from speech remains lacking. This paper proposes an Automatic Speech Recognition system on Malayalam speech data using spectrogram images and Convolutional Neural Network (CNN). The voicegram/spectrogram images of sound files are generated, which is fed into CNN. Convolutional Neural Network topology is defined with a set of Convolution and Fully Connected layers and used Softmax layer for classification. An accuracy of 93.33% achieved with this proposed model indicates that spectrogram image-based approaches have promising results in speech-based recognition. An analysis of acoustic characteristics of Malayalam disyllabic words selected to design the ASR system with formant analysis, voice onset time and spectral moments from 4000 tokens produced by 20 speakers is also conducted. A comparison between CNN model and multiple classifiers with acoustic features have been performed and proved the efficiency of deep Neural Networks over raw features.
引用
收藏
页码:483 / 495
页数:13
相关论文
共 50 条
  • [21] Unsupervised SMT: an analysis of Indic languages and a low resource language
    Saxena, Shefali
    Chauhan, Shweta
    Arora, Paras
    Daniel, Philemon
    [J]. JOURNAL OF EXPERIMENTAL & THEORETICAL ARTIFICIAL INTELLIGENCE, 2024, 36 (06) : 865 - 884
  • [22] Analysis and modeling of dialect information in Ao, a low resource language
    Tzudir, Moakala
    Sarmah, Priyankoo
    Prasanna, S. R. Mahadeva
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2021, 149 (05): : 2976 - 2987
  • [23] Language Model Prior for Low-Resource Neural Machine Translation
    Baziotis, Christos
    Haddow, Barry
    Birch, Alexandra
    [J]. PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 7622 - 7634
  • [24] Resource Construction and Ensemble Learning Based Sentiment Analysis for the Low-resource Language Uyghur
    Yusup, Azragul
    Chen, Degang
    Ge, Yifei
    Mao, Hongliang
    Wang, Nujian
    [J]. JOURNAL OF INTERNET TECHNOLOGY, 2023, 24 (04): : 1009 - 1016
  • [25] Refining Low-Resource Unsupervised Translation by Language Disentanglement of Multilingual Model
    Nguyen, Xuan-Phi
    Joty, Shafiq
    Kui, Wu
    Aw, Ai Ti
    [J]. arXiv, 2022,
  • [26] Joint Learning Model for Low-Resource Agglutinative Language Morphological Tagging
    Abudouwaili, Gulinigeer
    Abiderexiti, Kahaerjiang
    Yi, Nian
    Wumaier, Aishan
    [J]. Proceedings of the Annual Meeting of the Association for Computational Linguistics, 2023, : 27 - 37
  • [27] Refining Low-Resource Unsupervised Translation by Language Disentanglement of Multilingual Model
    Nguyen, Xuan-Phi
    Joty, Shafiq
    Kui, Wu
    Aw, Ai Ti
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
  • [28] Exploiting Language Relatedness for Low Web-Resource Language Model Adaptation: An Indic Languages Study
    Khemchandani, Yash
    Mehtani, Sarvesh
    Patil, Vaidehi
    Awasthi, Abhijeet
    Talukdar, Partha
    Sarawagi, Sunita
    [J]. 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 1 (ACL-IJCNLP 2021), 2021, : 1312 - 1323
  • [29] Enhancing Sentiment Analysis in Amharic: Leveraging Transformer-Based Language Model for Low-Resource African Languages
    Raychawdhary, Nilanjana
    Das, Amit
    Bhattacharya, Sutanu
    Dozier, Gerry
    Seals, Cheryl D.
    [J]. SOUTHEASTCON 2024, 2024, : 50 - 55
  • [30] Named-Entity Recognition for a Low-resource Language using Pre-Trained Language Model
    Yohannes, Hailemariam Mehari
    Amagasa, Toshiyuki
    [J]. 37TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING, 2022, : 837 - 844