An acoustic model and linguistic analysis for Malayalam disyllabic words: a low resource language

Cited by: 3
Authors
Lekshmi, K. R. [1]
Sherly, Elizabeth [2]
Affiliations
[1] Bharathiar Univ, Coimbatore, Tamil Nadu, India
[2] Indian Inst Informat Technol & Management Kerala, Trivandrum, Kerala, India
Keywords
Convolutional neural network; Voicegram or spectrogram; Automatic speech recognition; Malayalam; Voice onset time; Formant analysis; Velar; Palatal; CONVOLUTIONAL NEURAL-NETWORKS; SPEECH; CONSONANTS; FEATURES
DOI
10.1007/s10772-021-09807-1
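Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronic technology, communication technology]
Discipline classification code
0808; 0809
Abstract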
Automatic Speech Recognition (ASR) has attracted considerable attention in recent years. Despite recent advances in ASR, the potential of extracting raw features directly from speech remains underexploited. This paper proposes an ASR system for Malayalam speech data based on spectrogram images and a Convolutional Neural Network (CNN). Voicegram/spectrogram images are generated from the sound files and fed into the CNN. The CNN topology consists of a set of convolution and fully connected layers, with a softmax layer for classification. The proposed model achieves an accuracy of 93.33%, which indicates that spectrogram image-based approaches are promising for speech recognition. The paper also analyzes the acoustic characteristics of the Malayalam disyllabic words selected to design the ASR system, using formant analysis, voice onset time, and spectral moments from 4000 tokens produced by 20 speakers. A comparison between the CNN model and multiple classifiers trained on acoustic features was performed and showed the effectiveness of deep neural networks over raw features.
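The abstract's pipeline starts by turning each sound file into a spectrogram image and ends with a softmax layer. The sketch below illustrates only those two generic steps with NumPy; it is not the authors' implementation. The frame length, hop size, sample rate, synthetic test tone, and the function names `spectrogram` and `softmax` are all assumptions chosen for illustration, since the record does not give the paper's actual settings.

```python
import numpy as np


def spectrogram(signal, frame_len=256, hop=128):
    """Log-magnitude spectrogram from a short-time Fourier transform.

    frame_len and hop are illustrative values, not taken from the paper.
    Returns an array of shape (frequency_bins, time_frames).
    """
    window = np.hanning(frame_len)
    n_frames = 1 + (len(signal) - frame_len) // hop
    # Slice the signal into overlapping, windowed frames.
    frames = np.stack([
        signal[i * hop: i * hop + frame_len] * window
        for i in range(n_frames)
    ])
    magnitude = np.abs(np.fft.rfft(frames, axis=1))
    # Small constant avoids log(0) in silent bins.
    return 20.0 * np.log10(magnitude + 1e-10).T


def softmax(logits):
    """Numerically stable softmax over a 1-D vector of class scores."""
    shifted = logits - np.max(logits)
    exp = np.exp(shifted)
    return exp / exp.sum()


# Hypothetical input: a 1-second 440 Hz tone sampled at 8 kHz,
# standing in for a recorded word.
sample_rate = 8000
t = np.arange(sample_rate) / sample_rate
tone = np.sin(2 * np.pi * 440 * t)

spec = spectrogram(tone)
# Frequency bin with the highest average energy, converted to Hz.
peak_bin = int(np.argmax(spec.mean(axis=1)))
peak_hz = peak_bin * sample_rate / 256

# Hypothetical class scores, as a final layer might produce.
probs = softmax(np.array([2.0, 1.0, 0.1]))
```

In a full system, the 2-D array `spec` would be saved or passed as an image to the convolutional layers, and `softmax` would convert the last fully connected layer's outputs into per-word probabilities.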
Pages: 483-495
Number of pages: 13
Related papers
50 in total
  • [1] An acoustic model and linguistic analysis for Malayalam disyllabic words: a low resource language
    K. R. Lekshmi
    Elizabeth Sherly
    [J]. International Journal of Speech Technology, 2021, 24 : 483 - 495
  • [2] Analysis of Low-Resource Acoustic Model Self-Training
    Novotney, Scott
    Schwartz, Richard
    [J]. INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 236 - 239
  • [3] ANALYSIS OF MULTILINGUAL BLSTM ACOUSTIC MODEL ON LOW AND HIGH RESOURCE LANGUAGES
    Karafiat, Martin
    Baskar, Murali Karthick
    Vesely, Karel
    Grezl, Frantisek
    Burget, Lukas
    Cernocky, Jan
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5789 - 5793
  • [4] Bridging the Gap: Towards Linguistic Resource Development for the Low-Resource Lambani Language
    Dasare, Ashwini
    Chowdhury, Amartya Roy
    Menon, Aditya Srinivas
    Anand, Konjengbam
    Deepak, K. T.
    Prasanna, S. R. M.
    [J]. SPEECH AND COMPUTER, SPECOM 2023, PT II, 2023, 14339 : 127 - 139
  • [5] Non-Linear Pairwise Language Mappings for Low-Resource Multilingual Acoustic Model Fusion
    Farooq, Muhammad Umar
    Narayana, Darshan Adiga Haniya
    Hain, Thomas
    [J]. INTERSPEECH 2022, 2022, : 4850 - 4854
  • [6] FROM MODAL LANGUAGE TO MODEL LANGUAGE - HARTSHORNE,CHARLES AND LINGUISTIC ANALYSIS
    VANDERVEKEN, J
    [J]. HARTSHORNE, PROCESS PHILOSOPHY, AND THEOLOGY, 1989, : 33 - 51
  • [7] Efficiently Fusing Pretrained Acoustic and Linguistic Encoders for Low-Resource Speech Recognition
    Yi, Cheng
    Zhou, Shiyu
    Xu, Bo
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2021, 28 : 788 - 792
  • [8] Acoustic and lexical resource constrained ASR using language-independent acoustic model and language-dependent probabilistic lexical model
    Rasipuram, Ramya
    Magimai-Doss, Mathew
    [J]. SPEECH COMMUNICATION, 2015, 68 : 23 - 40
  • [9] Improved low-resource Somali speech recognition by semi-supervised acoustic and language model training
    Biswas, Astik
    Menon, Raghav
    van der Westhuizen, Ewald
    Niesler, Thomas
    [J]. INTERSPEECH 2019, 2019, : 3008 - 3012
  • [10] Linguistic Foundations of Low-Resource Languages for Speech Synthesis on the Example of the Kazakh Language
    Bekmanova, Gulmira
    Yergesh, Banu
    Sharipbay, Altynbek
    Omarbekova, Assel
    Zakirova, Alma
    [J]. COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2022 WORKSHOPS, PART III, 2022, 13379 : 3 - 14