Application of deep learning approach for recognition of voiced Odia digits

Cited by: 0
Authors
Mohanty, Prithviraj [1]
Sahoo, Jyoti Prakash [1]
Nayak, Ajit Kumar [1]
Affiliations
[1] SOA Deemed Univ, Dept Comp Sci & Informat Technol, ITER, Bhubaneswar, India
Keywords
automatic speech recognition; ASR; convolutional neural network; CNN; deep neural network; DNN; MFCC; HMM; SVM; spectrogram; speech recognition; neural network
DOI
10.1504/IJCSE.2022.10047843
Chinese Library Classification number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
Automatic speech recognition (ASR) in a regional language such as Odia is a challenging field of research. Voiced Odia digit recognition helps in designing automatic voice dialler systems. In this study, a deep learning approach is used to recognise voiced Odia digits. Spectrogram representations of the voiced samples are given as input to the deep learning models after feature extraction using MFCC. Performance metrics are obtained from several experiments that vary the number of epochs and the train-validate-test split ratio of the dataset. The experimental results show that the CNN model achieves an accuracy of 91.72% at 500 epochs with an 80-10-10 split, higher than the other two models, which use VSL and DNN. The reported results also indicate that the proposed CNN model has better average recognition accuracy than contemporary models such as HMM and SVM.
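As an illustration of the 80-10-10 train-validate-test split and the accuracy metric mentioned in the abstract, a minimal Python sketch follows. The function names, the fixed seed, and the placeholder sample list are assumptions made for illustration; they are not taken from the paper, which does not describe its splitting procedure here.

```python
import random


def split_dataset(samples, ratios=(0.8, 0.1, 0.1), seed=0):
    """Shuffle samples and split them into train, validation, and test lists.

    The ratios default to 80-10-10; the seed is a hypothetical choice that
    only makes the shuffle reproducible.
    """
    if abs(sum(ratios) - 1.0) > 1e-9:
        raise ValueError("ratios must sum to 1")
    shuffled = list(samples)
    random.Random(seed).shuffle(shuffled)
    n_train = round(len(shuffled) * ratios[0])
    n_val = round(len(shuffled) * ratios[1])
    train = shuffled[:n_train]
    val = shuffled[n_train:n_train + n_val]
    test = shuffled[n_train + n_val:]  # the remainder goes to the test set
    return train, val, test


def accuracy(predicted, actual):
    """Return the fraction of predictions that match the true labels."""
    if len(predicted) != len(actual) or not actual:
        raise ValueError("inputs must be non-empty and of equal length")
    correct = sum(p == a for p, a in zip(predicted, actual))
    return correct / len(actual)


# Placeholder data: 1000 integer IDs standing in for recorded digit samples.
train, val, test = split_dataset(range(1000))
print(len(train), len(val), len(test))
```

This sketch shuffles without stratifying by digit class; whether the authors balanced classes across the three subsets is not stated in the abstract.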
Pages: 513-522
Number of pages: 11