Continuous Kannada Noisy Speech Recognition

被引:0
|
作者
Pasha, Nadeem [1 ]
Roopa, S. [1 ]
机构
[1] Siddaganga Inst Technol, Dept Elect & Commun Engn, Tumakuru, Karnataka, India
关键词
ASR; DNN; Generalized Distillation Framework; Kaldi; MFSC; NEURAL-NETWORKS; ENHANCEMENT;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
ASR converts speech signal into corresponding text form. The performance of an ASR decreases under noisy environment. To overcome this problem a speech enhancement need to be performed on noisy speech before being fed to an ASR system. Speech enhancement techniques have been developed over past several decades, some of these techniques introduce musical noise. To achieve further improvement in recognition accuracy, a generalized distillation framework is used in which machines learns machines. In this paper, an ASR is implemented for noisy kannada language speech using generalized distillation framework. In this framework, a teacher machine is trained with clean speech and student machine with 4 different noise speech and teacher machine help student machine to learn by providing additional information needed. During test phase, a student machine is tested with 4 different noise speech other than used in training. A DNN acoustic model is build using a 39 dimension MFSC features and bi-gram language model is created using Kaldi Speech Recognition Toolkit. Experimental results shows that generalized distillation framework for kannada noisy speech achieved a reduction in WER compared to an HMM-GMM approach.
引用
收藏
页码:857 / 861
页数:5
相关论文
共 50 条
  • [1] Investigation on large vocabulary continuous Kannada speech recognition
    Vanajakshi, Puttaswamy Gowda
    Mathivanan, M.
    Kumaran, T. Senthil
    [J]. INTERNATIONAL JOURNAL OF BIOMEDICAL ENGINEERING AND TECHNOLOGY, 2021, 36 (01) : 1 - 24
  • [2] Continuous Kannada Speech Recognition System Under Degraded Condition
    P. S. Praveen Kumar
    G. Thimmaraja Yadava
    H. S. Jayanna
    [J]. Circuits, Systems, and Signal Processing, 2020, 39 : 391 - 419
  • [3] Continuous Speech Recognition of Kannada Language using Triphone Modeling
    Sajjan, Sharada C.
    Vijaya, C.
    [J]. PROCEEDINGS OF THE 2016 IEEE INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, SIGNAL PROCESSING AND NETWORKING (WISPNET), 2016, : 451 - 455
  • [4] Continuous Kannada Speech Recognition System Under Degraded Condition
    Praveen Kumar, P. S.
    Thimmaraja Yadava, G.
    Jayanna, H. S.
    [J]. CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2020, 39 (01) : 391 - 419
  • [5] Speaker Dependent Continuous Kannada Speech Recognition Using HMM
    Hemakumar, G.
    Punitha, P.
    [J]. 2014 INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING APPLICATIONS (ICICA 2014), 2014, : 402 - 405
  • [6] Robust Automatic Speech Recognition System for the Recognition of Continuous Kannada Speech Sentences in the Presence of Noise
    Mahadevaswamy
    [J]. WIRELESS PERSONAL COMMUNICATIONS, 2023, 130 (03) : 2039 - 2058
  • [8] Robust Perceptual Wavelet Packet Features for Recognition of Continuous Kannada Speech
    Mahadevaswamy
    Ravi, D. J.
    [J]. WIRELESS PERSONAL COMMUNICATIONS, 2021, 121 (03) : 1781 - 1804
  • [9] Robust Perceptual Wavelet Packet Features for Recognition of Continuous Kannada Speech
    D. J. Mahadevaswamy
    [J]. Wireless Personal Communications, 2021, 121 : 1781 - 1804
  • [10] Performance Analysis of Hybrid Automatic Continuous Speech Recognition Framework for Kannada Dialect
    Kumar, Praveen P. S.
    Jayanna, H. S.
    [J]. 2019 10TH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND NETWORKING TECHNOLOGIES (ICCCNT), 2019,