Continuous Kannada Noisy Speech Recognition

Cited by: 0
Authors
Pasha, Nadeem [1]
Roopa, S. [1]
Affiliations
[1] Siddaganga Inst Technol, Dept Elect & Commun Engn, Tumakuru, Karnataka, India
Keywords
ASR; DNN; Generalized Distillation Framework; Kaldi; MFSC; NEURAL-NETWORKS; ENHANCEMENT
DOI
Not available
Chinese Library Classification number
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification code
0808; 0809
Abstract
Automatic speech recognition (ASR) converts a speech signal into its corresponding text. The performance of an ASR system degrades in noisy environments. To overcome this problem, speech enhancement needs to be performed on noisy speech before it is fed to an ASR system. Speech enhancement techniques have been developed over the past several decades, but some of these techniques introduce musical noise. To achieve further improvement in recognition accuracy, a generalized distillation framework is used, in which machines learn from machines. In this paper, an ASR system is implemented for noisy Kannada speech using the generalized distillation framework. In this framework, a teacher machine is trained on clean speech and a student machine on speech corrupted by 4 different noise types, and the teacher machine helps the student machine learn by providing the additional information it needs. During the test phase, the student machine is tested on speech with 4 different noise types other than those used in training. A DNN acoustic model is built using 39-dimensional MFSC features, and a bigram language model is created using the Kaldi Speech Recognition Toolkit. Experimental results show that the generalized distillation framework for noisy Kannada speech achieves a reduction in word error rate (WER) compared to an HMM-GMM approach.
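The teacher-student idea in the abstract can be illustrated with a minimal sketch of a generalized distillation loss for a single speech frame. The abstract does not give the exact loss, so everything specific here is an assumption: the function names, the imitation weight, and the temperature are hypothetical. The student's loss mixes cross-entropy against the hard label with cross-entropy against the teacher's temperature-softened outputs.

```python
import math

def softmax(logits, temperature=1.0):
    # Temperature > 1 flattens the distribution, exposing the teacher's
    # relative confidence in the non-target classes.
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]

def cross_entropy(target, predicted, eps=1e-12):
    # Cross-entropy between a target distribution and a predicted one.
    return -sum(t * math.log(p + eps) for t, p in zip(target, predicted))

def distillation_loss(student_logits, teacher_logits, label,
                      imitation=0.5, temperature=2.0):
    # imitation and temperature are assumed values, not taken from the paper.
    one_hot = [1.0 if i == label else 0.0 for i in range(len(student_logits))]
    student_probs = softmax(student_logits)
    soft_targets = softmax(teacher_logits, temperature)
    hard_term = cross_entropy(one_hot, student_probs)       # fit the true label
    soft_term = cross_entropy(soft_targets, student_probs)  # imitate the teacher
    return (1.0 - imitation) * hard_term + imitation * soft_term

# Teacher logits come from the clean-speech frame, student logits from the
# noisy version of the same frame (both made up for illustration).
loss = distillation_loss([2.0, 0.5, -1.0], [3.0, 0.2, -0.5], label=0)
```

With `imitation=0.0` the loss reduces to ordinary supervised training on hard labels; raising it makes the student rely more on what the clean-speech teacher predicts.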
Pages: 857-861
Number of pages: 5