Robust Automatic Speech Recognition System for the Recognition of Continuous Kannada Speech Sentences in the Presence of Noise

Affiliation
[1] Visvesvaraya Technological University, Department of Electronics and Communication Engineering, Vidyavardhaka College of Engineering
Keywords
Approximation coefficients; Detail coefficients; Monophones; Tri-phones; Deep neural networks
Abstract
An automatic speech recognition (ASR) system is developed for recognizing continuous and spontaneous Kannada speech sentences in clean and noisy environments. The language models and acoustic models are constructed using the Kaldi toolkit. The speech corpus is developed with native female and male Kannada speakers and is partitioned into a training set and a testing set. The performance of the proposed system is analysed and evaluated using the word error rate (WER) metric. Wavelet packets combined with Mel filter banks are used to generate the feature vectors. The proposed handcrafted features perform better than baseline features such as Perceptual Linear Prediction and Mel Frequency Cepstral Coefficients in terms of WER under both clean and noisy environmental conditions.
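The abstract evaluates the system with WER but does not spell the metric out. As a rough illustration only, the sketch below computes WER under its conventional definition: the word-level edit distance (substitutions, deletions, insertions) between a reference transcript and a hypothesis, divided by the number of reference words. The function name `word_error_rate` and the placeholder word sequences are assumptions made for this example; they are not taken from the paper, whose scoring was presumably done with Kaldi's own tools.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the reference length.

    Hypothetical helper for illustration; not the paper's scoring script.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference prefix
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr
    return prev[-1] / len(ref)


# Placeholder tokens stand in for real transcripts.
print(word_error_rate("w1 w2 w3 w4", "w1 wx w3"))
```

In this toy case one word is substituted and one is deleted against a four-word reference, so the result is 2 / 4. Because insertions count as errors, WER can exceed 1.0 when the hypothesis is much longer than the reference.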
Pages: 2039-2058
Number of pages: 19