Noise Elimination in Degraded Kannada Speech Signal for Speech Recognition

被引:0
|
作者
Yadava, Thimmaraja G. [1 ]
Prakash, Jai T. S. [1 ]
Jayanna, H. S. [1 ]
机构
[1] Siddaganga Inst Technol, Dept Informat Sci & Engn, Tumkur, Karnataka, India
关键词
Automatic Speech Recognition (ASR); Voice Activity Detection (VAD); Linear Prediction Coefficient (LPC); ENHANCEMENT;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we demonstrate the methods for preprocessing of noisy speech data to build an Automatic Speech Recognition (ASR) for Kannada language. The methods are spectral subtraction with Voice Activity Detection (VAD), Linear Prediction Coefficient (LPC) analysis of speech using autocorrelation and periodogram subtraction method. In spectral subtraction method, noisy speech data is segmented and windowed into 50% overlapped frames and is processed frame by frame. An application of VAD is to detect only active regions of speech signal. In LPC analysis of noisy speech using periodogram and autocorrelation subtraction methods, the autocorrelation coefficients are calculated first and then by subtracting the periodograms of additive noisy signal from corrupted speech signal, the noise is eliminated.
引用
收藏
页数:6
相关论文
共 50 条
  • [31] Recognition of speech produced in noise
    Pittman, AL
    Wiley, TL
    [J]. JOURNAL OF SPEECH LANGUAGE AND HEARING RESEARCH, 2001, 44 (03): : 487 - 496
  • [32] Speaker Dependent Continuous Kannada Speech Recognition Using HMM
    Hemakumar, G.
    Punitha, P.
    [J]. 2014 INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING APPLICATIONS (ICICA 2014), 2014, : 402 - 405
  • [33] SPEECH SEPARATION BASED ON SIGNAL-NOISE-DEPENDENT DEEP NEURAL NETWORKS FOR ROBUST SPEECH RECOGNITION
    Tu, Yan-Hui
    Du, Jun
    Dai, Li-Rong
    Lee, Chin-Hui
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 61 - 65
  • [34] Implementation of Phonetic Level Speech Recognition in Kannada using HTK
    Priya, Jeeva K.
    Sree, S. Sowmya
    Navya, T. V. S.
    Gupta, Deepa
    [J]. PROCEEDINGS OF THE 2018 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATION AND SIGNAL PROCESSING (ICCSP), 2018, : 82 - 85
  • [35] Signal preprocessing for speech recognition
    Kolokolov, AS
    [J]. AUTOMATION AND REMOTE CONTROL, 2002, 63 (03) : 494 - 501
  • [36] Signal Preprocessing for Speech Recognition
    A. S. Kolokolov
    [J]. Automation and Remote Control, 2002, 63 : 494 - 501
  • [37] Cepstral normalisation and the signal to noise ratio spectrum in automatic speech recognition
    Garner, Philip N.
    [J]. SPEECH COMMUNICATION, 2011, 53 (08) : 991 - 1001
  • [38] Noise-Robust speech recognition of Conversational Telephone Speech
    Chen, Gang
    Tolba, Hesham
    O'Shaughnessy, Douglas
    [J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1101 - 1104
  • [39] Preprocessing and Segmentation of the Speech Signal in the Frequency Domain for Speech Recognition
    A. S. Kolokolov
    [J]. Automation and Remote Control, 2003, 64 : 985 - 994
  • [40] Preprocessing and segmentation of the speech signal in the frequency domain for speech recognition
    Kolokolov, AS
    [J]. AUTOMATION AND REMOTE CONTROL, 2003, 64 (06) : 985 - 994