Taylor Series Expansion of Psychoacoustic Corruption Function for Noise Robust Speech Recognition

被引:0
|
作者
Das, Biswajit [1 ]
Panda, Ashish [1 ]
机构
[1] TCS Innovat Labs, Mumbai Yantra Pk, Thana 400601, Maharashtra, India
关键词
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, we present model domain adaptation algorithms for noise robust speech recognition. We have proposed a Taylor Series expansion of the psychoacoustic corruption function, which provides for superior noise robustness. The proposed joint adaptation algorithm consists of Psychoacoustic Model Compensation (Psy-Comp) and Model-domain Cepstral Mean Normalization (MCMN). While the Psy-Comp compensates for the additive noise component, the MCMN compensates for the channel component. The proposed algorithms are validated through experiments on noise corrupted TIMIT speech recognition database. We show that the joint compensation scheme provides 5% (absolute) performance gain compared to the Vector Taylor Series (VTS) scheme. Also, the proposed technique is computationally efficient and faster as compared to the VTS scheme. Keywords: Speech Recognition, Additive Noise, Model Compensation, Model Domain Mean Normalization, Vector Taylor Series
引用
收藏
页码:568 / 572
页数:5
相关论文
共 50 条
  • [1] Vector Taylor Series Expansion with Auditory Masking for Noise Robust Speech Recognition
    Das, Biswajit
    Panda, Ashish
    [J]. 2016 10TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2016,
  • [2] ON NOISE ESTIMATION FOR ROBUST SPEECH RECOGNITION USING VECTOR TAYLOR SERIES
    Zhao, Yong
    Juang, Biing-Hwang
    [J]. 2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4290 - 4293
  • [3] Psychoacoustic masking effect for noise robust speech recognition robot
    Miyanaga, Yoshikazu
    [J]. ISSCS 2019 - International Symposium on Signals, Circuits and Systems, 2019,
  • [4] Psychoacoustic Masking Effect for Noise Robust Speech Recognition Robot
    Miyanaga, Yoshikazu
    [J]. 2019 INTERNATIONAL SYMPOSIUM ON SIGNALS, CIRCUITS AND SYSTEMS (ISSCS 2019), 2019,
  • [5] Psychoacoustic Model Compensation for Robust Continuous Speech Recognition in Additive Noise
    Das, Biswajit
    Panda, Ashish
    [J]. 2015 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY (ISSPIT), 2015, : 511 - 515
  • [6] Use of Generalised Nonlinearity in Vector Taylor Series Noise Compensation for Robust Speech Recognition
    Loweimi, Erfan
    Barker, Jon
    Hain, Thomas
    [J]. 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 3798 - 3802
  • [7] NOISE ADAPTIVE TRAINING USING A VECTOR TAYLOR SERIES APPROACH FOR NOISE ROBUST AUTOMATIC SPEECH RECOGNITION
    Kalinli, Ozlem
    Seltzer, Michael L.
    Acero, Alex
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 3825 - 3828
  • [8] A NOISE-ROBUST SPEECH RECOGNITION METHOD COMPOSED OF WEAK NOISE SUPPRESSION AND WEAK VECTOR TAYLOR SERIES ADAPTATION
    Komeiji, Shuji
    Arakawa, Takayuki
    Koshinaka, Takafumi
    [J]. 2012 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2012), 2012, : 103 - 106
  • [9] Model adaptation employing DNN-based estimation of noise corruption function for noise-robust speech recognition
    Yoon, Ki-mu
    Kim, Wooil
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2019, 38 (01): : 47 - 50
  • [10] Speech recognition enhancement by psychoacoustic modeled noise suppression
    Lai, YP
    Hui, MC
    Kok, CW
    Siu, MH
    [J]. 2004 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXP (ICME), VOLS 1-3, 2004, : 1335 - 1338