Noise-robust speech recognition using a new spectral estimation method "PHASOR"

被引:0
|
作者
Aikawa, K [1 ]
Ishizuka, K [1 ]
机构
[1] NTT Corp, Commun Sci Labs, Atsugi, Kanagawa 2430198, Japan
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper proposes a new noise-robust spectral estimation method for speech recognition. The new method, called PHASOR, is characterized by inside-frame processing. The speech spectrum is estimated from a single impulse response obtained by summing multiple pitch periods in a frame with synchronizing the phase. PHASOR improves the spectral estimation accuracy and suppresses the additive noise because of the inside-frame processing. These improvement is more effective when the pitch fluctuates or changes in the frame. Speaker-dependent and speaker-independent phoneme recognition experiments demonstrate that the PHASOR greatly reduces the recognition error rate for speech data contaminated by noise. It also outperforms conventional noise reduction methods, cepstral mean normalization and spectral subtraction.
引用
收藏
页码:397 / 400
页数:4
相关论文
共 50 条
  • [31] Deep maxout networks applied to noise-robust speech recognition
    [J]. de-la-Calle-Silos, F. (fsilos@tsc.uc3m.es), 1600, Springer Verlag (8854):
  • [32] A companding front end for noise-robust automatic speech recognition
    Guinness, J
    Raj, B
    Schmidt-Nielsen, B
    Turicchia, L
    Sarpeshkar, R
    [J]. 2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 249 - 252
  • [33] Noise-robust speech triage
    Bartos, Anthony L.
    Cipr, Tomas
    Nelson, Douglas J.
    Schwarz, Petr
    Banowetz, John
    Jerabek, Ladislav
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2018, 143 (04): : 2313 - 2320
  • [34] Noise-Robust Algorithm of Speech Features Extraction for Automatic Speech Recognition System
    Yakhnev, A. N.
    Pisarev, A. S.
    [J]. PROCEEDINGS OF THE XIX IEEE INTERNATIONAL CONFERENCE ON SOFT COMPUTING AND MEASUREMENTS (SCM 2016), 2016, : 206 - 208
  • [35] Model adaptation employing DNN-based estimation of noise corruption function for noise-robust speech recognition
    Yoon, Ki-mu
    Kim, Wooil
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2019, 38 (01): : 47 - 50
  • [36] Sequential noise estimation for noise-robust speech recognition based on 1st-order VTS approximation
    Ding, GH
    Wang, X
    Cao, Y
    Ding, F
    Tang, YZ
    [J]. 2005 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), 2005, : 363 - 368
  • [37] Histogram equalization for noise-robust speech recognition using discrete-mixture HMMs
    Kosaka, Tetsuo
    Katoh, Masaharu
    Kohda, Masaki
    [J]. ACOUSTICAL SCIENCE AND TECHNOLOGY, 2008, 29 (01) : 66 - 73
  • [38] An Efficient Noise-Robust Automatic Speech Recognition System using Artificial Neural Networks
    Gupta, Santosh
    Bhurchandi, Kishor M.
    Keskar, Avinash G.
    [J]. 2016 INTERNATIONAL CONFERENCE ON COMMUNICATION AND SIGNAL PROCESSING (ICCSP), VOL. 1, 2016, : 1873 - 1877
  • [39] Agricultural price information acquisition using noise-robust Mandarin auto speech recognition
    Xu J.
    Zhu Y.
    Xu P.
    Ma D.
    [J]. International Journal of Speech Technology, 2018, 21 (3) : 681 - 688
  • [40] Noise-robust automatic speech recognition using Mainlobe-Resilient time-frequency quantile-based noise estimation
    Lee, SW
    Ching, PC
    Lee, T
    [J]. 2004 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOL 3, PROCEEDINGS, 2004, : 425 - 428