Noise-Robust Feature Extraction Based on Forward Masking

被引:0
|
作者
Chiou, Sheng-Chiuan [1 ]
Chen, Chia-Ping [1 ]
机构
[1] Natl Sun Yat Sen Univ, Dept Comp Sci & Engn, Kaohsiung 80424, Taiwan
关键词
noise robustness; speech recognition; synaptic adaptation; temporal integration; forward masking; AUTOMATIC SPEECH RECOGNITION; MODEL; ADAPTATION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Forward masking is a phenomenon of human auditory perception, that a weaker sound is masked by a preceding stronger masker. The actual cause of forward masking is not clear, but synaptic adaptation and temporal integration are heuristic explanations. In this paper, we postulate the mechanism of forward masking to be synaptic adaptation and temporal integration, and incorporate them in the feature extraction process of an automatic speech recognition system to improve noise-robustness. The synaptic adaptation is implemented by a high-pass filter, and the temporal integration is implemented by a bandpass filter. We apply both filters in the domain of log mel-spectrum. On the Aurora 3 tasks, we evaluate three modified mel-frequency cepstral coefficients: synaptic adaptation only, temporal integration only, and both synaptic adaptation and temporal integration. Experiments show that the overall improvement is 16.1%, 21.8%, and 26.2% respectively in the three cases over the baseline.
引用
收藏
页码:1243 / 1246
页数:4
相关论文
共 50 条
  • [1] Pitch synchronous based feature extraction for noise-robust speaker verification
    Gong Wei-Guo
    Yang Li-Ping
    Chen Di
    [J]. CISP 2008: FIRST INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, VOL 5, PROCEEDINGS, 2008, : 295 - 298
  • [2] Fusion Feature Extraction Based on Auditory and Energy for Noise-Robust Speech Recognition
    Shi, Yanyan
    Bai, Jing
    Xue, Peiyun
    Shi, Dianxi
    [J]. IEEE ACCESS, 2019, 7 : 81911 - 81922
  • [3] Unsupervised noise-robust feature extraction for aerial image classification
    LIANG Ye
    LU Shuai
    WENG Rui
    HAN ChengZhe
    LIU Ming
    [J]. Science China(Technological Sciences)., 2020, 63 (08) - 1415
  • [4] Unsupervised noise-robust feature extraction for aerial image classification
    LIANG Ye
    LU Shuai
    WENG Rui
    HAN ChengZhe
    LIU Ming
    [J]. Science China Technological Sciences, 2020, (08) : 1406 - 1415
  • [5] Unsupervised noise-robust feature extraction for aerial image classification
    Ye Liang
    Shuai Lu
    Rui Weng
    ChengZhe Han
    Ming Liu
    [J]. Science China Technological Sciences, 2020, 63 : 1406 - 1415
  • [6] Unsupervised noise-robust feature extraction for aerial image classification
    Liang Ye
    Lu Shuai
    Weng Rui
    Han ChengZhe
    Liu Ming
    [J]. SCIENCE CHINA-TECHNOLOGICAL SCIENCES, 2020, 63 (08) : 1406 - 1415
  • [7] HMM-based noise-robust feature compensation
    Sasou, Akira
    Asano, Futoshi
    Nakamura, Satoshi
    Tanaka, Kazuyo
    [J]. SPEECH COMMUNICATION, 2006, 48 (09) : 1100 - 1111
  • [8] An engineering model of the masking for the noise-robust speech recognition
    Park, KY
    Lee, SY
    [J]. NEUROCOMPUTING, 2003, 52-4 : 615 - 620
  • [9] Multiscale Superpixelwise Prophet Model for Noise-Robust Feature Extraction in Hyperspectral Images
    Ma, Ping
    Ren, Jinchang
    Sun, Genyun
    Zhao, Huimin
    Jia, Xiuping
    Yan, Yijun
    Zabalza, Jaime
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [10] Orthogonalized distinctive phonetic feature extraction for noise-robust automatic speech recognition
    Fukuda, T
    Nitta, T
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2004, E87D (05): : 1110 - 1118