Orthogonalized distinctive phonetic feature extraction for noise-robust automatic speech recognition

被引:0
|
作者
Fukuda, T [1 ]
Nitta, T [1 ]
机构
[1] Toyohashi Univ Technol, Grad Sch Engn, Toyohashi, Aichi 4418580, Japan
来源
关键词
automatic speech recognition (ASR); feature extraction; distinctive phonetic feature (DPF); orthogonalization; local feature (LF);
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we propose a noise-robust automatic speech recognition system that uses orthogonalized distinctive phonetic features (DPFs) as input of HMM with diagonal covariance. In an orthogonalized DPF extraction stage, first, a speech signal is converted to acoustic features composed of local features (LFs) and DeltaP, then a multilayer neural network (MLN) with 15 x 3 output units composed of context-dependent DPFs of a preceding context DPF vector, a current DPF vector, and a following context DPF vector maps the LFs to DPFs. Karhunen-Loeve transform (KLT) is then applied to orthogonalize each DPF vector in the context-dependent DPFs, using orthogonal bases calculated from a DPF vector that represents 38 Japanese phonemes. Each orthogonalized DPF vector is finally decor-related one another by using Gram-Schmidt orthogonalization procedure. related one another by using Gram In experiments, after evaluating the parameters of the MLN input and output units in the DPF extractor. the orthogonalized DPFs are compared with original DPFs. The orthogonalized DPFs are then evaluated in comparison with a standard parameter set of MFCCs and dynamic features. Next, noise robustness is tested using four types of additive noise. The experimental results show that the use of the proposed orthogonalized DPFs can significantly reduce the error rate in an isolated spoken-word recognition task both with clean speech and with speech contaminated by additive noise. Furthermore, we achieved significant improvements when combining the orthogonalized DPFs with conventional static MFCCs and DeltaP.
引用
收藏
页码:1110 / 1118
页数:9
相关论文
共 50 条
  • [41] Synchrony-Based Feature Extraction for Robust Automatic Speech Recognition
    de-La-Calle-Silos, Fernando
    Stern, Richard M.
    IEEE SIGNAL PROCESSING LETTERS, 2017, 24 (08) : 1158 - 1162
  • [42] FEATURE EXTRACTION WITH A MULTISCALE MODULATION ANALYSIS FOR ROBUST AUTOMATIC SPEECH RECOGNITION
    Mueller, Florian
    Mertins, Alfred
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 7427 - 7431
  • [43] Feature extraction for robust speech recognition
    Dharanipragada, S
    2002 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOL II, PROCEEDINGS, 2002, : 855 - 858
  • [44] Noise-robust speech feature processing with empirical mode decomposition
    Kuo-Hau Wu
    Chia-Ping Chen
    Bing-Feng Yeh
    EURASIP Journal on Audio, Speech, and Music Processing, 2011
  • [45] Noise-robust speech feature processing with empirical mode decomposition
    Wu, Kuo-Hau
    Chen, Chia-Ping
    Yeh, Bing-Feng
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2011, : 1 - 9
  • [46] Feature-based Noise Robust Speech Recognition on an Indonesian Language Automatic Speech Recognition System
    Satriawan, Cil Hardianto
    Lestari, Dessi Puji
    2014 International Conference on Electrical Engineering and Computer Science (ICEECS), 2014, : 42 - 46
  • [47] Dual-channel VTS feature compensation for noise-robust speech recognition on mobile devices
    Lopez-Espejo, Ivan
    Peinado, Antonio M.
    Gomez, Angel M.
    Gonzalez, Jose A.
    IET SIGNAL PROCESSING, 2017, 11 (01) : 17 - 25
  • [48] Unsupervised Speech Enhancement Based on Multichannel NMF-Informed Beamforming for Noise-Robust Automatic Speech Recognition
    Shimada, Kazuki
    Bando, Yoshiaki
    Mimura, Masato
    Itoyama, Katsutoshi
    Yoshii, Kazuyoshi
    Kawahara, Tatsuya
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2019, 27 (05) : 960 - 971
  • [49] Noise-robust speech recognition based on difference of power spectrum
    Xu, JF
    Wei, G
    ELECTRONICS LETTERS, 2000, 36 (14) : 1247 - 1248
  • [50] Unsupervised noise-robust feature extraction for aerial image classification
    LIANG Ye
    LU Shuai
    WENG Rui
    HAN ChengZhe
    LIU Ming
    Science China(Technological Sciences), 2020, 63 (08) : 1406 - 1415