Speech Recognition using Hilbert-Huang Transform Based Features

被引:0
|
作者
Hanna, Samer S. [1 ]
Korany, Noha [2 ]
Abd-el-Malek, Mina B. [1 ,3 ]
机构
[1] Alexandria Univ, Fac Engn, Dept Engn Math & Phys, Alexandria, Egypt
[2] Alexandria Univ, Fac Engn, Dept Elect Engn, Alexandria, Egypt
[3] Amer Univ Cairo, Dept Math & Actuarial Sci, Cairo, Egypt
关键词
Automatic Speech Recognition; Hilbert-Huang Transform; Mel-frequency Cepstral Coefficients; EMPIRICAL MODE DECOMPOSITION;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The Mel-frequency Cepstral Coefficients (MFCCs) are widely used for feature extraction in Automatic Speech Recognition (ASR) systems. MFCCs start by dividing the speech into windows and calculating the Fourier Transform (FT) of each window. The frequency resolution obtained using this scheme depends on the time width of the window. A small window would fail to provide a good frequency resolution, while a big window would fail to obtain a good time resolution. This phenomenon is explained by the way the FT defines frequency; it tries to map the signal to a set of predefined bases. In this work, we propose a speech feature extraction method, we will refer to as Mel Hilbert Frequency Cepstral Coefficients (MHFCCs). MHFCCs use the Hilbert-Huang Transform (HHT) instead of the windowing and the FT scheme used in MFCCs. The HHT is an adaptive time-frequency transform suitable for non-linear and non-stationary signals. It generates its bases from the signal itself. This enables it to obtain a high frequency resolution representation of the signal regardless of the duration of the time window used. Results show that MHFCCs outperform MFCCs in recognition accuracy for a small time window.
引用
收藏
页码:338 / 341
页数:4
相关论文
共 50 条
  • [1] Hilbert-Huang Transform based speech enhancement
    Shen, LR
    Li, XY
    Wang, HQ
    Yin, QB
    Zhang, RB
    [J]. Proceedings of the 8th Joint Conference on Information Sciences, Vols 1-3, 2005, : 591 - 595
  • [2] Speech detection based on Hilbert-Huang Transform
    Wang, Wu
    Li, Xueyao
    Zhang, Rubo
    [J]. FIRST INTERNATIONAL MULTI-SYMPOSIUMS ON COMPUTER AND COMPUTATIONAL SCIENCES (IMSCCS 2006), PROCEEDINGS, VOL 1, 2006, : 290 - +
  • [3] Speech enhancement based on Hilbert-Huang transform
    Liu, ZF
    Liao, ZP
    Sang, EF
    [J]. Proceedings of 2005 International Conference on Machine Learning and Cybernetics, Vols 1-9, 2005, : 4908 - 4912
  • [4] IRIS RECOGNITION BASED ON HILBERT-HUANG TRANSFORM
    Yang, Zhijing
    Yang, Zhihua
    Yang, Lihua
    [J]. ADVANCES IN DATA SCIENCE AND ADAPTIVE ANALYSIS, 2009, 1 (04) : 623 - 641
  • [5] Speech pitch determination based on Hilbert-Huang transform
    Huang, H
    Pan, JQ
    [J]. SIGNAL PROCESSING, 2006, 86 (04) : 792 - 803
  • [6] Method of Speech Enhancement Based on Hilbert-Huang Transform
    Li, Xueyao
    Zou, Xiaojie
    Zhang, Rubo
    Liu, Guanqun
    [J]. 2008 7TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION, VOLS 1-23, 2008, : 8419 - 8424
  • [7] Speech enhancement based on Hilbert-Huang Transform theory
    Zou, Xiaojie
    Li, Xueyao
    Zhang, Rubo
    [J]. FIRST INTERNATIONAL MULTI-SYMPOSIUMS ON COMPUTER AND COMPUTATIONAL SCIENCES (IMSCCS 2006), PROCEEDINGS, VOL 1, 2006, : 208 - +
  • [8] Hilbert Huang Transform based Speech Recognition
    Vani, H. Y.
    Anusuya, M. A.
    [J]. 2016 SECOND INTERNATIONAL CONFERENCE ON COGNITIVE COMPUTING AND INFORMATION PROCESSING (CCIP), 2016,
  • [9] Time-frequency Analysis Based on Hilbert-Huang Transform for Depression Recognition in Speech
    Liu, Zhenyu
    Xu, Yaping
    Ding, ZhiJie
    Chen, Qiongqiong
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2020, : 1072 - 1076
  • [10] Adaptive speech enhancement algorithm based on Hilbert-Huang transform
    Jiang, Na
    Li, Jiyuan
    [J]. Ingenierie des Systemes d'Information, 2019, 24 (01): : 57 - 60