A New Algorithm for Speech Feature Extraction Using Polynomial Chirplet Transform

被引:1
|
作者
Do-Duc, Hao [1 ,2 ,3 ]
Chau-Thanh, Duc [1 ,2 ]
Tran-Thai, Son [1 ,2 ]
机构
[1] Univ Sci, Ho Chi Minh City, Vietnam
[2] Vietnam Natl Univ, Ho Chi Minh City, Vietnam
[3] FPT Univ, Ho Chi Minh City, Vietnam
关键词
Speech feature; Time-frequency analysis; Polynomial chirplet transform; Gender recognition; Dialect recognition; Speech recognition; DISCRETE-FREQUENCY; TIME; REPRESENTATION;
D O I
10.1007/s00034-023-02561-6
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Time-frequency analysis (TFA) is a powerful tool for signal feature representation. In the time-frequency plane, the primary data properties are shown with both instantaneous values and trends of frequency change during time. With a complicated and non-stationary signal such as human speech, the conventional TFA tools, including Fourier transform, wavelet transform, or linear chirplet transform (LCT), cannot reveal and represent speech behaviors well. This research proposes a new method for speech representation with a TFA perspective using polynomial chirplet transform (PCT). Inspired by the Weierstrass theorem, PCT uses a polynomial function for instantaneous frequency (IF) estimation. This polynomial also shapes the modulated atom for the transform. With the strength of a high-degree polynomial, PCT can capture many meaningful features in human speech and then robust the recognition models by improving the features representation. Experimental results in the speech processing tasks have demonstrated the potential of PCT. Furthermore, it will perform better if PCT is optimized with an adaptive strategy to identify the IF function.
引用
收藏
页码:2320 / 2340
页数:21
相关论文
共 50 条
  • [31] Frequency-varying group delay estimation using frequency domain polynomial chirplet transform
    Yang, Y.
    Peng, Z. K.
    Zhang, W. M.
    Meng, G.
    MECHANICAL SYSTEMS AND SIGNAL PROCESSING, 2014, 46 (01) : 146 - 162
  • [32] Parametric Identification of Nonlinear Vibration Systems Via Polynomial Chirplet Transform
    Deng, Y.
    Cheng, C. M.
    Yang, Y.
    Peng, Z. K.
    Yang, W. X.
    Zhang, W. M.
    JOURNAL OF VIBRATION AND ACOUSTICS-TRANSACTIONS OF THE ASME, 2016, 138 (05):
  • [33] New algorithm for pitch detection of speech signals using wavelet transform
    Zhong, Jinhong
    Yang, Shanlin
    Lin, Yirong
    Lu, Kui
    Dianzi Yu Xinxi Xuebao/Journal of Electronics and Information Technology, 2002, 24 (09):
  • [34] A new feature in speech recognition based on wavelet transform
    Hao, Y
    Zhu, XY
    2000 5TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS I-III, 2000, : 1526 - 1529
  • [35] An encrypted speech retrieval algorithm based on Chirp-Z transform and perceptual hashing second feature extraction
    Qiu-yu Zhang
    Zi-xian Ge
    Ying-jie Hu
    Jian Bai
    Yi-bo Huang
    Multimedia Tools and Applications, 2020, 79 : 6337 - 6361
  • [36] An encrypted speech retrieval algorithm based on Chirp-Z transform and perceptual hashing second feature extraction
    Zhang, Qiu-yu
    Ge, Zi-xian
    Hu, Ying-jie
    Bai, Jian
    Huang, Yi-bo
    MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (9-10) : 6337 - 6361
  • [37] Feature Extraction Using Parameterized Multisynchrosqueezing Transform
    Li, Xinyan
    Zhao, Huimuin
    Yu, Ling
    Chen, Huayue
    Den, Wuquan
    Deng, Wu
    IEEE SENSORS JOURNAL, 2022, 22 (14) : 14263 - 14272
  • [38] Adaptive linear chirplet synchroextracting transform for time-frequency feature extraction of non-stationary signals
    Yan, Zhu
    Jiao, Jingpin
    Xu, Yonggang
    MECHANICAL SYSTEMS AND SIGNAL PROCESSING, 2024, 220
  • [39] Improved Feature Parameter Extraction from Speech Signals Using Machine Learning Algorithm
    Abdusalomov, Akmalbek Bobomirzaevich
    Safarov, Furkat
    Rakhimov, Mekhriddin
    Turaev, Boburkhon
    Whangbo, Taeg Keun
    SENSORS, 2022, 22 (21)
  • [40] Speech Representation Using Linear Chirplet Transform and Its Application in Speaker-Related Recognition
    Do, Hao D.
    Chau, Duc T.
    Tran, Son T.
    COMPUTATIONAL COLLECTIVE INTELLIGENCE, ICCCI 2022, 2022, 13501 : 719 - 729