A New Algorithm for Speech Feature Extraction Using Polynomial Chirplet Transform

被引:1
|
作者
Do-Duc, Hao [1 ,2 ,3 ]
Chau-Thanh, Duc [1 ,2 ]
Tran-Thai, Son [1 ,2 ]
机构
[1] Univ Sci, Ho Chi Minh City, Vietnam
[2] Vietnam Natl Univ, Ho Chi Minh City, Vietnam
[3] FPT Univ, Ho Chi Minh City, Vietnam
关键词
Speech feature; Time-frequency analysis; Polynomial chirplet transform; Gender recognition; Dialect recognition; Speech recognition; DISCRETE-FREQUENCY; TIME; REPRESENTATION;
D O I
10.1007/s00034-023-02561-6
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Time-frequency analysis (TFA) is a powerful tool for signal feature representation. In the time-frequency plane, the primary data properties are shown with both instantaneous values and trends of frequency change during time. With a complicated and non-stationary signal such as human speech, the conventional TFA tools, including Fourier transform, wavelet transform, or linear chirplet transform (LCT), cannot reveal and represent speech behaviors well. This research proposes a new method for speech representation with a TFA perspective using polynomial chirplet transform (PCT). Inspired by the Weierstrass theorem, PCT uses a polynomial function for instantaneous frequency (IF) estimation. This polynomial also shapes the modulated atom for the transform. With the strength of a high-degree polynomial, PCT can capture many meaningful features in human speech and then robust the recognition models by improving the features representation. Experimental results in the speech processing tasks have demonstrated the potential of PCT. Furthermore, it will perform better if PCT is optimized with an adaptive strategy to identify the IF function.
引用
收藏
页码:2320 / 2340
页数:21
相关论文
共 50 条
  • [41] Radar Maneuvering Target Motion Parameter Estimation Based on Hough Transform and Polynomial Chirplet Transform
    Lin, Hua
    Zeng, Chao
    Zhang, Hai
    Jiang, Ge
    IEEE ACCESS, 2021, 9 : 35178 - 35195
  • [42] Double-adaptive chirplet transform for radar signature extraction
    Abratkiewicz, Karol
    IET RADAR SONAR AND NAVIGATION, 2020, 14 (10): : 1463 - 1474
  • [43] Feature Extraction from Raw EEG Signals by Using Second Order Polynomial Fitting Algorithm
    Aydemir, Oender
    Kayikcioglu, Temel
    2009 IEEE 17TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, VOLS 1 AND 2, 2009, : 37 - 40
  • [44] Analysis of Seismocardiographic Signals Using Polynomial Chirplet Transform and Smoothed Pseudo Wigner-Ville Distribution
    Taebi, Amirtaha
    Mansy, Hansen A.
    2017 IEEE SIGNAL PROCESSING IN MEDICINE AND BIOLOGY SYMPOSIUM (SPMB), 2017,
  • [45] Pattern feature extraction algorithm based on quantum Fourier transform
    Zhou, Rigui
    Yang, Shuqun
    Xu, Xinwei
    Cao, Yongzhong
    Ding, Qiulin
    Nanjing Hangkong Hangtian Daxue Xuebao/Journal of Nanjing University of Aeronautics and Astronautics, 2008, 40 (01): : 134 - 136
  • [46] Prediction of Rotor Slot Size Variations in Induction Motor Using Polynomial Chirplet Transform and Regression Algorithms
    Kumar, J. Anish
    Swaroopan, N. M. Jothi
    Shanker, N. R.
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2023, 48 (05) : 6099 - 6109
  • [47] Shape Feature Extraction Algorithm Based on Curvelet Transform and Moment
    Kong, Fanzhi
    Qiao, Xujun
    2010 SECOND ETP/IITA WORLD CONGRESS IN APPLIED COMPUTING, COMPUTER SCIENCE, AND COMPUTER ENGINEERING, 2010, : 419 - 422
  • [48] Prediction of Rotor Slot Size Variations in Induction Motor Using Polynomial Chirplet Transform and Regression Algorithms
    Kumar, J. Anish
    Swaroopan, N. M. Jothi
    Shanker, N. R.
    IEEE ACCESS, 2022, 10 : 6099 - 6109
  • [49] Prediction of Rotor Slot Size Variations in Induction Motor Using Polynomial Chirplet Transform and Regression Algorithms
    J. Anish Kumar
    N. M. Jothi Swaroopan
    N. R. Shanker
    Arabian Journal for Science and Engineering, 2023, 48 (5) : 6099 - 6109
  • [50] Improving the filter bank of a classic speech feature extraction algorithm
    Skowronski, MD
    Harris, JG
    PROCEEDINGS OF THE 2003 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOL IV: DIGITAL SIGNAL PROCESSING-COMPUTER AIDED NETWORK DESIGN-ADVANCED TECHNOLOGY, 2003, : 281 - 284