A New Algorithm for Speech Feature Extraction Using Polynomial Chirplet Transform

被引:1
|
作者
Do-Duc, Hao [1 ,2 ,3 ]
Chau-Thanh, Duc [1 ,2 ]
Tran-Thai, Son [1 ,2 ]
机构
[1] Univ Sci, Ho Chi Minh City, Vietnam
[2] Vietnam Natl Univ, Ho Chi Minh City, Vietnam
[3] FPT Univ, Ho Chi Minh City, Vietnam
关键词
Speech feature; Time-frequency analysis; Polynomial chirplet transform; Gender recognition; Dialect recognition; Speech recognition; DISCRETE-FREQUENCY; TIME; REPRESENTATION;
D O I
10.1007/s00034-023-02561-6
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Time-frequency analysis (TFA) is a powerful tool for signal feature representation. In the time-frequency plane, the primary data properties are shown with both instantaneous values and trends of frequency change during time. With a complicated and non-stationary signal such as human speech, the conventional TFA tools, including Fourier transform, wavelet transform, or linear chirplet transform (LCT), cannot reveal and represent speech behaviors well. This research proposes a new method for speech representation with a TFA perspective using polynomial chirplet transform (PCT). Inspired by the Weierstrass theorem, PCT uses a polynomial function for instantaneous frequency (IF) estimation. This polynomial also shapes the modulated atom for the transform. With the strength of a high-degree polynomial, PCT can capture many meaningful features in human speech and then robust the recognition models by improving the features representation. Experimental results in the speech processing tasks have demonstrated the potential of PCT. Furthermore, it will perform better if PCT is optimized with an adaptive strategy to identify the IF function.
引用
收藏
页码:2320 / 2340
页数:21
相关论文
共 50 条
  • [1] A New Algorithm for Speech Feature Extraction Using Polynomial Chirplet Transform
    Hao Do-Duc
    Duc Chau-Thanh
    Son Tran-Thai
    Circuits, Systems, and Signal Processing, 2024, 43 : 2320 - 2340
  • [2] Speech feature extraction using linear Chirplet transform and its applications
    Do, Hao Duc
    Chau, Duc Thanh
    Tran, Son Thai
    JOURNAL OF INFORMATION AND TELECOMMUNICATION, 2023, 7 (03) : 376 - 391
  • [3] Fault feature extraction by using adaptive chirplet transform
    Guo, Qianjin
    Yu, Haibin
    Hu, Jingtao
    WCICA 2006: SIXTH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION, VOLS 1-12, CONFERENCE PROCEEDINGS, 2006, : 5643 - 5647
  • [4] Analysis of moving source characteristics using polynomial chirplet transform
    Xu, Lingji
    Yang, Yixin
    Yu, Shiduo
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2015, 137 (04): : EL320 - EL326
  • [5] FEATURE EXTRACTION ALGORITHM USING NEW CEPSTRAL TECHNIQUES FOR ROBUST SPEECH RECOGNITION
    Korba, Mohamed Cherif Amara
    Bourouba, Houcine
    Djemili, Rafik
    MALAYSIAN JOURNAL OF COMPUTER SCIENCE, 2020, 33 (02) : 90 - 101
  • [6] Rolling Bearing Fault Feature Extraction Using Chirplet Decomposition Based on Genetic Algorithm
    Lin, Ying
    Jiang, Hongkai
    Hu, Yanan
    Wei, Dongdong
    2018 INTERNATIONAL CONFERENCE ON SENSING, DIAGNOSTICS, PROGNOSTICS, AND CONTROL (SDPC), 2018, : 79 - 84
  • [7] An improved speech feature extraction algorithm using DWT
    Wu, Xiang
    Tian, Feng
    Liu, Jingao
    2008 INTERNATIONAL CONFERENCE ON AUDIO, LANGUAGE AND IMAGE PROCESSING, VOLS 1 AND 2, PROCEEDINGS, 2008, : 1086 - 1090
  • [8] Subband feature extraction using lapped orthogonal transform for speech recognition
    Tufekci, Z
    Gowdy, JN
    2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING - VOL IV: SIGNAL PROCESSING FOR COMMUNICATIONS; VOL V: SIGNAL PROCESSING EDUCATION SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO & ELECTROACOUSTICS; VOL VI: SIGNAL PROCESSING THEORY & METHODS STUDENT FORUM, 2001, : 149 - 152
  • [10] Speech recognition using the extraction of particular feature by the discrete wavelet transform
    Midorikawa, Y
    Akita, M
    INTERNATIONAL JOURNAL OF APPLIED ELECTROMAGNETICS AND MECHANICS, 2001, 13 (1-4) : 13 - 18