Application of Teager Energy Operator on Linear and Mel Scales for Whispered Speech Recognition

Cited by: 4
Authors
Markovic, Branko R. [1 ]
Galic, Jovan [1 ]
Mijic, Miomir [1 ]
Affiliations
[1] Sch Elect Engn, Dept Acoust, Blvd Kralja Aleksandra 73, Belgrade 11000, Serbia
Keywords
Teager energy operator; cepstral mean subtraction; whispered speech recognition; linear scale; mel scale; dynamic time warping; hidden Markov models
DOI
10.24425/118075
Chinese Library Classification number
O42 [Acoustics]
Subject classification codes
070206; 082403
Abstract
This paper presents experimental results on whispered speech recognition using the Teager Energy Operator applied to linear and mel cepstral coefficients, together with the Cepstral Mean Subtraction normalization technique. Four feature vectors are considered: Linear Frequency Cepstral Coefficients, Teager Energy based Linear Frequency Cepstral Coefficients, Mel Frequency Cepstral Coefficients, and Teager Energy based Mel Frequency Cepstral Coefficients. A speaker-dependent scenario is used. Recognition is performed with Dynamic Time Warping and Hidden Markov Models. The results show a respectable improvement in whispered speech recognition when the Teager Energy Operator is used with Cepstral Mean Subtraction.
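As an illustration of the two operations named in the abstract, the following is a minimal sketch, not the authors' pipeline. It assumes the standard discrete Teager Energy Operator, psi[n] = x[n]^2 - x[n-1] * x[n+1], and per-utterance Cepstral Mean Subtraction over a list of cepstral frames. The function names `teager_energy` and `cepstral_mean_subtraction` are hypothetical, and how the paper combines these steps with the linear and mel filter banks is not stated in this record.

```python
import math
from typing import List, Sequence


def teager_energy(x: Sequence[float]) -> List[float]:
    """Discrete Teager Energy Operator: psi[n] = x[n]^2 - x[n-1] * x[n+1].

    Only interior samples have both neighbours, so the output
    has len(x) - 2 values (assumed edge handling, not from the paper).
    """
    return [x[n] * x[n] - x[n - 1] * x[n + 1] for n in range(1, len(x) - 1)]


def cepstral_mean_subtraction(
    frames: Sequence[Sequence[float]],
) -> List[List[float]]:
    """Subtract the per-coefficient mean taken over all frames of one utterance."""
    if not frames:
        return []
    n_frames = len(frames)
    n_coeffs = len(frames[0])
    # Mean of each cepstral coefficient across time.
    means = [sum(f[k] for f in frames) / n_frames for k in range(n_coeffs)]
    return [[f[k] - means[k] for k in range(n_coeffs)] for f in frames]


# Example: for a pure sinusoid A*cos(W*n), the operator returns the
# constant A^2 * sin(W)^2 at every interior sample.
signal = [2.0 * math.cos(0.3 * n) for n in range(8)]
energy = teager_energy(signal)

# Example: two frames of two coefficients each; the column means are removed.
normalized = cepstral_mean_subtraction([[1.0, 2.0], [3.0, 6.0]])
```

The sinusoid case shows why the operator is described as tracking both amplitude and frequency: its output depends on A and W together rather than on amplitude alone.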
Pages: 3-9
Number of pages: 7
Related papers
50 in total
  • [41] Teager Energy Subband Filtered Features for Near and Far-Field Automatic Speech Recognition
    Kamble, Madhu R.
    Nayak, Shekhar
    Shaik, M. Ali Basha
    Rath, Shakti P.
    Vij, Vikram
    Patil, Hemant A.
    [J]. 2021 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2021, : 491 - 496
  • [42] Data-Driven Temporal Filtering on Teager Energy Time Trajectory for Robust Speech Recognition
    赵军辉
    谢湘
    匡镜明
    [J]. Journal of Beijing Institute of Technology, 2006, (02) : 195 - 200
  • [43] Teager Energy Operator and its Application in the Study of Induction Motor Rotor Broken Bars Fault
    Yin, Shihua
    Hu, Niaoqing
    Chen, Ling
    Hu, Lei
    [J]. 2015 PROGNOSTICS AND SYSTEM HEALTH MANAGEMENT CONFERENCE (PHM), 2015,
  • [44] Bi-mel-scale frequency cepstrum and its application in telephone speech recognition
    CHEN Jingdong
    XU Bo
    HUANG Taiyi (National Laboratory of Pattern Recognition
    [J]. Chinese Journal of Acoustics, 1998, (03) : 234 - 243
  • [45] Application of Empirical Mode Decomposition and Teager Energy Operator to EEG Signals for Mental Task Classification
    Kaleem, M. F.
    Sugavaneswaran, L.
    Guergachi, A.
    Krishnan, S.
    [J]. 2010 ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2010, : 4590 - 4593
  • [46] Teager energy operator for multi-modulation extraction and its application for gearbox fault detection
    Bozchalooi, I. Soltani
    Liang, Ming
    [J]. SMART MATERIALS AND STRUCTURES, 2010, 19 (07)
  • [47] Data-driven Rescaled Teager Energy Cepstral Coefficients for Noise-robust Speech Recognition
    Hsu, Miau-Luan
    Chen, Chia-Ping
    [J]. 2012 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2012,
  • [48] RECOGNITION OF VESSEL ACOUSTIC SIGNATURES USING NON-LINEAR TEAGER ENERGY BASED FEATURES
    Can, Gokmen
    Akbas, Cem Emre
    Cetin, A. Enis
    [J]. 2016 INTERNATIONAL WORKSHOP ON COMPUTATIONAL INTELLIGENCE FOR MULTIMEDIA UNDERSTANDING (IWCIM), 2016,
  • [49] HYBRID WAVELET PACKET-TEAGER ENERGY OPERATOR ANALYSIS AND ITS APPLICATION FOR GEARBOX FAULT DIAGNOSIS
    LIU Xiaofeng
    QIN Shuren
    BO Lin (College of Mechanical Engineering)
    [J]. Chinese Journal of Mechanical Engineering, 2007, (06) : 79 - 83
  • [50] Application of a Variation of Empirical Mode Decomposition and Teager Energy Operator to EEG Signals for Mental Task Classification
    Kaleem, M.
    Guergachi, A.
    Krishnan, S.
    [J]. 2013 35TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2013, : 965 - 968