Application of Teager Energy Operator on Linear and Mel Scales for Whispered Speech Recognition

被引:4
|
作者
Markovic, Branko R. [1 ]
Galic, Jovan [1 ]
Mijic, Miomir [1 ]
机构
[1] Sch Elect Engn, Dept Acoust, Blvd Kralja Aleksandra 73, Belgrade 11000, Serbia
关键词
Teager energy operator; cepstral mean subtraction; whispered speech recognition; linear scale; mel scale; dynamic time warping; hidden Markov models;
D O I
10.24425/118075
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper presents experimental results on whispered speech recognition based on Teager Energy Operator for linear and mel cepstral coefficients including the Cepstral Mean Subtraction normalization technique. The feature vectors taken into consideration are Linear Frequency Cepstral Coefficients, Teager Energy based Linear Frequency Cepstral Coefficients, Mel Frequency Cepstral Coefficients and Teager Energy based Mel Frequency Cepstral Coefficients. A speaker dependent scenario is used. For the recognition process, Dynamic Time Warping and Hidden Markov Models methods are applied. Results show a respectable improvement in whispered speech recognition as achieved by using the Teager Energy Operator with Cepstral Mean Subtraction.
引用
收藏
页码:3 / 9
页数:7
相关论文
共 50 条
  • [1] Recognition of Stress in Speech using Wavelet Analysis and Teager Energy Operator
    He, Ling
    Lech, Margaret
    Memon, Sheeraz
    Allen, Nicholas
    [J]. INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 605 - +
  • [2] Robust speech recognition in noisy backgrounds based on teager energy operator and auditory process
    Zhao, JH
    Kuang, JM
    Dai, QH
    [J]. CONFERENCE RECORD OF THE THIRTY-SEVENTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, VOLS 1 AND 2, 2003, : 550 - 554
  • [3] AUTOMATIC EMOTION RECOGNITION IN SPEECH SIGNAL USING TEAGER ENERGY OPERATOR AND MFCC FEATURES
    He, Ling
    Lech, Margaret
    Allen, Nicholas
    [J]. 2011 3RD INTERNATIONAL CONFERENCE ON COMPUTER TECHNOLOGY AND DEVELOPMENT (ICCTD 2011), VOL 3, 2012, : 695 - 699
  • [4] Stressed Speech Emotion Recognition using feature fusion of Teager Energy Operator and MFCC
    Bandela, Surekha Reddy
    Kumar, T. Kishore
    [J]. 2017 8TH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND NETWORKING TECHNOLOGIES (ICCCNT), 2017,
  • [5] APPLICATION OF NEURAL NETWORKS IN WHISPERED SPEECH RECOGNITION
    Grozdic, Dorde T.
    Markovic, Branko
    Galic, Jovan
    Jovicic, Slobodan T.
    [J]. 2012 20TH TELECOMMUNICATIONS FORUM (TELFOR), 2012, : 728 - 731
  • [6] Emotion Recognition of Stressed Speech using Teager Energy and Linear Prediction Features
    Reddy, Surekha B.
    Kumar, T. Kishore
    [J]. 2018 IEEE 18TH INTERNATIONAL CONFERENCE ON ADVANCED LEARNING TECHNOLOGIES (ICALT 2018), 2018, : 422 - 425
  • [7] Wavelet speech enhancement based on the Teager Energy operator
    Bahoura, M
    Rouat, J
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2001, 8 (01) : 10 - 12
  • [8] Teager_Mel and PLP Fusion Feature Based Speech Emotion Recognition
    Chen, Xiao
    Li, Haifeng
    Ma, Lin
    Liu, Xinlei
    Chen, Jing
    [J]. 2015 FIFTH INTERNATIONAL CONFERENCE ON INSTRUMENTATION AND MEASUREMENT, COMPUTER, COMMUNICATION AND CONTROL (IMCCC), 2015, : 1109 - 1114
  • [9] Research on Speech Emotion Recognition Based on Teager Energy Operator Coefficients and Inverted MFCC Feature Fusion
    Wang, Feifan
    Shen, Xizhong
    [J]. ELECTRONICS, 2023, 12 (17)
  • [10] Auditory feature representation using convolutional restricted Boltzmann machine and Teager energy operator for speech recognition
    Sailor, Hardik B.
    Patil, Hemant A.
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2017, 141 (06): : EL500 - EL506