Multi-Resolution Feature Extraction Algorithm in Emotional Speech Recognition

被引:2
|
作者
Zelenik, Ales [1 ]
Kacic, Zdravko [2 ]
机构
[1] NXP Semicond Gratkorn GmbH, A-8101 Gratkorn, Austria
[2] Fac Elect Engn & Comp Sci, Maribor 2000, Slovenia
关键词
Speech; emotion recognition; segmentation; multi-resolution;
D O I
10.5755/j01.eee.21.5.13328
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper a new approach for recognizing emotional speech from audio recordings is presented. In order to obtain the optimum processing window width for feature extraction and to achieve the highest level of recognition rates, a trade-off between time and frequency resolution must be made. At this point, we define a new procedure that combines the advantages of narrower and wider windows and takes advantage of dynamic adjustment of the time and frequency resolution of individual feature characteristics. To achieve higher recognition rates two major procedures are added to the multi-resolution feature-extraction concept, one being the exclusion of features calculated on different processing window widths and the other the idea to use only the parts of recordings with most explicit emotions. To confirm the benefits of the algorithm the audio recordings from the emotional speech database Interface along with four different classifiers were used in evaluation. The highest level of emotion recognition rate with multi-resolution approach exceeded the recognition rate of the best single-resolution approach by 3.5 % with the average improvement of 1.5 % in absolute terms.
引用
收藏
页码:54 / 58
页数:5
相关论文
共 50 条
  • [1] Fuzzy clustering recognition algorithm of medical image with multi-resolution feature
    Wang Bo
    Wang Ying
    Cui Lijie
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2020, 32 (01):
  • [2] Reduced Feature Extraction for Emotional Speech Recognition
    Palo, Hemanta Kumar
    Mohanty, Mihir Narayan
    2015 ANNUAL IEEE INDIA CONFERENCE (INDICON), 2015,
  • [3] A committee of networks classifier with multi-resolution feature extraction for automatic target recognition
    Wang, LC
    Der, S
    Nasrabadi, NM
    1997 IEEE INTERNATIONAL CONFERENCE ON NEURAL NETWORKS, VOLS 1-4, 1997, : 1596 - 1601
  • [4] Multi-resolution feature fusion for face recognition
    Pong, Kuong-Hon
    Lam, Kin-Man
    PATTERN RECOGNITION, 2014, 47 (02) : 556 - 567
  • [5] Multi-resolution feature extraction in human face
    Song, Y
    He, K
    Zhou, JL
    Liu, ZM
    Li, K
    ICIA 2004: Proceedings of 2004 International Conference on Information Acquisition, 2004, : 417 - 421
  • [6] MULTI-QUARTZNET: MULTI-RESOLUTION CONVOLUTION FOR SPEECH RECOGNITION WITH MULTI-LAYER FEATURE FUSION
    Luo, Jian
    Wang, Jianzong
    Cheng, Ning
    Jiang, Guilin
    Xiao, Jing
    2021 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP (SLT), 2021, : 82 - 88
  • [7] Occlusion recognition algorithm based on multi-resolution feature auto-selection
    Xie X.
    Lai G.
    Na Z.
    Luo X.
    Wang D.
    Beijing Hangkong Hangtian Daxue Xuebao/Journal of Beijing University of Aeronautics and Astronautics, 2022, 48 (07): : 1154 - 1163
  • [8] Multi-resolution local moment feature for gait recognition
    Shi, Cui-Ping
    Li, Hong-Gui
    Lian, Xu
    Li, Xing-Guo
    PROCEEDINGS OF 2006 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2006, : 3709 - +
  • [9] Data Augmentation Using Virtual Microphone Array Synthesis and Multi-Resolution Feature Extraction for Isolated Word Dysarthric Speech Recognition
    Mariya Celin, T.A.
    Nagarajan, T.
    Vijayalakshmi, P.
    Celin, T.A.M. (mariyacelinta@ssn.edu.in), 1600, Institute of Electrical and Electronics Engineers Inc., United States (14): : 346 - 354
  • [10] Intelligent flow feature extraction and multi-resolution visualization
    College of Computer Science and Technology, National University of Defense Technology, Changsha 410073, China
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao, 2008, 5 (571-576):