Multi-Resolution Feature Extraction Algorithm in Emotional Speech Recognition

Cited by: 2

Authors:

Zelenik, Ales [1]

Kacic, Zdravko [2]

Affiliations:

[1] NXP Semicond Gratkorn GmbH, A-8101 Gratkorn, Austria

[2] Fac Elect Engn & Comp Sci, Maribor 2000, Slovenia

Source:

ELEKTRONIKA IR ELEKTROTECHNIKA | 2015 / Vol. 21 / No. 5

Keywords:

Speech; emotion recognition; segmentation; multi-resolution

DOI:

10.5755/j01.eee.21.5.13328

Chinese Library Classification (CLC) number:

TM [Electrical engineering]; TN [Electronic technology, communication technology]

Discipline classification code:

0808; 0809

Abstract:

In this paper, a new approach to recognizing emotional speech from audio recordings is presented. Choosing the processing window width for feature extraction requires a trade-off between time and frequency resolution, and this choice affects the recognition rate that can be achieved. We define a new procedure that combines the advantages of narrower and wider windows by dynamically adjusting the time and frequency resolution of individual features. To achieve higher recognition rates, two further procedures are added to the multi-resolution feature-extraction concept: the exclusion of features calculated on different processing window widths, and the use of only those parts of the recordings with the most explicit emotions. To confirm the benefits of the algorithm, audio recordings from the emotional speech database Interface were evaluated with four different classifiers. The highest emotion recognition rate of the multi-resolution approach exceeded that of the best single-resolution approach by 3.5 %, with an average improvement of 1.5 % in absolute terms.
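The window-width trade-off mentioned in the abstract can be illustrated with a minimal Python sketch. It is not the authors' algorithm: the feature (short-time energy), the window widths (10, 25, and 50 ms), the 50 % hop, and all function names are assumptions chosen only for illustration. It shows that a wider window gives finer frequency spacing but coarser time localization, and that one feature contour can be computed per window width.

```python
import math

def resolution(sample_rate, width):
    """Return (window span in seconds, DFT bin spacing in Hz) for `width` samples."""
    return width / sample_rate, sample_rate / width

def frame_signal(signal, width, hop):
    """Split a sequence into overlapping frames of `width` samples."""
    return [signal[i:i + width] for i in range(0, len(signal) - width + 1, hop)]

def short_time_energy(frame):
    """Mean squared amplitude of one frame (illustrative feature only)."""
    return sum(x * x for x in frame) / len(frame)

def multi_resolution_energy(signal, sample_rate, widths_ms):
    """Compute one energy contour per window width (hypothetical helper)."""
    features = {}
    for ms in widths_ms:
        width = int(sample_rate * ms / 1000)  # window width in samples
        hop = width // 2                      # assumed 50 % overlap
        frames = frame_signal(signal, width, hop)
        features[ms] = [short_time_energy(f) for f in frames]
    return features

if __name__ == "__main__":
    sample_rate = 16000  # assumed sampling rate
    # One second of a 200 Hz sine as a stand-in for a speech recording.
    signal = [math.sin(2 * math.pi * 200 * n / sample_rate)
              for n in range(sample_rate)]
    contours = multi_resolution_energy(signal, sample_rate, [10, 25, 50])
    for ms, contour in contours.items():
        span, bin_hz = resolution(sample_rate, int(sample_rate * ms / 1000))
        print(f"{ms} ms window: {len(contour)} frames, "
              f"{span:.3f} s span, {bin_hz:.1f} Hz bin spacing")
```

In a multi-resolution scheme, contours like these would be combined or selected per feature; how the paper performs that selection is not described in this record.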


Pages: 54-58

Page count: 5

