An evolutionary feature synthesis approach for content-based audio retrieval

Cited by: 0

Authors:

Toni Mäkinen

Serkan Kiranyaz

Jenni Raitoharju

Moncef Gabbouj

Affiliations:

[1] Tampere University of Technology, Department of Signal Processing

Source:

EURASIP Journal on Audio, Speech, and Music Processing, Volume 2012

Keywords:

Content-based retrieval; Evolutionary computation; Particle swarm optimization; Feature selection; Feature generation

DOI:

Not available

Abstract:

A large number of audio features have been proposed in the literature to characterize the content of audio signals. To overcome specific problems with the existing features (such as a lack of discriminative power), and to reduce the need for manual feature selection, in this article we propose an evolutionary feature synthesis technique with a built-in feature selection scheme. The proposed synthesis process searches for optimal linear/nonlinear operators and feature weights in a pre-defined multi-dimensional search space to generate a highly discriminative set of new (artificial) features. The evolutionary search process is based on a stochastic optimization approach in which a multi-dimensional particle swarm optimization algorithm is applied, along with fractional global best formation and heterogeneous particle behavior techniques. Unlike many existing feature generation approaches, the dimensionality of the synthesized feature vector is also searched and optimized within a set range, in order to better meet the varying requirements set by many practical applications and classifiers. The new features generated by the proposed synthesis approach are compared with typical low-level audio features in several classification and retrieval tasks. The results demonstrate a clear improvement of up to 15–20% in average retrieval performance. Moreover, the proposed synthesis technique surpasses the synthesis performance of evolutionary artificial neural networks, exhibiting a considerable capability to accurately distinguish among different audio classes.
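The core idea of the abstract, searching over feature weights and an operator to synthesize a more discriminative feature, can be illustrated with a small sketch. This is not the authors' method: it uses a plain global-best particle swarm with a fixed output dimension of one, and omits multi-dimensional search, fractional global best formation, and heterogeneous particle behavior. The operator set, the Fisher-ratio fitness, the toy data, and all names (`synthesize`, `fisher_ratio`, `pso_search`, `make_class`) are assumptions made for illustration only.

```python
import math
import random

# Candidate operators applied to the weighted sum (illustrative choice).
OPERATORS = [lambda s: s, math.tanh, lambda s: s * s]

def synthesize(position, x):
    """Map one input feature vector x to a single synthesized feature."""
    *weights, op_code = position
    # Decode the last coordinate into an operator index, clipped to range.
    op = OPERATORS[min(int(abs(op_code)), len(OPERATORS) - 1)]
    return op(sum(w * v for w, v in zip(weights, x)))

def fisher_ratio(position, class_a, class_b):
    """Squared mean gap over summed class variances (higher is better)."""
    a = [synthesize(position, x) for x in class_a]
    b = [synthesize(position, x) for x in class_b]
    mean_a, mean_b = sum(a) / len(a), sum(b) / len(b)
    var_a = sum((v - mean_a) ** 2 for v in a) / len(a)
    var_b = sum((v - mean_b) ** 2 for v in b) / len(b)
    return (mean_a - mean_b) ** 2 / (var_a + var_b + 1e-12)

def pso_search(class_a, class_b, n_particles=20, n_iters=60, seed=0):
    """Plain global-best PSO over [weights..., operator code]."""
    rng = random.Random(seed)
    dim = len(class_a[0]) + 1  # one weight per input feature + operator code
    pos = [[rng.uniform(-1, 1) for _ in range(dim - 1)]
           + [rng.uniform(0, len(OPERATORS))] for _ in range(n_particles)]
    vel = [[0.0] * dim for _ in range(n_particles)]
    pbest = [p[:] for p in pos]
    pbest_fit = [fisher_ratio(p, class_a, class_b) for p in pos]
    g = max(range(n_particles), key=pbest_fit.__getitem__)
    gbest, gbest_fit = pbest[g][:], pbest_fit[g]
    for _ in range(n_iters):
        for i in range(n_particles):
            for d in range(dim):
                r1, r2 = rng.random(), rng.random()
                # Inertia term plus pulls toward personal and global bests.
                vel[i][d] = (0.7 * vel[i][d]
                             + 1.5 * r1 * (pbest[i][d] - pos[i][d])
                             + 1.5 * r2 * (gbest[d] - pos[i][d]))
                pos[i][d] += vel[i][d]
            fit = fisher_ratio(pos[i], class_a, class_b)
            if fit > pbest_fit[i]:
                pbest[i], pbest_fit[i] = pos[i][:], fit
                if fit > gbest_fit:
                    gbest, gbest_fit = pos[i][:], fit
    return gbest, gbest_fit

def make_class(offset, n, rng):
    """Toy data: both features share a large common component u."""
    samples = []
    for _ in range(n):
        u = rng.gauss(0, 2)
        samples.append([u + offset + rng.gauss(0, 0.1), u + rng.gauss(0, 0.1)])
    return samples

data_rng = random.Random(1)
class_a = make_class(+0.5, 100, data_rng)
class_b = make_class(-0.5, 100, data_rng)

best, best_fit = pso_search(class_a, class_b)
baseline = fisher_ratio([1.0, 0.0, 0.0], class_a, class_b)  # raw feature 0
print(f"raw feature 0: {baseline:.3f}, synthesized: {best_fit:.3f}")
```

In this toy setup neither raw feature separates the classes well on its own, because both are dominated by the shared component, whereas a weighted difference of the two cancels it. The search is free to discover such a combination, which mirrors, in a very reduced form, the motivation stated in the abstract.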

