An evolutionary feature synthesis approach for content-based audio retrieval

被引:0
|
作者
Toni Mäkinen
Serkan Kiranyaz
Jenni Raitoharju
Moncef Gabbouj
机构
[1] Tampere University of Technology,Department of Signal Processing
关键词
Content-based retrieval; Evolutionary computation; Particle swarm optimization; Feature selection; Feature generation;
D O I
暂无
中图分类号
学科分类号
摘要
A vast amount of audio features have been proposed in the literature to characterize the content of audio signals. In order to overcome specific problems related to the existing features (such as lack of discriminative power), as well as to reduce the need for manual feature selection, in this article, we propose an evolutionary feature synthesis technique with a built-in feature selection scheme. The proposed synthesis process searches for optimal linear/nonlinear operators and feature weights from a pre-defined multi-dimensional search space to generate a highly discriminative set of new (artificial) features. The evolutionary search process is based on a stochastic optimization approach in which a multi-dimensional particle swarm optimization algorithm, along with fractional global best formation and heterogeneous particle behavior techniques, is applied. Unlike many existing feature generation approaches, the dimensionality of the synthesized feature vector is also searched and optimized within a set range in order to better meet the varying requirements set by many practical applications and classifiers. The new features generated by the proposed synthesis approach are compared with typical low-level audio features in several classification and retrieval tasks. The results demonstrate a clear improvement of up to 15–20% in average retrieval performance. Moreover, the proposed synthesis technique surpasses the synthesis performance of evolutionary artificial neural networks, exhibiting a considerable capability to accurately distinguish among different audio classes.
引用
收藏
相关论文
共 50 条
  • [1] An evolutionary feature synthesis approach for content-based audio retrieval
    Makinen, Toni
    Kiranyaz, Serkan
    Raitoharju, Jenni
    Gabbouj, Moncef
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2012,
  • [2] Evolutionary Feature Synthesis for Content-Based Audio Retrieval
    Kiranyaz, Serkan
    Raitoharju, Jenni
    Gabbouj, Moncef
    2013 FIRST INTERNATIONAL CONFERENCE ON COMMUNICATIONS SIGNAL PROCESSING, AND THEIR APPLICATIONS (ICCSPA'13), 2013,
  • [3] EVOLUTIONARY FEATURE GENERATION FOR CONTENT-BASED AUDIO CLASSIFICATION AND RETRIEVAL
    Makinen, Toni
    Kiranyaz, Serkan
    Pulkkinen, Jenni
    Gabbouj, Moncef
    2012 PROCEEDINGS OF THE 20TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2012, : 1474 - 1478
  • [4] Content-Based Image Retrieval of Skin Lesions by Evolutionary Feature Synthesis
    Ballerini, Lucia
    Li, Xiang
    Fisher, Robert B.
    Aldridge, Ben
    Rees, Jonathan
    APPLICATIONS OF EVOLUTIONARY COMPUTATION, PT I, PROCEEDINGS, 2010, 6024 : 312 - +
  • [5] MULTI-DIMENSIONAL EVOLUTIONARY FEATURE SYNTHESIS FOR CONTENT-BASED IMAGE RETRIEVAL
    Kiranyaz, Serkan
    Pulkkinen, Jenni
    Ince, Turker
    Gabbouj, Moncef
    2011 18TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2011,
  • [6] Content-Based Audio Classification and Retrieval Using Segmentation, Feature Extraction and Neural Network Approach
    Patil, Nilesh M.
    Nemade, Milind U.
    ADVANCES IN COMPUTER COMMUNICATION AND COMPUTATIONAL SCIENCES, IC4S 2018, 2019, 924 : 263 - 281
  • [7] Content-based classification and retrieval of audio
    Zhang, T
    Kuo, CCJ
    ADVANCED SIGNAL PROCESSING ALGORITHMS, ARCHITECTURES, AND IMPLEMENTATIONS VIII, 1998, 3461 : 432 - 443
  • [8] Content-based retrieval of music and audio
    Foote, JT
    MULTIMEDIA STORAGE AND ARCHIVING SYSTEMS II, 1997, 3229 : 138 - 147
  • [9] Features for Content-Based Audio Retrieval
    Mitrovic, Dalibor
    Zeppelzauer, Matthias
    Breiteneder, Christian
    ADVANCES IN COMPUTERS, VOL 78: IMPROVING THE WEB, 2010, 78 : 71 - 150
  • [10] Content-based audio classification and retrieval using the nearest feature line method
    Li, SZ
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2000, 8 (05): : 619 - 625