An evolutionary feature synthesis approach for content-based audio retrieval

被引:0
|
作者
Toni Mäkinen
Serkan Kiranyaz
Jenni Raitoharju
Moncef Gabbouj
机构
[1] Tampere University of Technology,Department of Signal Processing
关键词
Content-based retrieval; Evolutionary computation; Particle swarm optimization; Feature selection; Feature generation;
D O I
暂无
中图分类号
学科分类号
摘要
A vast amount of audio features have been proposed in the literature to characterize the content of audio signals. In order to overcome specific problems related to the existing features (such as lack of discriminative power), as well as to reduce the need for manual feature selection, in this article, we propose an evolutionary feature synthesis technique with a built-in feature selection scheme. The proposed synthesis process searches for optimal linear/nonlinear operators and feature weights from a pre-defined multi-dimensional search space to generate a highly discriminative set of new (artificial) features. The evolutionary search process is based on a stochastic optimization approach in which a multi-dimensional particle swarm optimization algorithm, along with fractional global best formation and heterogeneous particle behavior techniques, is applied. Unlike many existing feature generation approaches, the dimensionality of the synthesized feature vector is also searched and optimized within a set range in order to better meet the varying requirements set by many practical applications and classifiers. The new features generated by the proposed synthesis approach are compared with typical low-level audio features in several classification and retrieval tasks. The results demonstrate a clear improvement of up to 15–20% in average retrieval performance. Moreover, the proposed synthesis technique surpasses the synthesis performance of evolutionary artificial neural networks, exhibiting a considerable capability to accurately distinguish among different audio classes.
引用
收藏
相关论文
共 50 条
  • [21] Daubechies Wavelets Based Robust Audio Fingerprinting for Content-Based Audio Retrieval
    Sun, Wei
    Lu, Zhe-Ming
    Yu, Fa-Xin
    Shen, Rong-Jun
    INTERNATIONAL JOURNAL OF DIGITAL CRIME AND FORENSICS, 2012, 4 (02) : 49 - 69
  • [22] Symmetry feature in content-based image retrieval
    He, JR
    Li, MJ
    Zhang, HJ
    Zhang, CS
    ICIP: 2004 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1- 5, 2004, : 417 - 420
  • [23] Content-based audio retrieval using perceptual hash
    Li, Qiong
    Wu, Jing
    He, Xin
    2008 FOURTH INTERNATIONAL CONFERENCE ON INTELLIGENT INFORMATION HIDING AND MULTIMEDIA SIGNAL PROCESSING, PROCEEDINGS, 2008, : 791 - 794
  • [24] Feature selection for content-based image retrieval
    Esin Guldogan
    Moncef Gabbouj
    Signal, Image and Video Processing, 2008, 2 : 241 - 250
  • [25] Feature representation and compression for content-based retrieval
    Xie, H
    Ortega, A
    VISUAL COMMUNICATIONS AND IMAGE PROCESSING 2001, 2001, 4310 : 111 - 122
  • [26] Feature selection for content-based image retrieval
    Guldogan, Esin
    Gabbouj, Moncef
    SIGNAL IMAGE AND VIDEO PROCESSING, 2008, 2 (03) : 241 - 250
  • [27] Classification of general audio data for content-based retrieval
    Li, DG
    Sethi, IK
    Dimitrova, N
    McGee, T
    PATTERN RECOGNITION LETTERS, 2001, 22 (05) : 533 - 544
  • [28] Content-based audio retrieval using a generalized algorithm
    Piamsa-Nga, P
    Subramanya, SR
    Alexandridis, NA
    Srakaew, S
    Blankenship, G
    Papakonstantinou, G
    Tsanakas, P
    Tzafestas, S
    ADVANCES IN INTELLIGENT SYSTEMS: CONCEPTS, TOOLS AND APPLICATIONS, 1999, 21 : 231 - 242
  • [29] Content-based audio retrieval based on Gabor wavelet filtering
    Lin, RS
    Chen, LH
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2005, 19 (06) : 823 - 837
  • [30] An approach to content-based video retrieval
    Lee, AJT
    Hong, RW
    Chang, MF
    2004 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXP (ICME), VOLS 1-3, 2004, : 273 - 276