Audio Classification and Retrieval Using Wavelets and Gaussian Mixture Models

被引:1
|
作者
Chuan, Ching-Hua [1 ]
机构
[1] Univ North Florida, Sch Comp, Coll Comp Engn & Construct, Jacksonville, FL 32224 USA
关键词
Audio Classification; Compact Vector Representation; Gaussian Mixture Models; Retrieval; Wavelets;
D O I
10.4018/jmdem.2013010101
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
This paper presents an audio classification and retrieval system using wavelets for extracting low-level acoustic features. The author performed multiple-level decomposition using discrete wavelet transform to extract acoustic features from audio recordings at different scales and times. The extracted features are then translated into a compact vector representation. Gaussian mixture models with expectation maximization algorithm are used to build models for audio classes and individual audio examples. The system is evaluated using three audio classification tasks: speech/music, male/female speech, and music genre. They also show how wavelets and Gaussian mixture models are used for class-based audio retrieval in two approaches: indexing using only wavelets versus indexing by Gaussian components. By evaluating the system through 10-fold cross-validation, the author shows the promising capability of wavelets and Gaussian mixture models for audio classification and retrieval. They also compare how parameters including frame size, wavelet level, Gaussian components, and sampling size affect performance in Gaussian models.
引用
收藏
页码:1 / 20
页数:20
相关论文
共 50 条
  • [31] Compressed domain image retrieval using JPEG2000 and Gaussian mixture models
    Teynor, A
    Müller, W
    Kowarschick, W
    VISUAL INFORMATION AND INFORMATION SYSTEMS, 2006, 3736 : 132 - 142
  • [32] A relevance feedback approach for content based image retrieval using Gaussian mixture models
    Marakakis, Apostolos
    Galatsanos, Nikolaos
    Likas, Aristidis
    Stafylopatis, Andreas
    ARTIFICIAL NEURAL NETWORKS - ICANN 2006, PT 2, 2006, 4132 : 84 - 93
  • [33] Erratum to: Weighted subspace modeling for semantic concept retrieval using gaussian mixture models
    Chao Chen
    Mei-Ling Shyu
    Shu-Ching Chen
    Information Systems Frontiers, 2018, 20 : 417 - 417
  • [34] Color texture image retrieval based on Gaussian copula models of Gabor wavelets
    Li, Chaorong
    Huang, Yuanyuan
    Zhu, Lihong
    PATTERN RECOGNITION, 2017, 64 : 118 - 129
  • [35] Hyperspectral Image Classification Using Gaussian Mixture Models and Markov Random Fields
    Li, Wei
    Prasad, Saurabh
    Fowler, James E.
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2014, 11 (01) : 153 - 157
  • [36] Continuous classification of myoelectric signals for powered prostheses using Gaussian mixture models
    Chan, ADC
    Englehart, KB
    PROCEEDINGS OF THE 25TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-4: A NEW BEGINNING FOR HUMAN HEALTH, 2003, 25 : 2841 - 2844
  • [37] Real Life Emotion Classification using Spectral Features and Gaussian Mixture Models
    Koolagudi, Shashidhar G.
    Barthwal, Anurag
    Devliyal, Swati
    Rao, K. Sreenivasa
    INTERNATIONAL CONFERENCE ON MODELLING OPTIMIZATION AND COMPUTING, 2012, 38 : 3892 - 3899
  • [38] Moving Vehicle Classification Using Pixel Quantity Based on Gaussian Mixture Models
    Putra, Bayu Charisma
    Setiyono, Budi
    Sulistyaningrum, Dwi Ratna
    Soetrisno
    Mukhlash, Imam
    PROCEEDINGS OF 2018 3RD INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATION SYSTEMS (ICCCS), 2018, : 254 - 257
  • [39] Vehicle acoustic classification in netted sensor systems using Gaussian mixture models
    Necioglu, BF
    Christou, CT
    George, EB
    Jacyna, CM
    SIGNAL PROCESSING, SENSOR FUSION, AND TARGET RECOGNITION XIV, 2005, 5809 : 409 - 419
  • [40] Classification of Data Generated by Gaussian Mixture Models Using Deep ReLU Networks
    Zhou, Tian-Yi
    Huo, Xiaoming
    JOURNAL OF MACHINE LEARNING RESEARCH, 2024, 25 : 1 - 54