Audio Classification and Retrieval Using Wavelets and Gaussian Mixture Models

Cited by: 1
Author
Chuan, Ching-Hua [1]
Affiliation
[1] Univ North Florida, Sch Comp, Coll Comp Engn & Construct, Jacksonville, FL 32224 USA
Keywords
Audio Classification; Compact Vector Representation; Gaussian Mixture Models; Retrieval; Wavelets
DOI
10.4018/jmdem.2013010101
Chinese Library Classification number
TP31 [Computer Software]
Discipline classification code
081202; 0835
Abstract
This paper presents an audio classification and retrieval system that uses wavelets to extract low-level acoustic features. The author performs multiple-level decomposition with the discrete wavelet transform to extract acoustic features from audio recordings at different scales and times. The extracted features are then translated into a compact vector representation. Gaussian mixture models trained with the expectation-maximization algorithm are used to build models for audio classes and individual audio examples. The system is evaluated on three audio classification tasks: speech/music, male/female speech, and music genre. The author also shows how wavelets and Gaussian mixture models are used for class-based audio retrieval in two approaches: indexing using only wavelets versus indexing by Gaussian components. Evaluating the system through 10-fold cross-validation, the author shows the promising capability of wavelets and Gaussian mixture models for audio classification and retrieval. The author also compares how parameters including frame size, wavelet level, number of Gaussian components, and sampling size affect the performance of the Gaussian models.
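As a rough illustration of the feature-extraction step the abstract describes, the sketch below runs a multiple-level discrete wavelet decomposition on one audio frame and summarises each subband into a short feature vector. The abstract does not name the wavelet family or the statistics used for the compact vector, so the Haar wavelet, the mean-absolute-value and standard-deviation summaries, and the function names `haar_dwt_step`, `haar_wavedec`, and `compact_vector` are all assumptions made for this example, not the author's implementation.

```python
import math

def haar_dwt_step(signal):
    """One level of the orthonormal Haar DWT: returns (approximation, detail)."""
    if len(signal) % 2:
        # Assumption: pad odd-length input by repeating the last sample.
        signal = signal + [signal[-1]]
    s = 1.0 / math.sqrt(2.0)
    approx = [(signal[i] + signal[i + 1]) * s for i in range(0, len(signal), 2)]
    detail = [(signal[i] - signal[i + 1]) * s for i in range(0, len(signal), 2)]
    return approx, detail

def haar_wavedec(signal, levels):
    """Multi-level decomposition: returns [approx_L, detail_L, ..., detail_1]."""
    approx = list(signal)
    details = []
    for _ in range(levels):
        if len(approx) < 2:
            break  # nothing left to split
        approx, detail = haar_dwt_step(approx)
        details.append(detail)
    return [approx] + details[::-1]

def compact_vector(frame, levels=3):
    """Summarise each subband by mean absolute value and standard deviation.

    These two statistics are a hypothetical choice; the abstract does not
    say how the compact vector representation is built.
    """
    features = []
    for band in haar_wavedec(frame, levels):
        mean_abs = sum(abs(c) for c in band) / len(band)
        mean = sum(band) / len(band)
        std = math.sqrt(sum((c - mean) ** 2 for c in band) / len(band))
        features.extend([mean_abs, std])
    return features

# Toy 64-sample frame: a sinusoid standing in for real audio.
frame = [math.sin(2 * math.pi * 5 * n / 64) for n in range(64)]
vec = compact_vector(frame, levels=3)
print(len(vec))  # 4 subbands x 2 statistics = 8
```

Vectors of this kind, computed per frame, could then be used to fit one Gaussian mixture model per audio class with expectation maximization (for example with scikit-learn's `GaussianMixture`), and a new recording would be assigned to the class whose model gives the highest likelihood. That classification step is outlined here only in prose and follows the abstract's description in general terms.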
Pages: 1-20
Page count: 20