Content-based audio classification and segmentation by using support vector machines

被引:2
|
作者
Lie Lu
Hong-Jiang Zhang
Stan Z. Li
机构
[1] Microsoft Research Asia 5F Beijing Sigma Center,
[2] No.49 Zhichun Road Hai Dian District,undefined
[3] Beijing,undefined
[4] 100080,undefined
[5] China (e-mail: {llu,undefined
[6] hjzhang,undefined
[7] szli}@microsoft.com) ,undefined
来源
Multimedia Systems | 2003年 / 8卷
关键词
Key words: Audio content analysis, audio classification and segmentation, support vector machines;
D O I
暂无
中图分类号
学科分类号
摘要
Content-based audio classification and segmentation is a basis for further audio/video analysis. In this paper, we present our work on audio segmentation and classification which employs support vector machines (SVMs). Five audio classes are considered in this paper: silence, music, background sound, pure speech, and non- pure speech which includes speech over music and speech over noise. A sound stream is segmented by classifying each sub-segment into one of these five classes. We have evaluated the performance of SVM on different audio type-pairs classification with testing unit of different- length and compared the performance of SVM, K-Nearest Neighbor (KNN), and Gaussian Mixture Model (GMM). We also evaluated the effectiveness of some new proposed features. Experiments on a database composed of about 4- hour audio data show that the proposed classifier is very efficient on audio classification and segmentation. It also shows the accuracy of the SVM-based method is much better than the method based on KNN and GMM.
引用
收藏
页码:482 / 492
页数:10
相关论文
共 50 条
  • [21] A Classification-Based Segmentation of cDNA Microarray Images using Support Vector Machines
    Giannakeas, Nikolaos
    Karvelis, Petros S.
    Fotiadis, Dimitrios I.
    2008 30TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-8, 2008, : 875 - +
  • [22] Relevance Feedback for Content-Based Image Retrieval Using Support Vector Machines and Feature Selection
    Marakakis, Apostolos
    Galatsanos, Nikolaos
    Likas, Aristidis
    Stafylopatis, Andreas
    ARTIFICIAL NEURAL NETWORKS - ICANN 2009, PT I, 2009, 5768 : 942 - +
  • [23] Update relevant image weights for content-based image retrieval using support vector machines
    Tian, Q
    Hong, PY
    Huang, TS
    2000 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, PROCEEDINGS VOLS I-III, 2000, : 1199 - 1202
  • [24] Content-Based Classification and Segmentation of Mixed-Type Audio by Using MPEG-7 Features
    Dogan, Ebru
    Sert, Mustafa
    Yazicit, Adnan
    2009 FIRST INTERNATIONAL CONFERENCE ON ADVANCES IN MULTIMEDIA, 2009, : 152 - +
  • [25] Incorporate support vector machines to content-based image retrieval with relevant feedback
    Hong, PY
    Tian, Q
    Huang, TS
    2000 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL III, PROCEEDINGS, 2000, : 750 - 753
  • [26] Content-based classification, search, and retrieval of audio
    Wold, E
    Blum, T
    Keislar, D
    Wheaton, J
    IEEE MULTIMEDIA, 1996, 3 (03) : 27 - 36
  • [27] Probabilistic segmentation of time-series audio signals using Support Vector Machines
    Kalantarian, Haik
    Mortazavi, Bobak
    Pourhomayoun, Mohammad
    Alshurafa, Nabil
    Sarrafzadeh, Majid
    MICROPROCESSORS AND MICROSYSTEMS, 2016, 46 : 96 - 104
  • [28] Vector image segmentation for content-based vector image retrieval
    Hayashi, Takahiro
    Onai, Rikio
    Abe, Koji
    2007 CIT: 7TH IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION TECHNOLOGY, PROCEEDINGS, 2007, : 695 - +
  • [29] Entropy-based active learning with support vector machines for content-based image retrieval
    Jing, F
    Li, MJ
    Zhang, HJ
    Zhang, B
    2004 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXP (ICME), VOLS 1-3, 2004, : 85 - 88
  • [30] Content-based Image Retrieval by Exploring Bandletized Regions through Support Vector Machines
    Ashraf, Rehan
    Bashir, Khalid
    Mahmood, Toqeer
    JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2016, 32 (02) : 245 - 269