Content-based audio classification and segmentation by using support vector machines

被引:2
|
作者
Lie Lu
Hong-Jiang Zhang
Stan Z. Li
机构
[1] Microsoft Research Asia 5F Beijing Sigma Center,
[2] No.49 Zhichun Road Hai Dian District,undefined
[3] Beijing,undefined
[4] 100080,undefined
[5] China (e-mail: {llu,undefined
[6] hjzhang,undefined
[7] szli}@microsoft.com) ,undefined
来源
Multimedia Systems | 2003年 / 8卷
关键词
Key words: Audio content analysis, audio classification and segmentation, support vector machines;
D O I
暂无
中图分类号
学科分类号
摘要
Content-based audio classification and segmentation is a basis for further audio/video analysis. In this paper, we present our work on audio segmentation and classification which employs support vector machines (SVMs). Five audio classes are considered in this paper: silence, music, background sound, pure speech, and non- pure speech which includes speech over music and speech over noise. A sound stream is segmented by classifying each sub-segment into one of these five classes. We have evaluated the performance of SVM on different audio type-pairs classification with testing unit of different- length and compared the performance of SVM, K-Nearest Neighbor (KNN), and Gaussian Mixture Model (GMM). We also evaluated the effectiveness of some new proposed features. Experiments on a database composed of about 4- hour audio data show that the proposed classifier is very efficient on audio classification and segmentation. It also shows the accuracy of the SVM-based method is much better than the method based on KNN and GMM.
引用
收藏
页码:482 / 492
页数:10
相关论文
共 50 条
  • [1] Content-based audio classification and segmentation by using support vector machines
    Lu, L
    Zhang, HJ
    Li, SZ
    MULTIMEDIA SYSTEMS, 2003, 8 (06) : 482 - 491
  • [2] Content-based audio classification and retrieval by support vector machines
    Guo, GD
    Li, SZ
    IEEE TRANSACTIONS ON NEURAL NETWORKS, 2003, 14 (01): : 209 - 215
  • [3] Content-based audio classification using support vector machines and independent component analysis
    Wang, Jia-Ching
    Wang, Jhing-Fa
    Lin, Cai-Bei
    Jian, Kun-Ting
    Kuok, Wai-He
    18TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 4, PROCEEDINGS, 2006, : 157 - +
  • [4] Content-based video classification using support vector machines
    Suresh, V
    Mohan, CK
    Swamy, RK
    Yegnanarayana, B
    NEURAL INFORMATION PROCESSING, 2004, 3316 : 726 - 731
  • [5] Content-based affective image classification and retrieval using support vector machines
    Wu, QF
    Zhou, CL
    Wang, CN
    AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION, PROCEEDINGS, 2005, 3784 : 239 - 247
  • [6] Audio signal classification using support vector machines
    Chen, Lei-Ting
    Wang, Ming-Jen
    Wang, Chia-Jiu
    Tai, Heng-Ming
    ADVANCES IN NEURAL NETWORKS - ISNN 2006, PT 2, PROCEEDINGS, 2006, 3972 : 188 - 193
  • [7] Content-based Semantic Indexing of Image Using Fuzzy Support Vector Machines
    Li, Jianming
    Huang, Shuguang
    He, Rongsheng
    Qian, Kunming
    PROCEEDINGS OF THE 2008 CHINESE CONFERENCE ON PATTERN RECOGNITION (CCPR 2008), 2008, : 138 - 143
  • [8] Content-based image orientation detection with support vector machines
    Wang, YM
    Zhang, HJ
    IEEE WORKSHOP ON CONTENT-BASED ACCESS OF IMAGE AND VIDEO LIBRARIES, PROCEEDINGS, 2001, : 17 - 23
  • [9] Content-based image classification with wavelet relevance vector machines
    Arvind Tolambiya
    S. Venkataraman
    Prem K. Kalra
    Soft Computing, 2010, 14 (2) : 137 - 137
  • [10] Content-based image classification with wavelet relevance vector machines
    Arvind Tolambiya
    S. Venkatraman
    Prem K. Kalra
    Soft Computing, 2010, 14 : 129 - 136