Content-based audio classification and segmentation by using support vector machines

Cited by: 2
Authors
Lie Lu
Hong-Jiang Zhang
Stan Z. Li
Affiliation
Microsoft Research Asia, 5F Beijing Sigma Center, No. 49 Zhichun Road, Hai Dian District, Beijing 100080, China (e-mail: {llu, hjzhang, szli}@microsoft.com)
Source
Multimedia Systems | 2003 / Vol. 8
Keywords
Audio content analysis, audio classification and segmentation, support vector machines
DOI
Not available
Abstract
Content-based audio classification and segmentation is a basis for further audio/video analysis. In this paper, we present our work on audio segmentation and classification using support vector machines (SVMs). Five audio classes are considered: silence, music, background sound, pure speech, and non-pure speech, which includes speech over music and speech over noise. A sound stream is segmented by classifying each sub-segment into one of these five classes. We evaluate the performance of the SVM on classifying different pairs of audio types with test units of different lengths, and compare the performance of SVM, K-Nearest Neighbor (KNN), and Gaussian Mixture Model (GMM) classifiers. We also evaluate the effectiveness of several newly proposed features. Experiments on a database of about 4 hours of audio data show that the proposed classifier is very efficient at audio classification and segmentation, and that the accuracy of the SVM-based method is much better than that of the methods based on KNN and GMM.
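The segmentation step described in the abstract (classify each sub-segment, then read segments off the label sequence) can be sketched as follows. This is only an illustrative sketch, not the authors' implementation: the function name `segment_stream`, the 1-second unit length, and the sample label sequence are assumptions, and the per-unit labels are taken as already produced by some classifier such as an SVM.

```python
from itertools import groupby

# The five classes named in the abstract.
CLASSES = ("silence", "music", "background sound", "pure speech", "non-pure speech")


def segment_stream(unit_labels, unit_seconds=1.0):
    """Merge runs of identical per-unit labels into (start, end, label) segments.

    unit_labels  -- one class label per fixed-length sub-segment, in time order
    unit_seconds -- assumed length of each sub-segment (hypothetical default)
    """
    segments = []
    start = 0  # index of the first unit in the current run
    for label, run in groupby(unit_labels):
        if label not in CLASSES:
            raise ValueError(f"unknown class: {label!r}")
        length = sum(1 for _ in run)
        segments.append((start * unit_seconds, (start + length) * unit_seconds, label))
        start += length
    return segments


# Hypothetical classifier output for a 6-second stream.
labels = ["silence", "music", "music", "pure speech", "pure speech", "pure speech"]
segments = segment_stream(labels)
for seg_start, seg_end, seg_label in segments:
    print(f"{seg_start:5.1f}s - {seg_end:5.1f}s  {seg_label}")
```

Segment boundaries fall wherever the predicted class changes between adjacent units, so the time resolution of the segmentation is bounded by the unit length.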
Pages: 482-492
Page count: 10
Related papers
50 in total
  • [41] Detection of abrupt spectral changes using support vector machines: an application to audio signal segmentation
    Davy, M
    Godsill, S
    2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 1313 - 1316
  • [42] Integration of audio and visual information for content-based video segmentation
    Huang, JC
    Liu, Z
    Wang, Y
    1998 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING - PROCEEDINGS, VOL 3, 1998, : 526 - 530
  • [43] Audio classification and segmentation for sports video structure extraction using support vector machine
    Bai, Liang
    Lao, Song-Yang
    Liao, Hu-Xiong
    Chen, Jian-Yun
    PROCEEDINGS OF 2006 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2006, : 3303 - +
  • [44] Content-based audio classification and retrieval using the nearest feature line method
    Li, SZ
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2000, 8 (05): : 619 - 625
  • [45] Content-Based Image Retrieval Technique by Combining Swarm Optimization Algorithm and Support Vector Machines
    Seo, Kwang-Kyu
    JOURNAL OF COMPUTATIONAL AND THEORETICAL NANOSCIENCE, 2013, 10 (08) : 1693 - 1700
  • [46] Bag classification using support vector machines
    Kartoun, Uri
    Stern, Helman
    Edan, Yael
    APPLIED SOFT COMPUTING TECHNOLOGIES: THE CHALLENGE OF COMPLEXITY, 2006, 34 : 665 - 674
  • [47] Wafer Classification Using Support Vector Machines
    Baly, Ramy
    Hajj, Hazem
    IEEE TRANSACTIONS ON SEMICONDUCTOR MANUFACTURING, 2012, 25 (03) : 373 - 383
  • [48] A CBIR CLASSIFICATION USING SUPPORT VECTOR MACHINES
    Sugamya, Katta
    Pabboju, Suresh
    Babu, A. Vinaya
    2016 INTERNATIONAL CONFERENCE ON ADVANCES IN HUMAN MACHINE INTERACTION (HMI), 2016, : 135 - +
  • [49] Classification of Torreya Using Support Vector Machines
    Wang, Xiaodong
    Chang, Jianli
    2012 THIRD INTERNATIONAL CONFERENCE ON TELECOMMUNICATION AND INFORMATION (TEIN 2012), 2012, : 212 - 216
  • [50] Cloud classification using support vector machines
    Azimi-Sadjadi, MR
    Zekavat, SA
    IGARSS 2000: IEEE 2000 INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, VOL I - VI, PROCEEDINGS, 2000, : 669 - 671