Content-based audio classification and segmentation by using support vector machines

Cited by: 2
Authors
Lie Lu
Hong-Jiang Zhang
Stan Z. Li
Affiliation
[1] Microsoft Research Asia, 5F Beijing Sigma Center, No. 49 Zhichun Road, Hai Dian District, Beijing 100080, China (e-mail: {llu, hjzhang, szli}@microsoft.com)
Source
Multimedia Systems | 2003 / Vol. 8
Keywords
Audio content analysis, audio classification and segmentation, support vector machines
DOI
Not available
Chinese Library Classification number
Subject classification code
Abstract
Content-based audio classification and segmentation is a basis for further audio/video analysis. In this paper, we present our work on audio segmentation and classification using support vector machines (SVMs). Five audio classes are considered: silence, music, background sound, pure speech, and non-pure speech, which includes speech over music and speech over noise. A sound stream is segmented by classifying each sub-segment into one of these five classes. We evaluate the performance of SVMs on classifying different pairs of audio types with testing units of different lengths, and compare the performance of SVM, K-Nearest Neighbor (KNN), and Gaussian Mixture Model (GMM) classifiers. We also evaluate the effectiveness of several newly proposed features. Experiments on a database of about 4 hours of audio data show that the proposed classifier is very efficient for audio classification and segmentation, and that the accuracy of the SVM-based method is much better than that of the KNN- and GMM-based methods.
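The segmentation-by-classification step described in the abstract can be illustrated with a minimal sketch. It assumes that some classifier (an SVM in the paper, not implemented here) has already assigned one of the five classes to each fixed-length sub-segment. The function name `labels_to_segments`, the 1-second unit length, and the sample labels are hypothetical and are not taken from the paper.

```python
from itertools import groupby

# The five audio classes named in the abstract.
CLASSES = ("silence", "music", "background sound", "pure speech", "non-pure speech")


def labels_to_segments(labels, unit_seconds=1.0):
    """Merge runs of identical sub-segment labels into (start, end, label) segments.

    `unit_seconds` is an assumed sub-segment length, not a value from the abstract.
    """
    segments = []
    position = 0  # index of the first sub-segment in the current run
    for label, run in groupby(labels):
        if label not in CLASSES:
            raise ValueError(f"unknown class: {label!r}")
        length = sum(1 for _ in run)
        segments.append(
            (position * unit_seconds, (position + length) * unit_seconds, label)
        )
        position += length
    return segments


# Hypothetical classifier output for six consecutive sub-segments.
predicted = ["silence", "music", "music", "pure speech", "pure speech", "pure speech"]
segments = labels_to_segments(predicted)
for start, end, label in segments:
    print(f"{start:5.1f}s - {end:5.1f}s  {label}")
```

Segment boundaries fall wherever the predicted class changes between neighboring sub-segments, so the accuracy of the boundaries depends directly on the per-unit classification accuracy and on the chosen unit length.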
Pages: 482-492
Page count: 10
Related papers
50 in total
  • [31] An application of one-class support vector machines in content-based image retrieval
    Seo, Kwang-Kyu
    EXPERT SYSTEMS WITH APPLICATIONS, 2007, 33 (02) : 491 - 498
  • [32] Ensemble one-class support vector machines for content-based image retrieval
    Wu, Roung-Shiunn
    Chung, Wen-Hsin
    EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (03) : 4451 - 4459
  • [33] A CASE STUDY ON FEATURE SENSITIVITY FOR AUDIO EVENT CLASSIFICATION USING SUPPORT VECTOR MACHINES
    Martin-Morato, Irene
    Cobos, Maximo
    Ferri, Francesc J.
    2016 IEEE 26TH INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2016
  • [34] An automatic classification of bird species using audio feature extraction and support vector machines
    Rai, Pallavi
    Golchha, Vikram
    Srivastava, Aishwarya
    Vyas, Garima
    Mishra, Sourav
    2016 INTERNATIONAL CONFERENCE ON INVENTIVE COMPUTATION TECHNOLOGIES (ICICT), VOL 1, 2016, : 144 - 148
  • [35] Hierarchical system for content-based audio classification and retrieval
    Zhang, T
    Kuo, CCJ
    MULTIMEDIA STORAGE AND ARCHIVING SYSTEMS III, 1998, 3527 : 398 - 409
  • [36] A study on content-based classification and retrieval of audio database
    Liu, MC
    Wan, CR
    2001 INTERNATIONAL DATABASE ENGINEERING & APPLICATIONS SYMPOSIUM, PROCEEDINGS, 2001, : 339 - 345
  • [37] Content-based audio classification with generalized ellipsoid distance
    Cheng, CC
    Hsu, CT
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2002, PROCEEDING, 2002, 2532 : 328 - 335
  • [38] Applying neural network on the content-based audio classification
    Shao, X
    Xu, CS
    Kankanhalli, MS
    ICICS-PCM 2003, VOLS 1-3, PROCEEDINGS, 2003, : 1821 - 1825
  • [39] Classification of general audio data for content-based retrieval
    Li, DG
    Sethi, IK
    Dimitrova, N
    McGee, T
    PATTERN RECOGNITION LETTERS, 2001, 22 (05) : 533 - 544
  • [40] Segmentation of images using support vector machines
    Chen, QY
    Yang, Q
    PROCEEDINGS OF THE 2004 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2004, : 3304 - 3306