A new approach for classification of generic audio data

被引:4
|
作者
Lin, RS [1 ]
Chen, LH [1 ]
机构
[1] Natl Chiao Tung Univ, Dept Comp & Informat Sci, Hsinchu 30050, Taiwan
关键词
audio classification; spectrogram; Bayesian decision function; multivariable Gaussian distribution;
D O I
10.1142/S0218001405003958
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The existing audio retrieval systems fall into one of two categories: single-domain systems that can accept data of only a single type (e.g. speech) or multiple-domain systems that offer content-based retrieval for multiple types of audio data. Since a single-domain system has limited applications, a multiple-domain system will be more useful. However, different types of audio data will have different properties, this will make a multiple-domain system harder to be developed. If we can classify audio information in advance, the above problems can be solved. In this paper, we will propose a real-time classification method to classify audio signals into several basic audio types such as pure speech, music, song, speech with music background, and speech with environmental noise background. In order to make the proposed method robust for a variety of audio sources, we use Bayesian decision function for multivariable Gaussian distribution instead of manually adjusting a threshold for each discriminator. The proposed approach can be applied to content-based audio/video retrieval. In the experiment, the efficiency and effectiveness of this method are shown by an accuracy rate of more than 96% for general audio data classification.
引用
收藏
页码:63 / 78
页数:16
相关论文
共 50 条
  • [1] A generic audio classification and segmentation approach for multimedia indexing and retrieval
    Kiranyaz, S
    Qureshi, AF
    Gabbouj, M
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2006, 14 (03): : 1062 - 1081
  • [2] Heuristic approach for generic audio data segmentation and annotation
    Zhang, T
    Kuo, CCJ
    ACM MULTIMEDIA 99, PROCEEDINGS, 1999, : 67 - 76
  • [3] Audio Data Classification by Means of New Algorithms
    Stastny, Jiri
    Skorpil, Vladislav
    Fejfar, Jiri
    2013 36TH INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS AND SIGNAL PROCESSING (TSP), 2013, : 507 - 511
  • [4] Classification of Metadata Categories in Data Warehousing - A Generic Approach
    Gabriel, Roland
    Hoppe, Tobias
    Pastwa, Alexander
    AMCIS 2010 PROCEEDINGS, 2010,
  • [5] A scheme for the classification of audio data
    Subramanya, SR
    Sabharwal, C
    Subbiah, P
    Vishwanathan, N
    COMPUTER APPLICATIONS IN INDUSTRY AND ENGINEERING, 2000, : 340 - 344
  • [6] Automatic classification of audio data
    Costa, CHL
    Valle, JD
    Koerich, AL
    2004 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN & CYBERNETICS, VOLS 1-7, 2004, : 562 - 567
  • [7] Affective Classification of Generic Audio Clips using Regression Models
    Malandrakis, Nikolaos
    Sundaram, Shiva
    Potamianos, Alexandros
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 2831 - 2835
  • [8] Audio-Visual Atoms for Generic Video Concept Classification
    Jiang, Wei
    Cotton, Courtenay
    Chang, Shih-Fu
    Ellis, Dan
    Loui, Alexander C.
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2010, 6 (03)
  • [9] An unsupervised audio segmentation and classification approach
    Pan, Wenjuan
    Yao, Yong
    Liu, Zhijing
    FOURTH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, VOL 3, PROCEEDINGS, 2007, : 303 - 306
  • [10] A Classification Method for Environmental Audio Data
    Li, Ying
    2ND IEEE INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER CONTROL (ICACC 2010), VOL. 2, 2010, : 355 - 361