A new approach for classification of generic audio data

被引:4
|
作者
Lin, RS [1 ]
Chen, LH [1 ]
机构
[1] Natl Chiao Tung Univ, Dept Comp & Informat Sci, Hsinchu 30050, Taiwan
关键词
audio classification; spectrogram; Bayesian decision function; multivariable Gaussian distribution;
D O I
10.1142/S0218001405003958
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The existing audio retrieval systems fall into one of two categories: single-domain systems that can accept data of only a single type (e.g. speech) or multiple-domain systems that offer content-based retrieval for multiple types of audio data. Since a single-domain system has limited applications, a multiple-domain system will be more useful. However, different types of audio data will have different properties, this will make a multiple-domain system harder to be developed. If we can classify audio information in advance, the above problems can be solved. In this paper, we will propose a real-time classification method to classify audio signals into several basic audio types such as pure speech, music, song, speech with music background, and speech with environmental noise background. In order to make the proposed method robust for a variety of audio sources, we use Bayesian decision function for multivariable Gaussian distribution instead of manually adjusting a threshold for each discriminator. The proposed approach can be applied to content-based audio/video retrieval. In the experiment, the efficiency and effectiveness of this method are shown by an accuracy rate of more than 96% for general audio data classification.
引用
收藏
页码:63 / 78
页数:16
相关论文
共 50 条
  • [31] Determining a New Home Classification A Data Mining Approach
    Lopez-Saca, Fidel
    Castro-Lopez, Jose
    Figueroa-Gonzalez, Josue
    Gonzalez-Brambila, Silvia B.
    PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON DATA SCIENCE, TECHNOLOGY AND APPLICATIONS (DATA), 2017, : 415 - 420
  • [32] A NEW APPROACH FOR DATA CLASSIFICATION USING FUZZY LOGIC
    Taneja, Shweta
    Suri, Bhawna
    Narwal, Himanshu
    Jain, Anchit
    Kathuria, Akshay
    Gupta, Sachin
    2016 6th International Conference - Cloud System and Big Data Engineering (Confluence), 2016, : 22 - 27
  • [33] Content based audio classification: a neural network approach
    Mitra, Vikramjit
    Wang, Chia-Jiu
    SOFT COMPUTING, 2008, 12 (07) : 639 - 646
  • [34] Audio Classification of Bird Species: a Statistical Manifold Approach
    Briggs, Forrest
    Raich, Raviv
    Fern, Xiaoli Z.
    2009 9TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, 2009, : 51 - 60
  • [35] An Improved Approach to Audio Segmentation and Classification in Broadcasting Industries
    Sun, Jingzhou
    Wang, Yongbin
    JOURNAL OF DATABASE MANAGEMENT, 2019, 30 (02) : 44 - 66
  • [36] Content based audio classification: a neural network approach
    Vikramjit Mitra
    Chia-Jiu Wang
    Soft Computing, 2008, 12 : 639 - 646
  • [37] Data augmentation approaches for improving animal audio classification
    Nanni, Loris
    Maguolo, Gianluca
    Paci, Michelangelo
    ECOLOGICAL INFORMATICS, 2020, 57
  • [38] Dementia classification using attention mechanism on audio data
    Milana, Shkhanukova
    2023 IEEE 21ST WORLD SYMPOSIUM ON APPLIED MACHINE INTELLIGENCE AND INFORMATICS, SAMI, 2023, : 103 - 107
  • [39] A Framework for Classification and Segmentation of Massive Audio Data Streams
    Aggarwal, Charu C.
    KDD-2007 PROCEEDINGS OF THE THIRTEENTH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2007, : 1013 - 1017
  • [40] Large scale data based audio scene classification
    Sophiya E.
    Jothilakshmi S.
    International Journal of Speech Technology, 2018, 21 (04) : 825 - 836