Audio Classification Using Dominant Spatial Patterns in Time-Frequency Space

被引:0
|
作者
Molla, Md. Khademul Islam [1 ,2 ]
Hirose, Keikichi [1 ]
机构
[1] Univ Tokyo, Grad Sch Informat Sci & Technol, Tokyo, Japan
[2] Rajshahi Univ, Dept Comp Sci & Eng, Rajshahi, Bangladesh
关键词
Linear discriminant analysis; non-negative matrix factorization; speech/music discrimination; time-frequency representation; NONNEGATIVE MATRIX FACTORIZATION; SPEECH/MUSIC DISCRIMINATOR;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a novel audio discrimination algorithm using spatial features in time-frequency (TF) space. Three types of audio signals speech, music without vocal and music with background vocal are taken into consideration for classification. The audio segment is transformed into TF domain yielding the spatial illustration of energy. Non negative matrix factorization (NMF) is applied to TF space to extract a set of vectors which represents the dominant subspace of spatial energy distribution. The inverse Fourier transform is applied to individual dominant vectors to derive the features for audio discrimination. The classification is performed by using multiclass linear discriminant analysis (mcLDA). The experimental results show that the proposed algorithm is more noise robust and performs better than the recently reported methods.
引用
收藏
页码:2914 / 2918
页数:5
相关论文
共 50 条
  • [1] Audio signal classification using time-frequency parameters
    Umapathy, K
    Krishnan, S
    Jimaa, S
    [J]. IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOL I AND II, PROCEEDINGS, 2002, : A249 - A252
  • [2] Time-Frequency Processing for Spatial Audio
    Rumsey, Francis
    [J]. JOURNAL OF THE AUDIO ENGINEERING SOCIETY, 2010, 58 (7-8): : 655 - 659
  • [3] Multigroup classification of audio signals using time-frequency parameters
    Umapathy, K
    Krishnan, S
    Jimaa, S
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2005, 7 (02) : 308 - 315
  • [4] Enhancing Spectrogram for Audio Classification Using Time-Frequency Enhancer
    Xing, Haoran
    Zhang, Shiqi
    Takeuchi, Daiki
    Niizumi, Daisuke
    Harada, Noboru
    Makino, Shoji
    [J]. 2023 ASIA PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE, APSIPA ASC, 2023, : 1155 - 1160
  • [5] Multigroup classification of audio signals using time-frequency parameters
    Dept. of Elec. and Comp. Engineering, University of Western Ontario, London, Ont. N6A 5B9, Canada
    不详
    不详
    [J]. 1600, 308-315 (April 2005):
  • [6] AUDIO CLASSIFICATION FROM TIME-FREQUENCY TEXTURE
    Yu, Guoshen
    Slotine, Jean-Jacques
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 1677 - +
  • [7] JOINT TIME-FREQUENCY SCATTERING FOR AUDIO CLASSIFICATION
    Anden, Joakim
    Lostanlen, Vincent
    Mallat, Stephane
    [J]. 2015 IEEE INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING, 2015,
  • [8] Classification of Time-Frequency Regions in Stereo Audio
    Harma, Aki
    [J]. JOURNAL OF THE AUDIO ENGINEERING SOCIETY, 2011, 59 (10): : 707 - 720
  • [9] LEARNING SEPARABLE TIME-FREQUENCY FILTERBANKS FOR AUDIO CLASSIFICATION
    Pu, Jie
    Panagakis, Yannis
    Pantic, Maja
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 3000 - 3004
  • [10] TFECN: Time-Frequency Enhanced ConvNet for Audio Classification
    Wang, Mengwei
    Yang, Zhe
    [J]. INTERSPEECH 2023, 2023, : 281 - 285