Speech and Music Classification Using Hybrid Form of Spectrogram and Fourier Transformation

Cited by: 0
Authors
Neammalai, Piyawat [1]
Phimoltares, Suphakant [1]
Lursinsap, Chidchanok [1]
Affiliations
[1] Chulalongkorn Univ, Fac Sci, Dept Math & Comp Sci, Adv Virtual & Intelligent Comp AVIC Ctr, Bangkok, Thailand
Keywords
Speech music classification; Spectrogram; Fourier Transform; AUDIO CLASSIFICATION; SEGMENTATION; FEATURES;
DOI
Not available
Chinese Library Classification (CLC) number
TP39 [Computer applications];
Discipline classification codes
081203 ; 0835 ;
Abstract
This paper presents a feature extraction technique for classifying audio data as speech or music. The technique combines image processing and signal processing and has three main steps. First, the audio data is segmented and transformed into a spectrogram image, and image processing methods are applied to find the salient characteristics of that image. Second, the salient spectrogram image is transformed using the 2-dimensional Fourier Transform, and the signal energy at specific frequencies is calculated to form the feature vector. Finally, in the classification step, a Support Vector Machine is used as a binary classifier. The method is tested on an audio database containing 510 instances, each 1.5 seconds long. The experimental results show that the proposed technique achieves acceptable classification accuracy.
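The feature extraction steps described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the abstract does not say which image processing methods or which frequencies are used, so the percentile threshold in `salient_image`, the radial frequency bands in `fft2_energy_features`, and the frame length, hop size, and 16 kHz sample rate are all assumptions made for the example. All function names are hypothetical.

```python
import numpy as np


def spectrogram(signal, frame_len=256, hop=128):
    """Magnitude spectrogram (frequency x time) from a Hann-windowed STFT."""
    window = np.hanning(frame_len)
    n_frames = 1 + (len(signal) - frame_len) // hop
    frames = np.stack([signal[i * hop:i * hop + frame_len] * window
                       for i in range(n_frames)])
    return np.abs(np.fft.rfft(frames, axis=1)).T


def salient_image(spec, percentile=90.0):
    """Assumed saliency step: keep only the strongest log-magnitude bins."""
    log_spec = np.log1p(spec)
    threshold = np.percentile(log_spec, percentile)
    return np.where(log_spec >= threshold, log_spec, 0.0)


def fft2_energy_features(image, n_bands=8):
    """Energy of the 2-D Fourier transform, summed over radial bands.

    The choice of radial bands is an assumption; the abstract only says
    that energy is computed "at the specific frequencies".
    """
    power = np.abs(np.fft.fftshift(np.fft.fft2(image))) ** 2
    rows, cols = power.shape
    y, x = np.indices(power.shape)
    # Normalized distance of each bin from the zero-frequency (center) bin.
    radius = np.hypot((y - rows // 2) / rows, (x - cols // 2) / cols)
    edges = np.linspace(0.0, radius.max() + 1e-12, n_bands + 1)
    band = np.clip(np.digitize(radius, edges) - 1, 0, n_bands - 1)
    energy = np.bincount(band.ravel(), weights=power.ravel(),
                         minlength=n_bands)
    total = energy.sum()
    # Normalize so the vector sums to 1 (skipped for an all-zero image).
    return energy / total if total > 0 else energy


def extract_features(signal, n_bands=8):
    """Spectrogram -> salient image -> 2-D FFT -> band-energy vector."""
    return fft2_energy_features(salient_image(spectrogram(signal)), n_bands)


if __name__ == "__main__":
    sample_rate = 16000  # assumed; not stated in the abstract
    t = np.arange(int(1.5 * sample_rate)) / sample_rate  # 1.5 s segment
    # Synthetic stand-in signal, not real music data.
    tone = np.sin(2 * np.pi * 440 * t) + 0.5 * np.sin(2 * np.pi * 880 * t)
    print(extract_features(tone).shape)
```

Feature vectors produced this way for labeled speech and music segments could then be passed to any SVM implementation (for example, scikit-learn's `SVC`) for the binary classification step.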
Pages: 6