Speech/music discrimination based on spectral peak analysis and multi-layer perceptron

Cited by: 0
Authors
Keum, Ji-Soo [1]
Lee, Hyon-Soo [1]
Affiliations
[1] Kyung Hee Univ, Dept Comp Engn, 1 Seocheon Dong, Yongin, Gyeonggi, South Korea
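Keywords
DOI
Not available
Chinese Library Classification
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract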
This study presents a new speech/music discrimination method based on a spectral peak feature and a multilayer perceptron (MLP). The focus was on feature extraction that reflects the duration characteristics of spectral peaks and on achieving high performance with a small training set. Spectral peak features were extracted from the spectral peak tracks of the audio and normalized by segment length. The frequency channels were then grouped to reflect the spectral distribution. For training, only 25 seconds of speech (Korean) and 50 seconds of music were used. The method was evaluated on 24,258 seconds of speech and music audio. The average accuracy was 96.58% for speech and 91.82% for music. These results indicate that the proposed method is suitable for speech/music discrimination.
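The feature pipeline described in the abstract (peak tracks, normalization by segment length, grouping of frequency channels) can be illustrated with a minimal sketch. The abstract does not give the peak-picking rule, the duration statistic, or the grouping scheme, so everything below is an assumption made for illustration: the function name `spectral_peak_duration_features`, the Hann-windowed FFT framing, the relative threshold `rel_thresh`, the use of the longest run of consecutive peak frames as the "duration", and the equal-width grouping into `n_groups` bands. It is not the authors' implementation.

```python
import numpy as np


def spectral_peak_duration_features(signal, frame_len=256, hop=128,
                                    n_groups=8, rel_thresh=0.1):
    """Hypothetical spectral-peak-duration feature for one audio segment.

    All parameter names and defaults are illustrative assumptions,
    not values taken from the paper.
    """
    signal = np.asarray(signal, dtype=float)
    n_frames = 1 + (len(signal) - frame_len) // hop
    if n_frames < 1:
        raise ValueError("segment is shorter than one frame")

    # Magnitude spectrogram, shape (n_frames, frame_len // 2 + 1).
    window = np.hanning(frame_len)
    idx = np.arange(frame_len)[None, :] + hop * np.arange(n_frames)[:, None]
    mag = np.abs(np.fft.rfft(signal[idx] * window, axis=1))

    # Assumed peak rule: a local maximum along frequency that also
    # exceeds a fraction of the strongest bin in the same frame.
    mid = mag[:, 1:-1]
    is_peak = np.zeros(mag.shape, dtype=bool)
    is_peak[:, 1:-1] = ((mid > mag[:, :-2]) & (mid >= mag[:, 2:])
                        & (mid > rel_thresh * mag.max(axis=1, keepdims=True)))

    # Assumed duration statistic: longest run of consecutive frames
    # in which a given frequency bin stays a peak.
    run = np.zeros(mag.shape[1])
    longest = np.zeros(mag.shape[1])
    for frame_peaks in is_peak:
        run = np.where(frame_peaks, run + 1, 0)
        longest = np.maximum(longest, run)

    # Normalize by segment length (in frames), then group the
    # frequency channels into equal-width bands.
    normalized = longest / n_frames
    return np.array([g.max() for g in np.array_split(normalized, n_groups)])
```

Under these assumptions, a steady tone yields a value near 1 in the band that contains it, while noise-like input yields small values in every band. Vectors of this kind could then be passed to any MLP classifier; the network architecture is not described in the abstract and is left out here.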
Pages: 56 / +
Number of pages: 3