Cepstral coefficients effectiveness for gunshot classifying

Cited by: 0
Authors
Svatos, Jakub [1]
Holub, Jan [1]
Affiliation
[1] Czech Tech Univ, Fac Elect Engn, Dept Measurement, Prague, Czech Republic
Keywords
acoustic measurements; gunshot detection; cepstral coefficients; multiple signal classification; neural network; features; classification; MFCC
DOI
10.1088/1361-6501/ad3c5d
Chinese Library Classification
T [Industrial Technology]
Discipline classification code
08
Abstract
This paper analyzes the efficiency of various frequency cepstral coefficients (FCC) in a non-speech application, specifically in classifying acoustic impulse events, namely gunshots. Various methods are available for identifying such events, and the majority are based on time-domain or frequency-domain algorithms. However, both domains have their limitations and disadvantages. In this article, FCC, which combine the advantages of both the frequency and time domains, are presented and analyzed. These features, originally developed for speech, have shown potential not only in speech-related applications but also in other acoustic applications. The classification efficiency of features obtained using four different FCC is compared, namely mel-frequency cepstral coefficients (MFCC), inverse mel-frequency cepstral coefficients (IMFCC), linear-frequency cepstral coefficients (LFCC), and gammatone-frequency cepstral coefficients (GTCC). The optimal frame length for the FCC calculation is also explored. The dataset comprises gunshots from short guns and rifles of different calibers, together with multiple acoustic impulse events that resemble gunshots and represent false alarms. More than 600 acoustic event recordings were acquired and used for training and validation of two designed classifiers, a support vector machine and a neural network. Accuracy, recall, and the Matthews correlation coefficient measure the classification success rate. The results reveal the superiority of GTCC over the other analyzed methods.
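The three success measures named in the abstract can be illustrated with a minimal sketch. It assumes a binary gunshot / false-alarm decision summarized by confusion-matrix counts. The function name `binary_metrics` and the counts used below are hypothetical and are not taken from the paper.

```python
import math


def binary_metrics(tp, fp, tn, fn):
    """Return (accuracy, recall, MCC) from binary confusion-matrix counts.

    tp: gunshots classified as gunshots
    fp: false alarms classified as gunshots
    tn: false alarms classified as false alarms
    fn: gunshots classified as false alarms
    """
    total = tp + fp + tn + fn
    accuracy = (tp + tn) / total
    # Recall: fraction of real gunshots that were detected.
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    # Matthews correlation coefficient; defined as 0 when the denominator vanishes.
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    mcc = (tp * tn - fp * fn) / denom if denom else 0.0
    return accuracy, recall, mcc


# Hypothetical counts, for illustration only (not results from the paper).
acc, rec, mcc = binary_metrics(tp=90, fp=5, tn=95, fn=10)
print(f"accuracy={acc:.3f} recall={rec:.3f} mcc={mcc:.3f}")
```

Unlike accuracy, the Matthews correlation coefficient uses all four cells of the confusion matrix, so it stays informative when gunshots and false alarms are unevenly represented.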
Pages: 11
Related papers
50 in total
  • [41] WAVELET BASED CEPSTRAL COEFFICIENTS FOR NEURAL NETWORK SPEECH RECOGNITION
    Adam, T. B.
    Salam, M. S.
    Gunawan, T. S.
    2013 IEEE INTERNATIONAL CONFERENCE ON SIGNAL AND IMAGE PROCESSING APPLICATIONS (IEEE ICSIPA 2013), 2013: 447-451
  • [42] A comparison of cepstral coefficients and spectral moments in the classification of Romanian fricatives
    Spinu, Laura
    Lilley, Jason
    JOURNAL OF PHONETICS, 2016, 57: 40-58
  • [43] Audio bandwidth extension based on temporal smoothing cepstral coefficients
    Liu, Xin
    Bao, Chang-Chun
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2014
  • [44] Mel-frequency Cepstral Coefficients for Eye Movement Identification
    Nguyen Viet Cuong
    Vu Dinh
    Lam Si Tung Ho
    2012 IEEE 24TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2012), VOL 1, 2012: 253-260
  • [45] Musical instrument recognition using cepstral coefficients and temporal features
    Eronen, A
    Klapuri, A
    2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000: 753-756
  • [46] Comparison of Frequency Cepstral Coefficients in Impulse Acoustic Events Detection
    Svatos, Jakub
    Holub, Jan
    Olasoji, Oluwaseun
    MODELLING AND SIMULATION FOR AUTONOMOUS SYSTEMS, MESAS 2023, 2025, 14615: 3-10
  • [47] Recognition of emotion from speech using evolutionary cepstral coefficients
    Ali Bakhshi
    Stephan Chalup
    Ali Harimi
    Seyed Mostafa Mirhassani
    Multimedia Tools and Applications, 2020, 79: 35739-35759
  • [48] Modified Group Delay Cepstral Coefficients for Voice Liveness Detection
    Singh, Shrishti
    Khoria, Kuldeep
    Patil, Hemant A.
    29TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2021), 2021: 146-150
  • [49] Perceptual MVDR-based cepstral coefficients for speaker recognition
    Liang, Chunyan
    Zhang, Xiang
    Yang, Lin
    Zhang, Jianping
    Yan, Yonghong
    Shengxue Xuebao/Acta Acustica, 2012, 37 (06): 673-678
  • [50] Audio bandwidth extension based on temporal smoothing cepstral coefficients
    Xin Liu
    Chang-Chun Bao
    EURASIP Journal on Audio, Speech, and Music Processing, 2014