Cepstral coefficients effectiveness for gunshot classifying

Cited by: 0

Authors:

Svatos, Jakub [1]

Holub, Jan [1]

Affiliations:

[1] Czech Tech Univ, Fac Elect Engn, Dept Measurement, Prague, Czech Republic

Source:

MEASUREMENT SCIENCE AND TECHNOLOGY | 2024 / Vol. 35 / Issue 07

Keywords:

acoustic measurements; gunshot detection; cepstral coefficients; multiple signal classification; neural network; FEATURES; CLASSIFICATION; MFCC;

DOI:

10.1088/1361-6501/ad3c5d

Chinese Library Classification number:

T [Industrial Technology];

Discipline classification code:

08 ;

Abstract:

This paper analyses the efficiency of various frequency cepstral coefficients (FCC) in a non-speech application, specifically in classifying acoustic impulse events, namely gunshots. Various methods are available for identifying such events, most of them based on time-domain or frequency-domain algorithms. However, both domains have limitations and disadvantages. In this article, FCC, which combine the advantages of the frequency and time domains, are presented and analyzed. These features, originally developed for speech, have shown potential not only in speech-related applications but also in other acoustic applications. The classification efficiency of features obtained using four different FCC is compared: mel-frequency cepstral coefficients (MFCC), inverse mel-frequency cepstral coefficients (IMFCC), linear-frequency cepstral coefficients (LFCC), and gammatone-frequency cepstral coefficients (GTCC). An optimal frame length for the FCC calculation is also explored. The recordings include gunshots from short guns and rifles of different calibers, together with multiple gunshot-like acoustic impulse events that represent false alarms. More than 600 acoustic event records were acquired and used for training and validation of two designed classifiers, a support vector machine and a neural network. Accuracy, recall, and the Matthews correlation coefficient measure the classification success rate. The results reveal the superiority of GTCC over the other analyzed methods.
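The three success measures named in the abstract can be illustrated with a minimal sketch computed from binary confusion-matrix counts (gunshot vs. false alarm). The function name `binary_metrics` and the counts below are hypothetical and chosen only for illustration; they are not results from the paper, and the authors' own evaluation code is not shown in this record.

```python
import math

def binary_metrics(tp, fp, tn, fn):
    """Return (accuracy, recall, MCC) from binary confusion-matrix counts.

    tp/fn: gunshots classified correctly / missed
    tn/fp: non-gunshot events rejected correctly / raised as false alarms
    """
    total = tp + fp + tn + fn
    accuracy = (tp + tn) / total
    # Recall: fraction of true gunshots that were detected.
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    # Matthews correlation coefficient; defined as 0 when any marginal is empty.
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    mcc = (tp * tn - fp * fn) / denom if denom else 0.0
    return accuracy, recall, mcc

# Hypothetical counts, not taken from the paper.
acc, rec, mcc = binary_metrics(tp=90, fp=5, tn=95, fn=10)
print(f"accuracy={acc:.3f} recall={rec:.3f} mcc={mcc:.3f}")
```

Unlike accuracy, the Matthews correlation coefficient uses all four cells of the confusion matrix, so it stays informative when gunshots and false-alarm events are unevenly represented.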

Pages: 11