Environment Sound Recognition for Digital Audio Forensics Using Linear Predictive Coding Features

Cited by: 0

Authors:

AlQahtani, Mubarak Obaid [1]

Al Mazyad, Abdulaziz S. [2]

Affiliations:

[1] King Saud Univ, Ctr Excellence Informat Assurance, Riyadh, Saudi Arabia

[2] King Saud Univ, Coll Comp & Informat Sci, Riyadh, Saudi Arabia

Source:

DIGITAL INFORMATION PROCESSING AND COMMUNICATIONS, PT 2 | 2011 / Vol. 189

Keywords:

Linear Predictive Coding (LPC); Zero Crossing (ZC); Mel frequency cepstral coefficients (MFCC); Moving Picture Experts Group (MPEG); Audio Waveform (AWF); Audio Power (AP); Audio Spectrum Envelope (ASE); Audio Spectrum Centroid (ASC); Audio Spectrum Spread (ASS); Hidden Markov model (HMM); K-Nearest Neighbors (K-NN)

DOI:

Not available

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology]

Discipline classification code:

0812

Abstract:

Linear Predictive Coding coefficients are among the main features extracted in digital forensics. In this paper, we perform several experiments focusing on the problem of environment recognition from audio, particularly for forensic applications. We investigate the effect of temporal Linear Predictive Coding coefficients as extracted features on environment sound recognition, computing the Linear Predictive Coding coefficients for each frame of every file. Performance is evaluated against a varying number of training sounds and samples per training file, and compared with the Zero Crossing feature and the Moving Picture Experts Group-7 (MPEG-7) low-level descriptor features. We use K-Nearest Neighbors as the classifier to detect the environment of any audio test file. Experimental results show that higher recognition accuracy is achieved by increasing the number of training files and by decreasing the number of samples per training file.
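The pipeline summarized in the abstract (per-frame Linear Predictive Coding coefficients fed to a K-Nearest Neighbors classifier) can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the function names, the frame length of 256 samples, the LPC order of 8, the Hamming window, the Euclidean distance, k = 3, and the per-file majority vote over frames are all assumptions that the abstract does not specify.

```python
import numpy as np

def lpc(frame, order):
    """LPC coefficients a[1..order] by the autocorrelation method
    (Levinson-Durbin), with error e[n] = x[n] + sum_j a[j] * x[n-j]."""
    n = len(frame)
    # Autocorrelation at lags 0..order
    r = np.array([np.dot(frame[:n - k], frame[k:]) for k in range(order + 1)])
    a = np.zeros(order + 1)
    a[0] = 1.0
    err = r[0]
    if err <= 0:                      # silent frame: return all zeros
        return a[1:]
    for i in range(1, order + 1):
        acc = r[i] + np.dot(a[1:i], r[i - 1:0:-1])
        k = -acc / err                # reflection coefficient
        prev = a.copy()
        a[1:i] = prev[1:i] + k * prev[i - 1:0:-1]
        a[i] = k
        err *= 1.0 - k * k
        if err <= 0:
            break
    return a[1:]

def frame_lpc_features(signal, frame_len=256, order=8):
    """Split a signal into non-overlapping frames (assumed layout) and
    return one LPC vector per frame, shape (n_frames, order)."""
    signal = np.asarray(signal, dtype=float)
    window = np.hamming(frame_len)
    n_frames = len(signal) // frame_len
    return np.array([
        lpc(signal[i * frame_len:(i + 1) * frame_len] * window, order)
        for i in range(n_frames)
    ])

def knn_predict(train_x, train_y, test_x, k=3):
    """Label each test vector by majority vote among its k nearest
    training vectors (Euclidean distance)."""
    train_x, train_y = np.asarray(train_x, dtype=float), np.asarray(train_y)
    preds = []
    for x in np.asarray(test_x, dtype=float):
        dist = np.linalg.norm(train_x - x, axis=1)
        nearest = train_y[np.argsort(dist)[:k]]
        labels, counts = np.unique(nearest, return_counts=True)
        preds.append(labels[np.argmax(counts)])
    return np.array(preds)

def classify_file(train_x, train_y, signal, k=3, frame_len=256, order=8):
    """Assumed file-level rule: majority vote over per-frame k-NN labels."""
    frame_labels = knn_predict(
        train_x, train_y, frame_lpc_features(signal, frame_len, order), k)
    labels, counts = np.unique(frame_labels, return_counts=True)
    return labels[np.argmax(counts)]
```

In this sketch, the training matrix would be built by stacking `frame_lpc_features` outputs from labeled recordings, so the number of training files and the number of samples taken per file (the two factors the abstract varies) directly control its size.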


Pages: 301 / +

Number of pages: 3
