Environment Sound Recognition for Digital Audio Forensics Using Linear Predictive Coding Features

被引:0
|
作者
AlQahtani, Mubarak Obaid [1 ]
Al Mazyad, Abdulaziz S. [2 ]
机构
[1] King Saud Univ, Ctr Excellence Informat Assurance, Riyadh, Saudi Arabia
[2] King Saud Univ, Coll Comp & Informat Sci, Riyadh, Saudi Arabia
关键词
Linear Predictive Coding (LPC); Zero Crossing (ZC); Mel frequency cepstral coefficients (MFCC); Moving Picture Experts Group (MPEG); Audio Waveform (AWF); Audio Power (AP); Audio Spectrum Envelop (ASE); Audio Spectrum Centroid (ASC); Audio Spectrum Spread (ASS); Hidden Markov model (HMM); K-Nearest Neighbors (K-NN);
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Linear Predictive Coding coefficients are of the main extraction feature in digital forensic. In this paper. we perform several experiments focusing oil the problems of environments recognition from audio particularly for forensic: application. We investigated the effect of temporal Linear Predictive Coding coefficient as feature extraction on environment sound recognition to compute the Linear Predictive Coding coefficient for each frame for all files. The per.. formance is evaluated against varying number of training sounds and samples per training file and compare with Zero Crossing feature and Moving Picture Experts Group-7 low level description feature. We use K-Nearest Neighbors as classifier feature to detect which the environment for any audio testing file. Experimental results show that higher recognition accuracy is achieved by increasing the number of training tiles and by decreasing the number of samples per training file.
引用
收藏
页码:301 / +
页数:3
相关论文
共 50 条
  • [41] High capacity, secure audio watermarking technique integrating spread spectrum and linear predictive coding
    Korany, Noha O.
    Elboghdadly, Namat M.
    Elabdein, Mohamed Z.
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (17) : 50645 - 50668
  • [42] SPEECH RECOGNITION BY POLARIZED LINEAR PREDICTIVE ERROR CODING - POLPEC METHOD.
    Akagi, Masato
    Iijima, Taizo
    Electronics & communications in Japan, 1982, 65 (08): : 9 - 18
  • [43] Linear predictive coding distinguishes spectral EEG features of Parkinson's disease
    Anjum, Md Fahim
    Dasgupta, Soura
    Mudumbai, Raghuraman
    Singh, Arun
    Cavanagh, James F.
    Narayanan, Nandakumar S.
    PARKINSONISM & RELATED DISORDERS, 2020, 79 : 79 - 85
  • [44] Blind Source Separation for a Robust Audio Recognition Scheme in Multiple Sound-Sources Environment
    Han, Wei
    Zhou, Songbin
    Li, Chang
    Liu, Yisen
    Liu, Zhe
    PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON MECHATRONICS, ELECTRONIC, INDUSTRIAL AND CONTROL ENGINEERING, 2015, 8 : 1564 - 1568
  • [45] LINEAR TRANSFORMATION CODING AND PREDICTIVE CODING - 2 METHODS OF DIGITAL ENCODING FOR CONTINUOUS SOURCES WITH DISCRETE PARAMETERS
    NITADORI, K
    ELECTRONICS & COMMUNICATIONS IN JAPAN, 1970, 53 (02): : 37 - &
  • [46] Recognition of isolated words using Zernike and MFCC features for audio visual speech recognition
    Borde, Prashant
    Varpe, Amarsinh
    Manza, Ramesh
    Yannawar, Pravin
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2015, 18 (02) : 167 - 175
  • [47] Recognition of vision-based activities of daily living using linear predictive coding of histogram of directional derivative
    Sidharth B. Bhorge
    Ramchandra R. Manthalkar
    Journal of Ambient Intelligence and Humanized Computing, 2019, 10 : 199 - 214
  • [48] Recognition of vision-based activities of daily living using linear predictive coding of histogram of directional derivative
    Bhorge, Sidharth B.
    Manthalkar, Ramchandra R.
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2019, 10 (01) : 199 - 214
  • [49] Comparison of MPEG-7 audio spectrum projection features and mfcc applied to speaker recognition, sound classification and audio segmentation
    Kim, HG
    Sikora, T
    2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL V, PROCEEDINGS: DESIGN AND IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS INDUSTRY TECHNOLOGY TRACKS MACHINE LEARNING FOR SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING SIGNAL PROCESSING FOR EDUCATION, 2004, : 925 - 928
  • [50] Environmental Sound Classification Using Local Binary Pattern and Audio Features Collaboration
    Toffa, Ohini Kafui
    Mignotte, Max
    IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 : 3978 - 3985