A lightweight feature extraction technique for deepfake audio detection

Cited by: 5
Authors
Chakravarty, Nidhi [1]
Dua, Mohit [1]
Affiliations
[1] Natl Inst Technol, Dept Comp Engn, Kurukshetra, India
Keywords
Audio deepfake; Mel spectrogram; ResNet50; LDA;
DOI
10.1007/s11042-024-18217-9
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
The emergence of audio deepfakes has raised concerns about reputational integrity and trustworthiness. Audio deepfakes can now be produced more easily, which makes them harder to detect, so technologies that can identify deepfakes at the audio level must be developed. Recognising the importance of feature extraction for such systems, we have created an improved feature extraction method. We employ a modified ResNet50 to extract features from audio Mel spectrograms. The Linear Discriminant Analysis (LDA) dimensionality reduction technique is then used to reduce feature complexity. The features selected by LDA are used to train machine learning (ML) models with the backend classification algorithms Support Vector Machine (SVM), Random Forest (RF), K-Nearest Neighbour (KNN), and Naive Bayes (NB). The ASVspoof 2019 Logical Access (LA) partition is used for training, and the ASVspoof 2021 deepfake partition is used to evaluate the systems. We also use the DECRO dataset to evaluate the proposed model on unseen noisy data. We use 20% of the audio files from the training dataset for validation. The proposed method performs better than traditional feature extraction methods such as Mel Frequency Cepstral Coefficients (MFCC) and Gammatone Cepstral Coefficients (GTCC). It achieves an Equal Error Rate (EER) of 0.4% and an accuracy of 99.7%.
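The abstract reports its headline result as an Equal Error Rate. As a rough illustration of what that metric measures, the sketch below finds the threshold at which the false acceptance rate (spoofed audio accepted as genuine) and the false rejection rate (genuine audio rejected) are closest, and reports their average. The function name `equal_error_rate`, the score convention (higher means more likely bona fide), and the toy scores are assumptions made for this example; this is not the authors' evaluation code.

```python
def equal_error_rate(scores, labels):
    """Return (eer, threshold) for a list of detection scores.

    Assumed convention: a higher score means "more likely bona fide".
    labels: 1 for bona fide audio, 0 for spoofed audio.
    """
    n_bona = sum(labels)
    n_spoof = len(labels) - n_bona
    if n_bona == 0 or n_spoof == 0:
        raise ValueError("need at least one sample of each class")

    best_gap, best_eer, best_thr = None, None, None
    # Try every distinct score as a threshold; accept when score >= threshold.
    for thr in sorted(set(scores)) + [float("inf")]:
        false_accepts = sum(1 for s, y in zip(scores, labels) if y == 0 and s >= thr)
        false_rejects = sum(1 for s, y in zip(scores, labels) if y == 1 and s < thr)
        far = false_accepts / n_spoof   # false acceptance rate
        frr = false_rejects / n_bona    # false rejection rate
        gap = abs(far - frr)
        # Keep the threshold where the two error rates are closest.
        if best_gap is None or gap < best_gap:
            best_gap, best_eer, best_thr = gap, (far + frr) / 2, thr
    return best_eer, best_thr


# Toy data (made up for illustration): one spoof scores above one bona fide sample.
toy_scores = [0.9, 0.8, 0.7, 0.6, 0.4, 0.3, 0.2, 0.1]
toy_labels = [1, 1, 1, 0, 1, 0, 0, 0]
eer, thr = equal_error_rate(toy_scores, toy_labels)
print(f"EER = {eer:.2%} at threshold {thr}")
```

On the toy data, one of four spoofed samples is accepted and one of four bona fide samples is rejected at the chosen threshold, so both error rates equal 25%. Evaluation toolkits usually interpolate between thresholds instead of picking the nearest one, so values on real data may differ slightly from this simplified version.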
Pages: 67443-67467
Number of pages: 25