A lightweight feature extraction technique for deepfake audio detection

被引:5
|
作者
Chakravarty, Nidhi [1 ]
Dua, Mohit [1 ]
机构
[1] Natl Inst Technol, Dept Comp Engn, Kurukshetra, India
关键词
Audio deepfake; Mel spectrogram; ResNet50; LDA;
D O I
10.1007/s11042-024-18217-9
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The emergence of audio deepfakes has prompted concerns over reputational integrity and dependability. Deepfakes with audio can now be produced more easily, which makes it harder to spot them. Technologies that can identify audio-level deepfakes must be developed in order to address this issue. As a result, we have recognised the importance of feature extraction for these systems and we have created an improved method for feature extraction. On audio Mel spectrogram, we have employed a modified ResNet50 to extract features. Then, Linear Discriminant Analysis (LDA) dimensionality reduction technique have been used to optimise the feature complexity. The chosen features by LDA are then utilised to train these machine learning (ML) models using the backend classification algorithms Support Vector Machine (SVM), Random Forest (RF), K-Nearest Neighbour (KNN), and Naive Bayes (NB). The ASVspoof 2019 Logical Access (LA) partition is utilised for training, ASVspoof 2021 deep fake partition are used to evaluate the systems. Also, we have used DECRO dataset for evakuating our proposed model under unseen noisy dataset. We have used 20% audios from training dataset for validation purpose. When compared to other models, our proposed method performs better than traditional feature extraction methods such as Mel Frequency Cepstral Coefficients (MFCC) and Gammatone Cepstral Coefficients (GTCC). It achieves an impressive Equal Error Rate (EER) of only 0.4% and an accuracy of 99.7%.
引用
收藏
页码:67443 / 67467
页数:25
相关论文
共 50 条
  • [1] Temporal Feature Prediction in Audio-Visual Deepfake Detection
    Gao, Yuan
    Wang, Xuelong
    Zhang, Yu
    Zeng, Ping
    Ma, Yingjie
    ELECTRONICS, 2024, 13 (17)
  • [2] Lightweight Deepfake Detection Based on Multi-Feature Fusion
    Yasir, Siddiqui Muhammad
    Kim, Hyun
    APPLIED SCIENCES-BASEL, 2025, 15 (04):
  • [3] AVFF: Audio-Visual Feature Fusion for Video Deepfake Detection
    Oorloff, Trevine
    Koppisetti, Surya
    Bonettini, Nicole
    Solanki, Divyaraj
    Ben Colman
    Yacoob, Yaser
    Shahriyari, Ali
    Bharaj, Gaurav
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 27092 - 27102
  • [4] FakeSound: Deepfake General Audio Detection
    Xie, Zeyu
    Li, Baihan
    Xu, Xuenan
    Liang, Zheng
    Yu, Kai
    Wu, Mengyue
    INTERSPEECH 2024, 2024, : 112 - 116
  • [5] Does Audio Deepfake Detection Generalize?
    Mueller, Nicolas M.
    Czempin, Pavel
    Diekmann, Franziska
    Froghyar, Adam
    Bottinger, Konstantin
    INTERSPEECH 2022, 2022, : 2783 - 2787
  • [6] Deepfake audio detection by speaker verification
    Pianese, Alessandro
    Cozzolino, Davide
    Poggi, Giovanni
    Verdoliva, Luisa
    2022 IEEE INTERNATIONAL WORKSHOP ON INFORMATION FORENSICS AND SECURITY (WIFS), 2022,
  • [7] Enhanced Feature Extraction for Speech Detection in Media Audio
    Jang, Inseon
    Ahn, ChungHyun
    Seo, Jeongil
    Jang, Younseon
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 479 - 483
  • [8] Targeted Augmented Data for Audio Deepfake Detection
    Astrid, Marcella
    Ghorbel, Enjie
    Aouada, Djamila
    32ND EUROPEAN SIGNAL PROCESSING CONFERENCE, EUSIPCO 2024, 2024, : 346 - 350
  • [9] Whisper plus AASIST for DeepFake Audio Detection
    Luo, Qian
    Sivasundari, Kalyani Vinayagam
    HCI FOR CYBERSECURITY, PRIVACY AND TRUST, PT II, HCI-CPT 2024, 2024, 14729 : 121 - 133
  • [10] Retrieval-Augmented Audio Deepfake Detection
    Kang, Zuheng
    He, Yayun
    Zhao, Botao
    Qu, Xiaoyang
    Peng, Junqing
    Xiao, Jing
    Wang, Jianzong
    PROCEEDINGS OF THE 4TH ANNUAL ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, ICMR 2024, 2024, : 376 - 384