A lightweight feature extraction technique for deepfake audio detection

被引:5
|
作者
Chakravarty, Nidhi [1 ]
Dua, Mohit [1 ]
机构
[1] Natl Inst Technol, Dept Comp Engn, Kurukshetra, India
关键词
Audio deepfake; Mel spectrogram; ResNet50; LDA;
D O I
10.1007/s11042-024-18217-9
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The emergence of audio deepfakes has prompted concerns over reputational integrity and dependability. Deepfakes with audio can now be produced more easily, which makes it harder to spot them. Technologies that can identify audio-level deepfakes must be developed in order to address this issue. As a result, we have recognised the importance of feature extraction for these systems and we have created an improved method for feature extraction. On audio Mel spectrogram, we have employed a modified ResNet50 to extract features. Then, Linear Discriminant Analysis (LDA) dimensionality reduction technique have been used to optimise the feature complexity. The chosen features by LDA are then utilised to train these machine learning (ML) models using the backend classification algorithms Support Vector Machine (SVM), Random Forest (RF), K-Nearest Neighbour (KNN), and Naive Bayes (NB). The ASVspoof 2019 Logical Access (LA) partition is utilised for training, ASVspoof 2021 deep fake partition are used to evaluate the systems. Also, we have used DECRO dataset for evakuating our proposed model under unseen noisy dataset. We have used 20% audios from training dataset for validation purpose. When compared to other models, our proposed method performs better than traditional feature extraction methods such as Mel Frequency Cepstral Coefficients (MFCC) and Gammatone Cepstral Coefficients (GTCC). It achieves an impressive Equal Error Rate (EER) of only 0.4% and an accuracy of 99.7%.
引用
收藏
页码:67443 / 67467
页数:25
相关论文
共 50 条
  • [21] Fake-Buster: A Lightweight Solution for Deepfake Detection
    Hubens, Nathan
    Mancas, Matei
    Gosselin, Bernard
    Preda, Marius
    Zaharia, Titus
    APPLICATIONS OF DIGITAL IMAGE PROCESSING XLIV, 2021, 11842
  • [22] Audio-deepfake detection: Adversarial attacks and countermeasures
    Rabhi, Mouna
    Bakiras, Spiridon
    Di Pietro, Roberto
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 250
  • [23] Image Feature Detectors for Deepfake Video Detection
    Kharbat, Faten F.
    Elamsy, Tarik
    Mahmoud, Ahmed
    Abdullah, Rami
    2019 IEEE/ACS 16TH INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS (AICCSA 2019), 2019,
  • [24] DEEPFAKE DETECTION USING MULTIPLE FEATURE FUSION
    Zhang, Ya
    Jin, Xin
    Jiang, Qian
    Dong, Yunyun
    Wu, Nan
    Yao, Shaowen
    Zhou, Wei
    ADVANCES IN DIGITAL FORENSICS XVIII, 2022, 653 : 123 - 139
  • [25] A Robust Lightweight Deepfake Detection Network Using Transformers
    Zhang, Yaning
    Wang, Tianyi
    Shu, Minglei
    Wang, Yinglong
    PRICAI 2022: TRENDS IN ARTIFICIAL INTELLIGENCE, PT I, 2022, 13629 : 275 - 288
  • [26] An improved feature extraction for Hindi language audio impersonation attack detection
    Chakravarty, Nidhi
    Dua, Mohit
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (25) : 66565 - 66590
  • [27] An Overview of Audio Event Detection Methods from Feature Extraction to Classification
    Babaee, Elham
    Anuar, Nor Badrul
    Wahab, Ainuddin Wahid Abdul
    Shamshirband, Shahaboddin
    Chronopoulos, Anthony T.
    APPLIED ARTIFICIAL INTELLIGENCE, 2017, 31 (9-10) : 661 - 714
  • [28] Audio Deepfake Approaches
    Shaaban, Ousama A.
    Yildirim, Remzi
    Alguttar, Abubaker A.
    IEEE ACCESS, 2023, 11 : 132652 - 132682
  • [29] Ship Target Detection Algorithm Combined with Lightweight Feature Extraction Network
    Li, Dengfeng
    Gao, Ming
    Ye, Wentao
    Computer Engineering and Applications, 2023, 59 (23) : 211 - 218
  • [30] A Fusion of Feature Extraction and Feature Selection Technique for Network Intrusion Detection
    Hamid, Yasir
    Sugumaran, M.
    Journaux, Ludovic
    INTERNATIONAL JOURNAL OF SECURITY AND ITS APPLICATIONS, 2016, 10 (08): : 151 - 158