A lightweight feature extraction technique for deepfake audio detection

被引:5
|
作者
Chakravarty, Nidhi [1 ]
Dua, Mohit [1 ]
机构
[1] Natl Inst Technol, Dept Comp Engn, Kurukshetra, India
关键词
Audio deepfake; Mel spectrogram; ResNet50; LDA;
D O I
10.1007/s11042-024-18217-9
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The emergence of audio deepfakes has prompted concerns over reputational integrity and dependability. Deepfakes with audio can now be produced more easily, which makes it harder to spot them. Technologies that can identify audio-level deepfakes must be developed in order to address this issue. As a result, we have recognised the importance of feature extraction for these systems and we have created an improved method for feature extraction. On audio Mel spectrogram, we have employed a modified ResNet50 to extract features. Then, Linear Discriminant Analysis (LDA) dimensionality reduction technique have been used to optimise the feature complexity. The chosen features by LDA are then utilised to train these machine learning (ML) models using the backend classification algorithms Support Vector Machine (SVM), Random Forest (RF), K-Nearest Neighbour (KNN), and Naive Bayes (NB). The ASVspoof 2019 Logical Access (LA) partition is utilised for training, ASVspoof 2021 deep fake partition are used to evaluate the systems. Also, we have used DECRO dataset for evakuating our proposed model under unseen noisy dataset. We have used 20% audios from training dataset for validation purpose. When compared to other models, our proposed method performs better than traditional feature extraction methods such as Mel Frequency Cepstral Coefficients (MFCC) and Gammatone Cepstral Coefficients (GTCC). It achieves an impressive Equal Error Rate (EER) of only 0.4% and an accuracy of 99.7%.
引用
收藏
页码:67443 / 67467
页数:25
相关论文
共 50 条
  • [41] A hierarchical feature selection strategy for deepfake video detection
    Sk Mohiuddin
    Khalid Hassan Sheikh
    Samir Malakar
    Juan D. Velásquez
    Ram Sarkar
    Neural Computing and Applications, 2023, 35 : 9363 - 9380
  • [42] Noise Robust Audio Spoof Detection Using Hybrid Feature Extraction and LCNN
    Joshi S.
    Dua M.
    SN Computer Science, 5 (4)
  • [43] UNSUPERVISED FEATURE EXTRACTION FOR MULTIMEDIA EVENT DETECTION AND RANKING USING AUDIO CONTENT
    Amid, Ehsan
    Mesaros, Annamaria
    Palomaki, Kalle J.
    Laaksonen, Jorma
    Kurimo, Mikko
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [44] Precise pitch profile feature extraction from musical audio for key detection
    Zhu, Yongwei
    Kankanhalli, Mohan S.
    IEEE TRANSACTIONS ON MULTIMEDIA, 2006, 8 (03) : 575 - 584
  • [45] Fast Audio Feature Extraction From Compressed Audio Data
    Schuller, Gerald
    Gruhne, Matthias
    Friedrich, Tobias
    IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2011, 5 (06) : 1262 - 1271
  • [46] Investigating Time-Frequency Representations for Audio Feature Extraction in Singing Technique Classification
    Yamamoto, Yuya
    Nam, Juhan
    Terasawa, Hiroko
    Hiraga, Yuzuru
    2021 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2021, : 890 - 896
  • [47] Pattern lock screen detection method based on lightweight deep feature extraction
    Fatih Ertam
    Omer Faruk Yakut
    Turker Tuncer
    Neural Computing and Applications, 2023, 35 : 1549 - 1567
  • [48] Pattern lock screen detection method based on lightweight deep feature extraction
    Ertam, Fatih
    Yakut, Omer Faruk
    Tuncer, Turker
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (02): : 1549 - 1567
  • [49] A lightweight feature extraction backbone for gangue detection with improved Resnet50
    Li, Zengsong
    Lu, Jingui
    Liu, Zeyi
    INTERNATIONAL JOURNAL OF COAL PREPARATION AND UTILIZATION, 2024,
  • [50] LSCB: a lightweight feature extraction block for SAR automatic target recognition and detection
    Zhou, Guangyu
    Yu, Jimin
    Zhou, Shangbo
    INTERNATIONAL JOURNAL OF REMOTE SENSING, 2023, 44 (08) : 2548 - 2572