Deepfake Audio Detection via MFCC Features Using Machine Learning

被引:0
|
作者
Hamza, Ameer [1 ]
Javed, Abdul Rehman Rehman [2 ,3 ]
Iqbal, Farkhund [4 ]
Kryvinska, Natalia [5 ]
Almadhor, Ahmad S. [6 ]
Jalil, Zunera [2 ]
Borghol, Rouba [7 ]
机构
[1] Air University, Faculty of Computing and AI, Islamabad,44000, Pakistan
[2] Air University, Department of Cyber Security, Islamabad,44000, Pakistan
[3] Lebanese American University, Department of Electrical and Computer Engineering, Byblos, Lebanon
[4] Zayed University, College of Technological Innovation, Abu Dhabi, United Arab Emirates
[5] Comenius University in Bratislava, Faculty of Management, Department of Information Systems, Bratislava,82005, Slovakia
[6] Jouf University, College of Computer and Information Sciences, Sakaka,72388, Saudi Arabia
[7] Rochester Institute of Technology of Dubai, Dubai, United Arab Emirates
关键词
Audio acoustics - Deep learning - Learning algorithms - Speech recognition;
D O I
暂无
中图分类号
学科分类号
摘要
Deepfake content is created or altered synthetically using artificial intelligence (AI) approaches to appear real. It can include synthesizing audio, video, images, and text. Deepfakes may now produce natural-looking content, making them harder to identify. Much progress has been achieved in identifying video deepfakes in recent years; nevertheless, most investigations in detecting audio deepfakes have employed the ASVSpoof or AVSpoof dataset and various machine learning, deep learning, and deep learning algorithms. This research uses machine and deep learning-based approaches to identify deepfake audio. Mel-frequency cepstral coefficients (MFCCs) technique is used to acquire the most useful information from the audio. We choose the Fake-or-Real dataset, which is the most recent benchmark dataset. The dataset was created with a text-to-speech model and is divided into four sub-datasets: for-rece, for-2-sec, for-norm and for-original. These datasets are classified into sub-datasets mentioned above according to audio length and bit rate. The experimental results show that the support vector machine (SVM) outperformed the other machine learning (ML) models in terms of accuracy on for-rece and for-2-sec datasets, while the gradient boosting model performed very well using for-norm dataset. The VGG-16 model produced highly encouraging results when applied to the for-original dataset. The VGG-16 model outperforms other state-of-the-art approaches. © 2013 IEEE.
引用
收藏
页码:134018 / 134028
相关论文
共 50 条
  • [21] Does Audio Deepfake Detection Generalize?
    Mueller, Nicolas M.
    Czempin, Pavel
    Diekmann, Franziska
    Froghyar, Adam
    Bottinger, Konstantin
    INTERSPEECH 2022, 2022, : 2783 - 2787
  • [22] Deepfake audio detection by speaker verification
    Pianese, Alessandro
    Cozzolino, Davide
    Poggi, Giovanni
    Verdoliva, Luisa
    2022 IEEE INTERNATIONAL WORKSHOP ON INFORMATION FORENSICS AND SECURITY (WIFS), 2022,
  • [23] Deepfake Video Detection via Predictive Representation Learning
    Ge, Shiming
    Lin, Fanzhao
    Li, Chenyu
    Zhang, Daichi
    Wang, Weiping
    Zeng, Dan
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2022, 18 (02)
  • [24] Unmasking the Fake: Machine Learning Approach for Deepfake Voice Detection
    Gujjar, Muhammad Usama Tanveer
    Munir, Kashif
    Amjad, Madiha
    Rehman, Atiq Ur
    Bermak, Amine
    IEEE ACCESS, 2024, 12 : 197442 - 197453
  • [25] DeepfakeUCL: Deepfake Detection via Unsupervised Contrastive Learning
    Fung, Sheldon
    Lu, Xuequan
    Zhang, Chao
    Li, Chang-Tsun
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [26] Compressed Domain Invariant Adversarial Representation Learning for Robust Audio Deepfake Detection
    Yuan, Chengsheng
    Chen, Yifei
    Zhou, Zhili
    Xia, Zhihua
    Huang, Yongfeng
    IEEE SIGNAL PROCESSING LETTERS, 2025, 32 : 1111 - 1115
  • [27] What to Remember: Self-Adaptive Continual Learning for Audio Deepfake Detection
    Zhang, Xiaohui
    Yi, Jiangyan
    Wang, Chenglong
    Zhang, Chu Yuan
    Zeng, Siding
    Tao, Jianhua
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 17, 2024, : 19569 - 19577
  • [28] Targeted Augmented Data for Audio Deepfake Detection
    Astrid, Marcella
    Ghorbel, Enjie
    Aouada, Djamila
    32ND EUROPEAN SIGNAL PROCESSING CONFERENCE, EUSIPCO 2024, 2024, : 346 - 350
  • [29] Anomaly Detection of Deepfake Audio Based on Real Audio Using Generative Adversarial Network Model
    Song, Daeun
    Lee, Nayoung
    Kim, Jiwon
    Choi, Eunjung
    IEEE ACCESS, 2024, 12 : 184311 - 184326
  • [30] Whisper plus AASIST for DeepFake Audio Detection
    Luo, Qian
    Sivasundari, Kalyani Vinayagam
    HCI FOR CYBERSECURITY, PRIVACY AND TRUST, PT II, HCI-CPT 2024, 2024, 14729 : 121 - 133