FTDKD: Frequency-Time Domain Knowledge Distillation for Low-Quality Compressed Audio Deepfake Detection

被引:0
|
作者
Wang, Bo [1 ]
Tang, Yeling [1 ]
Wei, Fei [2 ]
Ba, Zhongjie [3 ]
Ren, Kui [3 ]
机构
[1] Dalian Univ Technol, Sch Informat & Commun Engn, Dalian 116081, Peoples R China
[2] Alibaba Grp, Hangzhou 311121, Zhejiang, Peoples R China
[3] Zhejiang Univ, Sch Cyber Sci & Technol, Hangzhou 310027, Zhejiang, Peoples R China
基金
中国国家自然科学基金;
关键词
Audio deepfake detection; low-quality compressed audio; knowledge distillation;
D O I
10.1109/TASLP.2024.3492796
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In recent years, the field of audio deepfake detection has witnessed significant advancements. Nonetheless, the majority of solutions have concentrated on high-quality audio, largely overlooking the challenge of low-quality compressed audio in real-world scenarios. Low-quality compressed audio typically suffers from a loss of high-frequency details and time-domain information, which significantly undermines the performance of advanced deepfake detection systems when confronted with such data. In this paper, we introduce a deepfake detection model that employs knowledge distillation across the frequency and time domains. Our approach aims to train a teacher model with high-quality data and a student model with low-quality compressed data. Subsequently, we implement frequency-domain and time-domain distillation to facilitate the student model's learning of high-frequency information and time-domain details from the teacher model. Experimental evaluations on the ASVspoof 2019 LA and ASVspoof 2021 DF datasets illustrate the effectiveness of our methodology. On the ASVspoof 2021 DF dataset, which consists of low-quality compressed audio, we achieved an Equal Error Rate (EER) of 2.82%. To our knowledge, this performance is the best among all deepfake voice detection systems tested on the ASVspoof 2021 DF dataset. Additionally, our method proves to be versatile, showing notable performance on high-quality data with an EER of 0.30% on the ASVspoof 2019 LA dataset, closely approaching state-of-the-art results.
引用
收藏
页码:4905 / 4918
页数:14
相关论文
共 37 条
  • [1] ADD: Frequency Attention and Multi-View Based Knowledge Distillation to Detect Low-Quality Compressed Deepfake Images
    Le Minh Binh
    Woo, Simon
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 122 - 130
  • [2] Low-Quality Deepfake Detection via Unseen Artifacts
    Chhabra S.
    Thakral K.
    Mittal S.
    Vatsa M.
    Singh R.
    IEEE Transactions on Artificial Intelligence, 2024, 5 (04): : 1573 - 1585
  • [3] Cross-Domain Deepfake Detection Based on Latent Domain Knowledge Distillation
    Wang, Chunpeng
    Meng, Lingshan
    Xia, Zhiqiu
    Ren, Na
    Ma, Bin
    IEEE SIGNAL PROCESSING LETTERS, 2025, 32 : 896 - 900
  • [4] Compressed Domain Invariant Adversarial Representation Learning for Robust Audio Deepfake Detection
    Yuan, Chengsheng
    Chen, Yifei
    Zhou, Zhili
    Xia, Zhihua
    Huang, Yongfeng
    IEEE SIGNAL PROCESSING LETTERS, 2025, 32 : 1111 - 1115
  • [5] Employing Super Resolution to Improve Low-Quality Deepfake Detection
    Perera, Anjana Samindra
    Atukorale, Ajantha S.
    Kumarasinghe, Prabhash
    2022 22ND INTERNATIONAL CONFERENCE ON ADVANCES IN ICT FOR EMERGING REGIONS (ICTER), 2022,
  • [6] Spatial-frequency feature fusion based deepfake detection through knowledge distillation
    Wang, Bo
    Wu, Xiaohan
    Wang, Fei
    Zhang, Yushu
    Wei, Fei
    Song, Zengren
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 133
  • [7] Low-Quality Error Detection for Noisy Knowledge Graphs
    Bu, Chenyang
    Yu, Xingchen
    Hong, Yan
    Jiang, Tingting
    JOURNAL OF DATABASE MANAGEMENT, 2021, 32 (04) : 48 - 64
  • [8] Low-Quality Deepfake Video Detection Model Targeting Compression-Degraded Spatiotemporal Inconsistencies
    Mi, Zhongjie
    Jiang, Xinghao
    Sun, Tanfeng
    Xu, Ke
    Xu, Qiang
    Meng, Laijin
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT IX, ICIC 2024, 2024, 14870 : 267 - 280
  • [9] Exposing low-quality deepfake videos of Social Network Service using Spatial Restored Detection Framework
    Li, Ying
    Bian, Shan
    Wang, Chuntao
    Polat, Kemal
    Alhudhaif, Adi
    Alenezi, Fayadh
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 231
  • [10] Towards Automatically Refining Low-Quality Domain Knowledge: A Case Study in Healthcare
    Bielski, Pawel
    Jendral, Soenke
    Witterauf, Lena
    Bach, Jakob
    MACHINE LEARNING AND PRINCIPLES AND PRACTICE OF KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2023, PT III, 2025, 2135 : 361 - 367