Learning spatial-frequency interaction for generalizable deepfake detection

被引：0

作者：

Zhai, Tianbo ^{[1
]}

Lu, Kaiyin ^{[1
]}

Li, Jiajun ^{[1
]}

Wang, Yukai ^{[1
]}

Zhang, Wenjie ^{[2
]}

Yu, Peipeng ^{[1
]}

Xia, Zhihua ^{[1
]}

机构：

[1] Jinan Univ, Coll Cyber Secur, Engn Res Ctr Trustworthy AI, Minist Educ, Guangzhou 510632, Peoples R China

[2] Ningbo Univ, Coll Informat Sci & Engn, Ningbo, Peoples R China

来源：

IET IMAGE PROCESSING | 2024年 / 18卷 / 14期

基金：

中国国家自然科学基金; 国家重点研发计划;

关键词：

face recognition; image classification; image forensics; image processing; IMAGE FORGERY LOCALIZATION;

D O I：

10.1049/ipr2.13276

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In recent years, face forgery detection has gained significant attention, resulting in considerable advancements. However, most existing methods rely on CNNs to extract artefacts from the spatial domain, overlooking the pervasive frequency-domain artefacts present in deepfake content, which poses challenges in achieving robust and generalized detection. To address these issues, we propose the dual-stream frequency-spatial fusion network is proposed for deepfake detection. The dual-stream frequency-spatial fusion network consists of three components: the spatial forgery feature extraction module, the frequency forgery feature extraction module, and the spatial-frequency feature fusion module. The spatial forgery feature extraction module employs spatial-channel attention to extract spatial domain features, targeting artefacts in the spatial domain. The frequency forgery feature extraction module leverages the focused linear attention to detect frequency domain anomalies in internal regions, enabling the identification of generated content. The spatial-frequency feature fusion module then fuses forgery features extracted from both the spatial and frequency domains, facilitating accurate detection of splicing artefacts and internally generated forgeries. This approach enhances the model's ability to more accurately capture forgery characteristics. Extensive experiments on several widely-used benchmarks demonstrate that our carefully designed network exhibits superior generalization and robustness, significantly improving deepfake detection performance.

引用

页码：4666 / 4679

页数：14

共 50 条

[31] THE DEPENDENCE OF MONOCULAR RIVALRY ON SPATIAL-FREQUENCY - SOME INTERACTION VARIABLES
MAPPERSON, B
LOVEGROVE, W
PERCEPTION, 1984, 13 (02) : 141 - 152
[32] Towards Generalizable Deepfake Detection with Locality-Aware AutoEncoder
Du, Mengnan
Pentyala, Shiva
Li, Yuening
Hu, Xia
CIKM '20: PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, 2020, : 325 - 334
[33] Generalizable Deepfake Detection With Phase-Based Motion Analysis
Prashnani, Ekta
Goebel, Michael
Manjunath, B. S.
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2025, 34 : 100 - 112
[34] SELECTIVE DOMAIN-INVARIANT FEATURE FOR GENERALIZABLE DEEPFAKE DETECTION
Lai, Yingxin
Yang, Guoqing
He, Yifan
Luo, Zhiming
Li, Shaozi
2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024, : 2335 - 2339
[35] JOINT SPATIAL SPATIAL-FREQUENCY REPRESENTATION
JACOBSON, LD
WECHSLER, H
SIGNAL PROCESSING, 1988, 14 (01) : 37 - 68
[36] Learning Features of Intra-Consistency and Inter-Diversity: Keys Toward Generalizable Deepfake Detection
Chen, Han
Lin, Yuzhen
Li, Bin
Tan, Shunquan
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (03) : 1468 - 1480
[37] Multiple spatial-frequency channels for the detection of orientation modulation patterns
Huang, P.
Chen, I.
Lee, I.
PERCEPTION, 2001, 30 : 87 - 87
[38] Spatial-Frequency Mutual Learning for Face Super-Resolution
Wang, Chenyang
Jiang, Junjun
Zhong, Zhiwei
Liu, Xianming
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 22356 - 22366
[39] Effects of spatial-frequency uncertainty on signal-detection behaviour
Huebner, R.
PERCEPTION, 1994, 23 : 3 - 3
[40] LEARNING IN GRATING WAVEFORM DISCRIMINATION - SPECIFICITY FOR ORIENTATION AND SPATIAL-FREQUENCY
FIORENTINI, A
BERARDI, N
VISION RESEARCH, 1981, 21 (07) : 1149 - 1158

← 1 2 3 4 5 →