Detecting Audio Deepfakes: Integrating CNN and BiLSTM with Multi-Feature Concatenation

Cited by: 0

Authors:

Wani, Taiba Majid [1]

Qadri, Syed Asif Ahmad [2]

Comminiello, Danilo [1]

Amerini, Irene [1]

Affiliations:

[1] Sapienza Univ Rome, Rome, Italy

[2] Natl Tsing Hua Univ, Hsinchu, Taiwan

Source:

PROCEEDINGS OF THE 2024 ACM WORKSHOP ON INFORMATION HIDING AND MULTIMEDIA SECURITY, IH&MMSEC 2024 | 2024

Keywords:

Audio Deepfakes; Feature Concatenation; MFCC; CQCC; CQT; Mel spectrograms; CNN; BiLSTM;

DOI:

10.1145/3658664.3659647

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812;

Abstract:

Audio deepfake detection is emerging as a crucial field in digital media, as distinguishing real audio from deepfakes becomes increasingly challenging due to the advancement of deepfake technologies. These technologies threaten information authenticity and pose serious security risks. Addressing this challenge, we propose a novel architecture that combines Convolutional Neural Networks (CNN) and Bidirectional Long Short-Term Memory (BiLSTM) for effective deepfake audio detection. Our approach is distinguished by the feature concatenation of a comprehensive set of acoustic features: Mel Frequency Cepstral Coefficients (MFCC), Mel spectrograms, Constant Q Cepstral Coefficients (CQCC), and Constant-Q Transform (CQT) vectors. In the proposed architecture, features processed by a CNN are concatenated into two multi-dimensional features for comprehensive analysis, then analyzed by a BiLSTM network to capture temporal dynamics and contextual dependencies in audio data. This synergistic method ensures an understanding of both spatial and sequential audio characteristics. We validate our model on the ASVspoof 2019 and FoR datasets, using accuracy and Equal Error Rate (EER) as evaluation metrics.
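
The two generic ideas named in the abstract, feature concatenation and the Equal Error Rate, can be illustrated with a minimal NumPy sketch. It does not reproduce the paper's CNN or BiLSTM, and it does not claim to match how the authors pair or process the features. The function names (`concat_features`, `equal_error_rate`), the truncation to the shortest frame count, and the convention that a higher score means "more likely bona fide" are all assumptions made for illustration.

```python
import numpy as np


def concat_features(features):
    """Concatenate per-frame feature matrices along the feature axis.

    Each input has shape (frames, dims). Frame counts may differ slightly
    between extractors, so every matrix is truncated to the shortest one
    (an assumption of this sketch, not something stated in the abstract).
    """
    n_frames = min(f.shape[0] for f in features)
    return np.concatenate([f[:n_frames] for f in features], axis=1)


def equal_error_rate(scores, labels):
    """Approximate EER by sweeping thresholds over the observed scores.

    Assumed convention: labels are 1 for bona fide and 0 for spoof, and a
    higher score means "more likely bona fide".
    """
    scores = np.asarray(scores, dtype=float)
    labels = np.asarray(labels, dtype=int)
    genuine = scores[labels == 1]
    spoof = scores[labels == 0]

    # Candidate thresholds: every observed score, plus one above the maximum
    # so that the "reject everything" operating point is included.
    thresholds = np.unique(scores)
    thresholds = np.concatenate([thresholds, [thresholds[-1] + 1.0]])

    # False rejection rate: bona fide samples scored below the threshold.
    frr = np.array([(genuine < t).mean() for t in thresholds])
    # False acceptance rate: spoofed samples scored at or above the threshold.
    far = np.array([(spoof >= t).mean() for t in thresholds])

    # The EER is taken where the two error rates are closest to each other.
    idx = np.argmin(np.abs(far - frr))
    return (far[idx] + frr[idx]) / 2.0


# Placeholder arrays standing in for real acoustic features; the dimensions
# below are arbitrary and chosen only to show the resulting shape.
mfcc = np.zeros((100, 20))
mel = np.zeros((98, 128))
fused = concat_features([mfcc, mel])  # shape (98, 148)
```

In practice the placeholder arrays would be replaced by features extracted from audio, and the scores passed to `equal_error_rate` would come from a trained classifier.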


Pages: 271-276

Page count: 6
