Detecting Audio Deepfakes: Integrating CNN and BiLSTM with Multi-Feature Concatenation

Cited by: 0
Authors
Wani, Taiba Majid [1]
Qadri, Syed Asif Ahmad [2]
Comminiello, Danilo [1]
Amerini, Irene [1]
Affiliations
[1] Sapienza Univ Rome, Rome, Italy
[2] Natl Tsing Hua Univ, Hsinchu, Taiwan
Keywords
Audio Deepfakes; Feature Concatenation; MFCC; CQCC; CQT; Mel spectrograms; CNN; BiLSTM
DOI
10.1145/3658664.3659647
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Audio deepfake detection is emerging as a crucial field in digital media, as advances in deepfake technologies make it increasingly difficult to distinguish real audio from fake. These technologies threaten information authenticity and pose serious security risks. To address this challenge, we propose a novel architecture that combines Convolutional Neural Networks (CNN) and Bidirectional Long Short-Term Memory (BiLSTM) for effective deepfake audio detection. Our approach is distinguished by the concatenation of a comprehensive set of acoustic features: Mel Frequency Cepstral Coefficients (MFCC), Mel spectrograms, Constant Q Cepstral Coefficients (CQCC), and Constant-Q Transform (CQT) vectors. In the proposed architecture, features processed by a CNN are concatenated into two multi-dimensional features, which a BiLSTM network then analyzes to capture temporal dynamics and contextual dependencies in the audio data. This combination captures both spatial and sequential audio characteristics. We validate our model on the ASVspoof 2019 and FoR datasets, using accuracy and Equal Error Rate (EER) as evaluation metrics.
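The abstract names Equal Error Rate (EER) as an evaluation metric without defining it. EER is the operating point at which the false acceptance rate (spoofed audio accepted as real) equals the false rejection rate (real audio rejected as fake). The following is a minimal sketch of one common way to estimate it from detector scores, not the authors' evaluation code. The function name `equal_error_rate`, the score convention (higher means more likely bona fide), and the label encoding (1 = bona fide, 0 = spoof) are all assumptions made for illustration.

```python
def equal_error_rate(scores, labels):
    """Estimate the EER and the threshold at which it occurs.

    Assumptions (illustrative, not taken from the paper):
      - a higher score means "more likely bona fide"
      - labels use 1 for bona fide audio and 0 for spoofed audio
      - a sample is accepted when score >= threshold
    """
    genuine = [s for s, y in zip(scores, labels) if y == 1]
    spoof = [s for s, y in zip(scores, labels) if y == 0]
    if not genuine or not spoof:
        raise ValueError("need at least one sample of each class")

    best = None  # (gap, eer_estimate, threshold)
    # Every observed score is a candidate decision threshold.
    for threshold in sorted(set(scores)):
        # False acceptance rate: spoofed samples that pass the threshold.
        far = sum(s >= threshold for s in spoof) / len(spoof)
        # False rejection rate: bona fide samples that fall below it.
        frr = sum(s < threshold for s in genuine) / len(genuine)
        gap = abs(far - frr)
        # Keep the threshold where the two error rates are closest.
        if best is None or gap < best[0]:
            best = (gap, (far + frr) / 2, threshold)

    return best[1], best[2]
```

For example, with scores `[0.9, 0.8, 0.7, 0.6, 0.4, 0.3, 0.2, 0.1]` and labels `[1, 1, 1, 0, 1, 0, 0, 0]`, one spoofed sample (0.6) outranks one bona fide sample (0.4), so both error rates meet at 25% at threshold 0.6. On a finite test set the two rates rarely cross exactly, which is why this sketch averages them at the closest point. Other implementations interpolate along the ROC curve instead.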
Pages: 271-276
Number of pages: 6