USING SELF ATTENTION DNNS TO DISCOVER PHONEMIC FEATURES FOR AUDIO DEEP FAKE DETECTION

被引:2
|
作者
Dhamyal, Hira [1 ]
Ali, Ayesha [1 ]
Qazi, Ihsan Ayyub [1 ]
Raza, Agha Ali [1 ]
机构
[1] Lahore Univ Management Sci, Lahore, Pakistan
关键词
spoof; bonafide; countermeasure; attention; phonemes; deep neural network; senet; explainable; fair; small datasets; forensics; deepfake; SPEECH;
D O I
10.1109/ASRU51503.2021.9688312
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the advancement in natural-sounding speech production models, it is becoming important to develop models that can detect spoofed audios. Synthesized speech models do not explicitly account for all factors affecting speech production, such as the shape, size and structure of a speaker's vocal tract. In this paper, we hypothesize that due to practical limitations of audio corpora (including size, distribution, and balance of variables like gender, age, and accents), there exist certain phonemes that synthesized models are not able to replicate as well as the human articulation system and such phonemes differ in their spectral characteristics from bonafide speech. To discover such phonemes and quantify their effectiveness in distinguishing between spoofed and bonafide speech, we use a deep learning model with self-attention, and analyze the attention weights of the trained model. We use the ASVSpoof2019 dataset for our analysis and find that the attention mechanism picks most on fricatives: /S/,/SH/, nasals: /M/,/N/, vowels: /Y/, and stops: /D/. Furthermore, we obtain 7.54% EER on train and 11.98% on dev data when using only the top-16 most attended phonemes from input audio, better than when any other phoneme classes are used.
引用
收藏
页码:1178 / 1184
页数:7
相关论文
共 50 条
  • [31] Arabic Fake News Detection Using Deep Learning
    Fouad, Khaled M.
    Sabbeh, Sahar F.
    Medhat, Walaa
    CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 71 (02): : 3647 - 3665
  • [32] Detection of Online Fake Review Using Deep Learning
    Monisha, G. B.
    Nayak, Jyothi S.
    SMART TRENDS IN COMPUTING AND COMMUNICATIONS, VOL 4, SMARTCOM 2024, 2024, 948 : 161 - 172
  • [33] Arabic Fake News Detection Using Deep Learning
    Othman, Nermin Abdelhakim
    Elzanfaly, Doaa S.
    Elhawary, Mostafa Mahmoud M.
    IEEE ACCESS, 2024, 12 : 122363 - 122376
  • [34] Deep fake detection using an optimal deep learning model with multi head attention-based feature extraction scheme
    Sekar, R. Raja
    Rajkumar, T. Dhiliphan
    Anne, Koteswara Rao
    VISUAL COMPUTER, 2025, 41 (04): : 2783 - 2800
  • [35] AUDIO RECAPTURE DETECTION USING DEEP LEARNING
    Luo, Da
    Wu, Haojun
    Huang, Jiwu
    2015 IEEE CHINA SUMMIT & INTERNATIONAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING, 2015, : 478 - 482
  • [36] Multiple features based approach for automatic fake news detection on social networks using deep learning
    Sahoo, Somya Ranjan
    Gupta, B. B.
    APPLIED SOFT COMPUTING, 2021, 100
  • [37] Sentence Matching with Deep Self-attention and Co-attention Features
    Wang, Zhipeng
    Yan, Danfeng
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, KSEM 2021, PT II, 2021, 12816 : 550 - 561
  • [38] MFAN: Multi-Level Features Attention Network for Fake Certificate Image Detection
    Sun, Yu
    Ni, Rongrong
    Zhao, Yao
    ENTROPY, 2022, 24 (01)
  • [39] Fake news stance detection using selective features and FakeNET
    Aljrees, Turki
    Cheng, Xiaochun
    Ahmed, Mian Muhammad
    Umer, Muhammad
    Majeed, Rizwan
    Alnowaiser, Khaled
    Abuzinadah, Nihal
    Ashraf, Imran
    PLOS ONE, 2023, 18 (07):
  • [40] Fake-checker: A fusion of texture features and deep learning for deepfakes detection
    ul Huda, Noor
    Javed, Ali
    Maswadi, Kholoud
    Alhazmi, Ali
    Ashraf, Rehan
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (16) : 49013 - 49037