Voice spoofing detection corpus for single and multi-order audio replays

被引:19
|
作者
Baumann, Roland [1 ]
Malik, Khalid Mahmood [1 ]
Javed, Ali [1 ]
Ball, Andersen [1 ]
Kujawa, Brandon [1 ]
Malik, Hafiz [2 ]
机构
[1] Oakland Univ, Dept Comp Sci & Engn, Rochester, MI 48309 USA
[2] Univ Michigan, Elect & Comp Engn, Dearborn, MI 48128 USA
来源
基金
美国国家科学基金会;
关键词
Multi-order voice replay attack; Internet of multimedia things; Voice replay spoofing; Voice controlled devices; Automatic speaker verification anti-spoofing; Voice spoofing dataset;
D O I
10.1016/j.csl.2020.101132
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The evolution of modern voice-controlled devices (VCDs) has revolutionized the Internet of Things (IoT) and resulted in the increased realization of smart homes, personalization, and home automation through voice commands. These VCDs can be exploited in IoT driven environments to generate various spoofing attacks, including the chaining of replay attacks (i.e. multi-order replay attacks). Existing datasets like ASVspoof 2017, ASVspoof 2019, and ReMASC contain only first-order replay recordings (i.e. replayed once); therefore, they cannot offer evaluation of anti-spoofing algorithms capable of detecting multi-order replay attacks. Additionally, large-scale datasets like ASVspoof 2017 and ASVspoof 2019 do not capture the characteristics of microphone arrays, which are an essential characteristic of modern VCDs. Therefore, there exists a need for a diverse replay spoofing detection corpus that consists of multi-order replay recordings against bona fide voice samples. This paper presents a novel voice spoofing detection corpus (VSDC) to evaluate the performance of multi-order replay anti-spoofing methods. The proposed VSDC consists of first-order (i.e. replayed once) and second-order replay (i.e. replayed twice) samples against the bona fide audio recordings. We ensured to create a diverse replay spoofing detection corpus in terms of environments, recording and playback devices, speakers, configurations, replay scenarios, etc. More specifically, we used 35 microphones, 25 different recording configurations, and 60 different playback configurations for first- and second-order replays to generate a total of 14,050 samples belonging to 19 speakers. Additionally, the proposed VSDC can also be used to evaluate the performance of speaker verification systems in terms of independent speaker verification. To the best of our knowledge, this is the first publicly available replay spoofing detection corpus comprised of first and second-order replay samples. Experimental results signify the effectiveness of the proposed VSDC in terms of evaluating the performance of anti-spoofing methods under multi-order replay attacks and diverse conditions. (C) 2020 Elsevier Ltd. All rights reserved.
引用
收藏
页数:14
相关论文
共 40 条
  • [1] A Pre-Training Framework Based on Multi-Order Acoustic Simulation for Replay Voice Spoofing Detection
    Go, Changhwan
    Park, Nam In
    Jeon, Oc-Yeub
    Chun, Chanjun
    SENSORS, 2023, 23 (16)
  • [2] Robust Audio Anti-Spoofing with Fusion-Reconstruction Learning on Multi-Order Spectrograms
    Wen, Penghui
    Hu, Kun
    Yue, Wenxi
    Zhang, Sen
    Zhou, Wanlei
    Wang, Zhiyong
    INTERSPEECH 2023, 2023, : 271 - 275
  • [3] Towards protecting cyber-physical and IoT systems from single- and multi-order voice spoofing attacks
    Javed, Ali
    Malik, Khalid Mahmood
    Irtaza, Aun
    Malik, Hafiz
    APPLIED ACOUSTICS, 2021, 183
  • [4] POCO: a Voice Spoofing and Liveness Detection Corpus based on Pop Noise
    Akimoto, Kosuke
    Liew, Seng Pei
    Mishima, Sakiko
    Mizushima, Ryo
    Lee, Kong Aik
    INTERSPEECH 2020, 2020, : 1081 - 1085
  • [5] Multi-Order Networks for Action Unit Detection
    Tallec, Gauthier
    Dapogny, Arnaud
    Bailly, Kevin
    IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2023, 14 (04) : 2876 - 2888
  • [6] Drone Detection Based on Multi-order Kinematic Parameters
    Liu, Sun-Xiang-Yu
    Li, Gui-Tao
    Zhan, Ya-Feng
    Gao, Peng
    Zidonghua Xuebao/Acta Automatica Sinica, 2022, 48 (06): : 1429 - 1447
  • [7] Detection Evolution with Multi-Order Contextual Co-occurrence
    Chen, Guang
    Ding, Yuanyuan
    Xiao, Jing
    Han, Tony X.
    2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, : 1798 - 1805
  • [8] A Multi-Order Markov Chain Based Scheme for Anomaly Detection
    Sha, Wenyao
    Zhu, Yongxin
    Huang, Tian
    Qiu, Meikang
    Zhu, Yan
    Zhang, Qiannan
    2013 IEEE 37TH ANNUAL COMPUTER SOFTWARE AND APPLICATIONS CONFERENCE WORKSHOPS (COMPSACW), 2013, : 83 - 88
  • [9] Event Detection with Multi-Order Graph Convolution and Aggregated Attention
    Yan, Haoran
    Jin, Xiaolong
    Meng, Xiangbin
    Guo, Jiafeng
    Cheng, Xueqi
    2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 5766 - 5770
  • [10] Audio Multi-View Spoofing Detection Framework Based on Audio-Text-Emotion Correlations
    Wu, Junyan
    Yin, Qilin
    Sheng, Ziqi
    Lu, Wei
    Huang, Jiwu
    Li, Bin
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2024, 19 : 7133 - 7146