Fast and Lightweight Voice Replay Attack Detection via Time-Frequency Spectrum Difference

被引:0
|
作者
He, Ruiwen [1 ]
Cheng, Yushi [2 ]
Zheng, Zhicong [1 ]
Ji, Xiaoyu [1 ]
Xu, Wenyuan [1 ]
机构
[1] Zhejiang Univ, Coll Elect Engn, Ubiquitous Syst Secur Lab, Hangzhou 310027, Peoples R China
[2] Zhejiang Univ, ZJU UIUC Inst, Ubiquitous Syst Secur Lab, Hangzhou 310027, Peoples R China
来源
IEEE INTERNET OF THINGS JOURNAL | 2024年 / 11卷 / 18期
基金
中国国家自然科学基金;
关键词
Acoustic feature; defense; replay attack; security measurement; statistical analysis;
D O I
10.1109/JIOT.2024.3406962
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Due to the open nature of voice and voice interface, an adversary can spoof voice recognition systems by replaying prerecorded voice commands from legitimate users, known as the voice replay attack. Existing detection methods against voice replay attacks mainly rely on extra hardware to determine the sound source or require excessive computing resources to train a classifier with abundant acoustic features. In this article, we propose Anti-Replay, a fast and lightweight detection system for voice replay attacks. To overcome the challenge of redundant classification features and complex calculation, we first investigate the time-frequency spectrum difference between the genuine human voice and the replayed audio caused by the nonlinear distortion of the attacker's microphones and speakers. Then, we design 5 types with a total of 77 features in both the time and frequency domains and propose a convolutional neural network classifier SE-ResNet50 for attack detection. Evaluations against the data sets of ASVspoof2017, ASVspoof2019, and ASVspoof2021 demonstrate that Anti-Replay can achieve an average equal error rate (EER) of 1.36% across three data sets. Meanwhile, Anti-Replay decreases the training time by 52.3% and 90.2% and decreases the model size by 83.5% and 99.9% compared with the baseline model constant-Q cepstral coefficient-Gaussian mixture model and the state-of-the-art method Res2Net. We have also confirmed that our system is effective in detecting the adaptive replay attack.
引用
收藏
页码:29798 / 29810
页数:13
相关论文
共 50 条
  • [21] Detection of heart blocks in ECG signals by spectrum and time-frequency analysis
    Saad, Norhashimah Mohd
    Abdullah, Abdul Rahim
    Low, Yin Fen
    2006 4TH STUDENT CONFERENCE ON RESEARCH AND DEVELOPMENT, 2006, : 61 - +
  • [22] Approach for fast time-frequency analysis
    Cheng, Chun Hing
    Pradhan, Pyari Mohan
    Mitchell, Joseph Ross
    IET SIGNAL PROCESSING, 2014, 8 (04) : 360 - 372
  • [23] Hyperbolic time-frequency power spectrum
    Le, KN
    Dabke, KP
    Egan, GK
    PROCEEDINGS OF THE 3RD IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY, 2003, : 705 - 708
  • [24] On the time-frequency detection of chirps
    Chassande-Mottin, E
    Flandrin, P
    APPLIED AND COMPUTATIONAL HARMONIC ANALYSIS, 1999, 6 (02) : 252 - 281
  • [25] Face Presentation Attack Detection Across Spectrum using Time-Frequency Descriptors of Maximal Response in Laplacian Scale-Space
    Raghavendra, R.
    Raja, Kiran B.
    Marcel, Sebastien
    Busch, Christoph
    2016 SIXTH INTERNATIONAL CONFERENCE ON IMAGE PROCESSING THEORY, TOOLS AND APPLICATIONS (IPTA), 2016,
  • [26] Time-frequency transformation integrated with a lightweight convolutional neural network for detection of myocardial infarction
    Sheth, Kashvi Ankitbhai
    Upreti, Charvi
    Prusty, Manas Ranjan
    Satapathy, Sandeep Kumar
    Mishra, Shruti
    Cho, Sung-Bae
    BMC MEDICAL IMAGING, 2024, 24 (01):
  • [27] Measuring transmitter attack time through time-frequency representations
    Angrisani, Leopoldo
    D'apuzzo, Massimo
    D'arco, Mauro
    Lo Moriello, Rosario Schiano
    Vadursi, Michele
    Recent Researches in Telecommunications, Informatics, Electronics and Signal Processing - TELE-INFO'11, MINO'11, SIP'11, 2011, : 201 - 206
  • [28] A Lightweight Replay Attack Detection Framework for Battery Depended IoT Devices Designed for Healthcare
    Rughoobur, Paavan
    Nagowah, Leckraj
    2017 INTERNATIONAL CONFERENCE ON INFOCOM TECHNOLOGIES AND UNMANNED SYSTEMS (TRENDS AND FUTURE DIRECTIONS) (ICTUS), 2017, : 811 - 817
  • [29] Measuring Transmitter Attack Time through Time-Frequency Representations
    Angrisani, Leopoldo
    D'Apuzzo, Massimo
    D'Arco, Mauro
    Lo Moriello, Rosario Schiano
    Vadursi, Michele
    RECENT RESEARCHES IN TELECOMMUNICATIONS, INFORMATICS, ELECTRONICS & SIGNAL PROCESSING, 2011, : 201 - +
  • [30] Frequency Domain Linear Prediction Features for Replay Spoofing Attack Detection
    Wickramasinghe, Buddhi
    Irtza, Saad
    Ambikairajah, Eliathamby
    Epps, Julien
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 661 - 665