Fast and Lightweight Voice Replay Attack Detection via Time-Frequency Spectrum Difference

被引:0
|
作者
He, Ruiwen [1 ]
Cheng, Yushi [2 ]
Zheng, Zhicong [1 ]
Ji, Xiaoyu [1 ]
Xu, Wenyuan [1 ]
机构
[1] Zhejiang Univ, Coll Elect Engn, Ubiquitous Syst Secur Lab, Hangzhou 310027, Peoples R China
[2] Zhejiang Univ, ZJU UIUC Inst, Ubiquitous Syst Secur Lab, Hangzhou 310027, Peoples R China
来源
IEEE INTERNET OF THINGS JOURNAL | 2024年 / 11卷 / 18期
基金
中国国家自然科学基金;
关键词
Acoustic feature; defense; replay attack; security measurement; statistical analysis;
D O I
10.1109/JIOT.2024.3406962
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Due to the open nature of voice and voice interface, an adversary can spoof voice recognition systems by replaying prerecorded voice commands from legitimate users, known as the voice replay attack. Existing detection methods against voice replay attacks mainly rely on extra hardware to determine the sound source or require excessive computing resources to train a classifier with abundant acoustic features. In this article, we propose Anti-Replay, a fast and lightweight detection system for voice replay attacks. To overcome the challenge of redundant classification features and complex calculation, we first investigate the time-frequency spectrum difference between the genuine human voice and the replayed audio caused by the nonlinear distortion of the attacker's microphones and speakers. Then, we design 5 types with a total of 77 features in both the time and frequency domains and propose a convolutional neural network classifier SE-ResNet50 for attack detection. Evaluations against the data sets of ASVspoof2017, ASVspoof2019, and ASVspoof2021 demonstrate that Anti-Replay can achieve an average equal error rate (EER) of 1.36% across three data sets. Meanwhile, Anti-Replay decreases the training time by 52.3% and 90.2% and decreases the model size by 83.5% and 99.9% compared with the baseline model constant-Q cepstral coefficient-Gaussian mixture model and the state-of-the-art method Res2Net. We have also confirmed that our system is effective in detecting the adaptive replay attack.
引用
收藏
页码:29798 / 29810
页数:13
相关论文
共 50 条
  • [1] Replay attack detection by channel frequency response difference enhancement
    Guo, Xingchen
    Yu, Yibiao
    FOURTH INTERNATIONAL WORKSHOP ON PATTERN RECOGNITION, 2019, 11198
  • [2] Replay Attack Detection Based on High Frequency Missing Spectrum
    Yuan, Junming
    Ablimit, Mijit
    Hamdulla, Askar
    INFORMATION, 2023, 14 (01)
  • [3] VOICE QUALITY FEATURES FOR REPLAY ATTACK DETECTION
    Woubie, Abraham
    Backstrom, Tom
    2022 30TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2022), 2022, : 384 - 388
  • [4] Cooperative Time-Frequency Localization for Wideband Spectrum Sensing With A Lightweight Detector
    Zhao, Runyi
    Ruan, Yuhan
    Li, Yongzhao
    IEEE COMMUNICATIONS LETTERS, 2023, 27 (07) : 1844 - 1848
  • [5] Replay attack detection based on distortion by loudspeaker for voice authentication
    Ren, Yanzhen
    Fang, Zhong
    Liu, Dengkai
    Chen, Changwen
    MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (07) : 8383 - 8396
  • [6] Replay attack detection based on distortion by loudspeaker for voice authentication
    Yanzhen Ren
    Zhong Fang
    Dengkai Liu
    Changwen Chen
    Multimedia Tools and Applications, 2019, 78 : 8383 - 8396
  • [7] FAST KEYWORD DETECTION WITH SPARSE TIME-FREQUENCY MODELS
    Kokiopoulou, Effrosyni
    Frossard, Pascal
    Verscheure, Olivier
    2008 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-4, 2008, : 1081 - +
  • [8] Replay Attack Detection Based on Voice and Non-voice Sections for Speaker Verification
    Mills, Ananda Garin
    Kaewcharuay, Patthranit
    Sathirasattayanon, Pannathorn
    Duangpummet, Suradej
    Galajit, Kasorn
    Karnjana, Jessada
    Aimmanee, Pakinee
    PROCEEDINGS OF 2022 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2022, : 221 - 226
  • [9] An LDoS attack detection method based on FSWT time-frequency distribution
    Wang, Xiaocai
    Tang, Dan
    Feng, Ye
    Qin, Zheng
    Xiong, Bing
    Liu, Yufeng
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 256
  • [10] Detection of changes in clock noise using the time-frequency spectrum
    Galleani, Lorenzo
    METROLOGIA, 2008, 45 (06) : S143 - S153