Replay Speech Detection Based on Dual-Input Hierarchical Fusion Network

被引:0
|
作者
Hu, Chenlei [1 ]
Zhou, Ruohua [1 ]
Yuan, Qingsheng [2 ]
机构
[1] Beijing Univ Civil Engn & Architecture, Sch Elect & Informat Engn, Beijing 102627, Peoples R China
[2] Natl Comp Network Emergency Response Tech Team Coo, Beijing 100029, Peoples R China
来源
APPLIED SCIENCES-BASEL | 2023年 / 13卷 / 09期
关键词
anti-spoofing; replay speech detection; HFM; ASVspoof; 2021; PATTERN;
D O I
10.3390/app13095350
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Speech anti-spoofing is a crucial aspect of speaker recognition systems and has received a great deal of attention in recent years. Deep neural networks have achieved satisfactory results in datasets with similar training and testing data distributions, but their generalization ability is limited in datasets with different distributions. In this paper, we proposed a novel dual-input hierarchical fusion network (HFN) to improve the generalization ability of our model. The network had two inputs (the original speech signal and the time-reversed signal), which increased the volume and diversity of the training data. The hierarchical fusion model (HFM) enabled more thorough fusion of information from different input levels and improved model performance by fusing the two inputs after speech feature extraction. We finally evaluated the results using the ASVspoof 2021 PA (Physical Access) dataset, and the proposed system achieved an Equal Error Rate (EER) of 24.46% and a minimum tandem Detection Cost Function (min t-DCF) of 0.6708 in the test set. Compared with the four baseline systems in the ASVspoof 2021 competition, the proposed system min t-DCF values were decreased by 28.9%, 31.0%, 32.6%, and 32.9%, and the EERs were decreased by 35.7%, 38.1%, 45.4%, and 49.7%, respectively.
引用
收藏
页数:10
相关论文
共 50 条
  • [11] Seizure prediction in scalp EEG based channel attention dual-input convolutional neural network
    Sun, Biao
    Lv, Jia-Jun
    Rui, Lin-Ge
    Yang, Yu-Xuan
    Chen, Yun-Gang
    Ma, Chao
    Gao, Zhong-Ke
    PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2021, 584
  • [12] Optimisation of a Doherty power amplifier based on dual-input characterisation
    Piacibello, Anna
    Quaglia, Roberto
    Camarchia, Vittorio
    Ramella, Chiara
    Pirola, Marco
    2019 IEEE INTERNATIONAL CONFERENCE ON MICROWAVES, ANTENNAS, COMMUNICATIONS AND ELECTRONIC SYSTEMS (COMCAS), 2019,
  • [13] Scene-adaptive crowd counting method based on meta learning with dual-input network DMNet
    Zhao, Haoyu
    Min, Weidong
    Xu, Jianqiang
    Wang, Qi
    Zou, Yi
    Fu, Qiyan
    FRONTIERS OF COMPUTER SCIENCE, 2023, 17 (01)
  • [14] Scene-adaptive crowd counting method based on meta learning with dual-input network DMNet
    Haoyu Zhao
    Weidong Min
    Jianqiang Xu
    Qi Wang
    Yi Zou
    Qiyan Fu
    Frontiers of Computer Science, 2023, 17
  • [15] Advanced dual-input artificial optical synapse for recognition and generative neural network
    Liu, Zhengjun
    Fang, Yuxiao
    Cai, Zhaohui
    Liu, Yijun
    Dong, Ziling
    Zheng, Renming
    Shen, Zongjie
    Wu, Rui
    Qu, Wenjing
    Fu, Jufei
    Ru, Changhai
    Wu, Ye
    Gu, Jiangmin
    Liu, Yina
    Liu, Qing
    Zhao, Chun
    Wen, Zhen
    NANO ENERGY, 2024, 132
  • [16] Classification Patient-Ventilator Asynchrony with Dual-Input Convolutional Neural Network
    Chong, Thern Chang
    Loo, Nien Loong
    Chiew, Yeong Shiong
    Mat-Nor, Mohd Basri
    Ralib, Azrina Md
    IFAC PAPERSONLINE, 2021, 54 (15): : 322 - 327
  • [17] Dual-input attention network for automatic identification of detritus from river sands
    Ge, Shiping
    Wang, Cong
    Jiang, Zhiwei
    Hao, Huizhen
    Gu, Qing
    COMPUTERS & GEOSCIENCES, 2021, 151
  • [18] HIERARCHICAL NETWORK BASED ON THE FUSION OF STATIC AND DYNAMIC FEATURES FOR SPEECH EMOTION RECOGNITION
    Cao, Qi
    Hou, Mixiao
    Chen, Bingzhi
    Zhang, Zheng
    Lu, Guangming
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 6334 - 6338
  • [19] The auto segmentation for cardiac structures using a dual-input deep learning network based on vision saliency and transformer
    Wang, Jing
    Wang, Shuyu
    Liang, Wei
    Zhang, Nan
    Zhang, Yan
    JOURNAL OF APPLIED CLINICAL MEDICAL PHYSICS, 2022, 23 (05):
  • [20] REMAINING USEFUL LIFE PREDICTION OF WIND TURBINE BEARINGS BASED ON DUAL-INPUT DEEP CONVOLUTIONAL NEURAL NETWORK
    Liu J.
    Su Y.
    Chen C.
    Taiyangneng Xuebao/Acta Energiae Solaris Sinica, 2023, 44 (12): : 238 - 250