Novel Demodulation-Based Features using Classifier-level Fusion of GMM and CNN for Replay Detection

被引:0
|
作者
Kamble, Madhu R. [1 ]
Tak, Hemlata [1 ]
Krishna, Maddala V. Siva [2 ]
Patil, Hemant A. [1 ]
机构
[1] DA IICT, Speech Res Lab, Gandhinagar, Gujarat, India
[2] IIIT, Vadodara, Gujarat, India
关键词
Automatic speaker verification; spoof; replay; demodulation techniques; convolutional neural network; SPEAKER VERIFICATION; INSTANTANEOUS FREQUENCY; ENERGY SEPARATION; COUNTERMEASURES;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this study, we explore the use of Convolutional Neural Networks (CNN) for replay spoof detection in Automatic Speaker Verification (ASV) system. The Amplitude and Frequency Modulation (AM-FM) feature sets obtained from the Hilbert transform (HT) and Energy Separation Algorithm (ESA) are used as the front end. We have observed the effect of max-pooling and fully connected (FC) layers, when replaced with the convolutional layers in CNN. The results are compared with Gaussian Mixture Model (GMM) classifier, furthermore to obtain the possible complementary information of both the GMM and CNN classifiers, we have explored classifier-level fusion. In addition, we compared our results with Constant-Q Cepstral Coefficients (CQCC) and Mel Frequency Cepstral Coefficients (MFCC) feature sets. The architecture with max-pooling when replaced with convolutional layer along with FC layers had performed relatively better on most of the AM-FM feature sets compared to other CNNs. The ESA-based AM features (i.e., Instantaneous Amplitude Cosine Coefficients (ESA-IACC)) performed better as AM do not have more fluctuation as FM have during models training. The lower EER is obtained with classifier-level fusion of ESA-IACC feature set resulting in 2.54 % EER on development set and 6.04 % on evaluation set of ASVspoof 2017 Challenge database.
引用
收藏
页码:334 / 338
页数:5
相关论文
共 50 条
  • [41] AudioMask: Robust Sound Event Detection Using Mask R-CNN and Frame-Level Classifier
    Nasiri, Alireza
    Cui, Yuxin
    Liu, Zhonghao
    Jin, Jing
    Zhao, Yong
    Hu, Jianjun
    2019 IEEE 31ST INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2019), 2019, : 485 - 492
  • [42] Phoneme Independent Pathological Voice Detection Using Wavelet Based MFCCs, GMM-SVM Hybrid Classifier
    Vikram, C. M.
    Umarani, K.
    2013 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2013, : 929 - 934
  • [43] Replay Attack Detection Using Linear Prediction Analysis-Based Relative Phase Features
    Phapatanaburi, Khomdet
    Wang, Longbiao
    Nakagawa, Seiichi
    Iwahashi, Masahiro
    IEEE ACCESS, 2019, 7 : 183614 - 183625
  • [44] A novel approach to enhanced fall detection using STFT and magnitude features with CNN autoencoder
    Tomorn Soontornnapar
    Tuchsanai Ploysuwan
    Neural Computing and Applications, 2025, 37 (6) : 4229 - 4245
  • [45] Underwater Cage Boundary Detection Based on GLCM Features by Using SVM Classifier
    Shi, Xiaoting
    Huang, Hai
    Wang, Bo
    Pang, Shuo
    Qin, Hongde
    2019 IEEE/ASME INTERNATIONAL CONFERENCE ON ADVANCED INTELLIGENT MECHATRONICS (AIM), 2019, : 1169 - 1174
  • [46] DETECTION AND CLASSIFICATION OF COVID-19 USING GRAY-LEVEL FEATURES AND ENSEMBLE CLASSIFIER
    Patnaik, Vijaya
    Mohanty, Monalisa
    Subudhi, Asit Kumar
    BIOMEDICAL ENGINEERING-APPLICATIONS BASIS COMMUNICATIONS, 2024,
  • [47] Detection of Highway Pavement Damage Based on a CNN Using Grayscale and HOG Features
    Chen, Guo-Hong
    Ni, Jie
    Chen, Zhuo
    Huang, Hao
    Sun, Yun-Lei
    Ip, Wai Hung
    Yung, Kai Leung
    SENSORS, 2022, 22 (07)
  • [48] Human Action Recognition in Video Sequence using Logistic Regression by Features Fusion Approach based on CNN Features
    Ahmad, Tariq
    Wu, Jinsong
    Khan, Imran
    Rahim, Asif
    Khan, Amjad
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2021, 12 (11) : 18 - 25
  • [49] HUMAN FALL DETECTION USING SEGMENT-LEVEL CNN FEATURES AND SPARSE DICTIONARY LEARNING
    Ge, Chenjie
    Gu, Irene Yu-Hua
    Yang, Jie
    2017 IEEE 27TH INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING, 2017,
  • [50] A NOVEL CNN SEGMENTATION FRAMEWORK BASED ON USING NEW SHAPE AND APPEARANCE FEATURES
    Soliman, Ahmed
    Shaffie, Ahmed
    Ghazal, Mohammed
    Gimel'farb, Georgy
    Keynton, Robert
    El-Baz, Ayman
    2018 25TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2018, : 3488 - 3492