Learnable MFCCs for Speaker Verification

被引:5
|
作者
Liu, Xuechen [1 ,2 ]
Sahidullah, Md [2 ]
Kinnunen, Tomi [1 ]
机构
[1] Univ Eastern Finland, Sch Comp, Joensuu, Finland
[2] Univ Lorraine, CNRS, INRIA, LORIA, F-54000 Nancy, France
基金
芬兰科学院;
关键词
Speaker verification; feature extraction; mel-frequency cesptral coefficients (MFCCs); RECOGNITION; FEATURES;
D O I
10.1109/ISCAS51556.2021.9401593
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
We propose a learnable mel-frequency cepstral coefficients (MFCCs) front-end architecture for deep neural network (DNN) based automatic speaker verification. Our architecture retains the simplicity and interpretability of MFCC-based features while allowing the model to be adapted to data flexibly. In practice, we formulate data-driven version of four linear transforms in a standard MFCC extractor - windowing, discrete Fourier transform (DFT), mel filterbank and discrete cosine transform (DCT). Results reported reach up to 6.7% (VoxCeleb1) and 9.7% (SITW) relative improvement in term of equal error rate (EER) from static MFCCs, without additional tuning effort.
引用
收藏
页数:5
相关论文
共 50 条
  • [41] Dictionary Attacks on Speaker Verification
    Marras, Mirko
    Korus, Pawel
    Jain, Anubhav
    Memon, Nasir
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2023, 18 : 773 - 788
  • [42] Speaker verification for telemedical applications
    Buch, OA
    Reddy, NP
    PROCEEDINGS OF THE 19TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOL 19, PTS 1-6: MAGNIFICENT MILESTONES AND EMERGING OPPORTUNITIES IN MEDICAL ENGINEERING, 1997, 19 : 902 - 903
  • [43] Discriminative Adaptation for Speaker Verification
    Longworth, C.
    Gales, M. J. F.
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1467 - 1470
  • [44] Lightweight Embeddings for Speaker Verification
    Tkachenko, Maxim
    Yamshinin, Alexander
    Kotov, Mikhail
    Nastasenko, Marina
    SPEECH AND COMPUTER (SPECOM 2018), 2018, 11096 : 687 - 696
  • [45] Speaker verification: Part 1
    Biom. Technol. Today, 2006, 6 (9-11):
  • [46] DISCRIMINATIVE AUTOENCODERS FOR SPEAKER VERIFICATION
    Lee, Hung-Shin
    Lu, Yu-Ding
    Hsu, Chin-Cheng
    Tsao, Yu
    Wang, Hsin-Min
    Leng, Shyh-Kang
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 5375 - 5379
  • [47] Speaker verification for multimedia application
    Ciota, Z
    2004 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN & CYBERNETICS, VOLS 1-7, 2004, : 2752 - 2756
  • [48] Prosodic Features for Speaker Verification
    Mary, Leena
    Yegnanarayana, B.
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 917 - 920
  • [49] Ensemble approach in Speaker Verification
    Perera, Leibny Paola Garcia
    Raj, Bhiksha
    Flores, Juan Arturo Nolazco
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 2454 - 2458
  • [50] SPEAKER VERIFICATION FOR ROMANIAN LANGUAGE
    Dumitru, C. O.
    Gavat, Inge
    UNIVERSITY POLITEHNICA OF BUCHAREST SCIENTIFIC BULLETIN SERIES C-ELECTRICAL ENGINEERING AND COMPUTER SCIENCE, 2006, 68 (04): : 81 - 90