ASVtorch toolkit: Speaker verification with deep neural networks

Cited by: 3
Authors
Lee, Kong Aik [1]
Vestman, Ville [2]
Kinnunen, Tomi [2]
Affiliations
[1] ASTAR, Inst Infocomm Res, Singapore, Singapore
[2] Univ Eastern Finland, Computat Speech Grp, Joensuu, Finland
Funding
Academy of Finland
Keywords
Speaker recognition; PyTorch; Deep learning; RECOGNITION;
DOI
10.1016/j.softx.2021.100697
Chinese Library Classification (CLC) number
TP31 [Computer software]
Discipline classification codes
081202; 0835
Abstract
The human voice differs substantially between individuals. This facilitates automatic speaker verification (ASV): recognizing a person from their voice. ASV accuracy has increased substantially over the past decade due to advances in machine learning, particularly deep learning methods. An unfortunate downside has been a substantial increase in the complexity of ASV systems. To help non-experts kick-start reproducible ASV development, a state-of-the-art toolkit implementing various ASV pipelines and functionalities is required. To this end, we introduce a new open-source toolkit, ASVtorch, implemented in Python using the widely used PyTorch machine learning framework. © 2021 The Author(s). Published by Elsevier B.V.
Pages: 6
Related papers
  • [1] A Deep Neural Networks Approach for Speaker Verification on Embedded Devices
    Do-Duc, Hao
    Van-Khai, Nguyen
    Chau-Thanh, Duc
    RECENT CHALLENGES IN INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2024, PT I, 2024, 2144 : 27 - 38
  • [2] Deep neural networks for speaker verification with short speech utterances
    Yang, Il-Ho
    Heo, Hee-Soo
    Yoon, Sung-Hyun
    Yu, Ha-Jin
    JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2016, 35 (06): : 501 - 509
  • [3] Channel adaptation based on deep neural networks for speaker verification
    Long Y.
    Ni J.
    Ye H.
    2016, Sichuan University (48): : 151 - 155
  • [4] MODELLING SPEAKER AND CHANNEL VARIABILITY USING DEEP NEURAL NETWORKS FOR ROBUST SPEAKER VERIFICATION
    Bhattacharya, Gautam
    Alam, Jahangir
    Kenny, Patrick
    Gupta, Vishwa
    2016 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2016), 2016, : 192 - 198
  • [5] STUDY ON THE TEMPORAL POOLING USED IN DEEP NEURAL NETWORKS FOR SPEAKER VERIFICATION
    Rouvier, Mickael
    Bousquet, Pierre-Michel
    Duret, Jarod
    29TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2021), 2021, : 501 - 505
  • [6] Investigation of Bottleneck Features and Multilingual Deep Neural Networks for Speaker Verification
    Tian, Yao
    Cai, Meng
    He, Liang
    Liu, Jia
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 1151 - 1155
  • [7] DEEP NEURAL NETWORKS FOR SMALL FOOTPRINT TEXT-DEPENDENT SPEAKER VERIFICATION
    Variani, Ehsan
    Lei, Xin
    McDermott, Erik
    Moreno, Ignacio Lopez
    Gonzalez-Dominguez, Javier
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [8] Improving Deep Neural Networks Based Speaker Verification Using Unlabeled Data
    Tian, Yao
    Cai, Meng
    He, Liang
    Zhang, Wei-Qiang
    Liu, Jia
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 1863 - 1867
  • [9] SNR-Invariant Multitask Deep Neural Networks for Robust Speaker Verification
    Yao, Qi
    Mak, Man-Wai
    IEEE SIGNAL PROCESSING LETTERS, 2018, 25 (11) : 1670 - 1674
  • [10] Speaker verification using committee neural networks
    Reddy, NP
    Butch, OA
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2003, 72 (02) : 109 - 115