ASVtorch toolkit: Speaker verification with deep neural networks

被引：3

作者：

Lee, Kong Aik ^{[1
]}

Vestman, Ville ^{[2
]}

Kinnunen, Tomi ^{[2
]}

机构：

[1] ASTAR, Inst Infocomm Res, Singapore, Singapore

[2] Univ Eastern Finland, Computat Speech Grp, Joensuu, Finland

来源：

SOFTWAREX | 2021年 / 14卷

基金：

芬兰科学院;

关键词：

Speaker recognition; PyTorch; Deep learning; RECOGNITION;

D O I：

10.1016/j.softx.2021.100697

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

The human voice differs substantially between individuals. This facilitates automatic speaker verification (ASV) - recognizing a person from his/her voice. ASV accuracy has substantially increased throughout the past decade due to recent advances in machine learning, particularly deep learning methods. An unfortunate downside has been substantially increased complexity of ASV systems. To help non experts to kick-start reproducible ASV development, a state-of-the-art toolkit implementing various ASV pipelines and functionalities is required. To this end, we introduce a new open-source toolkit, ASVtorch, implemented in Python using the widely used PyTorch machine learning framework. (C) 2021 The Author(s). Published by Elsevier B.V.

引用

页数：6

共 50 条

[1] A Deep Neural Networks Approach for Speaker Verification on Embedded Devices
Do-Duc, Hao
Van-Khai, Nguyen
Chau-Thanh, Duc
RECENT CHALLENGES IN INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2024, PT I, 2024, 2144 : 27 - 38
[2] Deep neural networks for speaker verification with short speech utterances
Yang, Il-Ho
Heo, Hee-Soo
Yoon, Sung-Hyun
Yu, Ha-Jin
JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2016, 35 (06): : 501 - 509
[3] Channel adaptation based on deep neural networks for speaker verification
Long Y.
Ni J.
Ye H.
2016, Sichuan University (48): : 151 - 155
[4] MODELLING SPEAKER AND CHANNEL VARIABILITY USING DEEP NEURAL NETWORKS FOR ROBUST SPEAKER VERIFICATION
Bhattacharya, Gautam
Alam, Jahangir
Kenny, Patrick
Gupta, Vishwa
2016 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2016), 2016, : 192 - 198
[5] STUDY ON THE TEMPORAL POOLING USED IN DEEP NEURAL NETWORKS FOR SPEAKER VERIFICATION
Rouvier, Mickael
Bousquet, Pierre-Michel
Duret, Jarod
29TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2021), 2021, : 501 - 505
[6] Investigation of Bottleneck Features and Multilingual Deep Neural Networks for Speaker Verification
Tian, Yao
Cai, Meng
He, Liang
Liu, Jia
16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 1151 - 1155
[7] DEEP NEURAL NETWORKS FOR SMALL FOOTPRINT TEXT-DEPENDENT SPEAKER VERIFICATION
Variani, Ehsan
Lei, Xin
McDermott, Erik
Moreno, Ignacio Lopez
Gonzalez-Dominguez, Javier
2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
[8] Improving Deep Neural Networks Based Speaker Verification Using Unlabeled Data
Tian, Yao
Cai, Meng
He, Liang
Zhang, Wei-Qiang
Liu, Jia
17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 1863 - 1867
[9] SNR-Invariant Multitask Deep Neural Networks for Robust Speaker Verification
Yao, Qi
Mak, Man-Wai
IEEE SIGNAL PROCESSING LETTERS, 2018, 25 (11) : 1670 - 1674
[10] Speaker verification using committee neural networks
Reddy, NP
Butch, OA
COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2003, 72 (02) : 109 - 115

← 1 2 3 4 5 →