Approaches for Out-of-Domain Adaptation to Improve Speaker Recognition Performance

被引:0
|
作者
Shulipa, Andrey [1 ]
Novoselov, Sergey [1 ,2 ]
Melnikov, Aleksandr [2 ]
机构
[1] ITMO Univ, St Petersburg, Russia
[2] Speech Technol Ctr, St Petersburg, Russia
来源
SPEECH AND COMPUTER | 2016年 / 9811卷
关键词
Speaker recognition; Domain adaptation; Mismatch conditions;
D O I
10.1007/978-3-319-43958-7_14
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In last years satisfactory performance of speaker recognition (SR) systems have been achieved in evaluations provided by NIST. It was possible due to using large datasets to train system parameters and accurate speaker variability modeling. In such a cases test and train conditions are similar and it ensures good performance for the evaluations. However in practical applications when training and testing conditions are different the problem of mismatching of the optimal SR system parameters occurs. It is the main problem in the deployment of the real application systems. It leads to reducing SR systems effectiveness. This paper investigates discriminative and generative approaches for the adaptation of the parameters of the speaker recognition systems and proposes effective solutions to improve their performance.
引用
收藏
页码:124 / 130
页数:7
相关论文
共 50 条
  • [21] Automatic speaker verification system for dysarthric speakers using prosodic features and out-of-domain data augmentation
    Salim, Shinimol
    Shahnawazuddin, Syed
    Ahmad, Waquar
    APPLIED ACOUSTICS, 2023, 210
  • [22] Certifying Out-of-Domain Generalization for Blackbox Functions
    Weber, Maurice
    Li, Linyi
    Wang, Boxin
    Zhao, Zhikuan
    Li, Bo
    Zhang, Ce
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [23] Rewriting a Generative Model with Out-of-Domain Patterns
    Gao, Panpan
    Sun, Hanxu
    Chen, Gang
    Li, Minggang
    ELECTRONICS, 2025, 14 (04):
  • [24] IMPROVING CONFIDENCE ESTIMATION ON OUT-OF-DOMAIN DATA FOR END-TO-END SPEECH RECOGNITION
    Li, Qiujia
    Zhang, Yu
    Qiu, David
    He, Yanzhang
    Cao, Liangliang
    Woodland, Philip C.
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 6537 - 6541
  • [25] Specialist management and coordination of "Out-of-domain care"
    Koopman, RJ
    May, KM
    FAMILY MEDICINE, 2004, 36 (01) : 46 - 50
  • [26] Out-of-Domain Evaluation of Finnish Dependency Parsing
    Kanerva, Jenna
    Ginter, Filip
    LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 1114 - 1124
  • [27] Employing Phonetic Information in DNN Speaker Embeddings to Improve Speaker Recognition Performance
    Rahman, Md Hafizur
    Himawan, Ivan
    Mclaren, Mitchell
    Fookes, Clinton
    Sridharan, Sridha
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 3593 - 3597
  • [28] Evaluation of Domain Adaptation Approaches to Improve the Translation Quality
    Yildirim, Ezgi
    Tantug, Ahmet Cuneyd
    NEW TRENDS IN COMPUTATIONAL COLLECTIVE INTELLIGENCE, 2015, 572 : 15 - 26
  • [29] A Simple Unsupervised Knowledge-Free Domain Adaptation for Speaker Recognition
    Lin, Wan
    Li, Lantian
    Wang, Dong
    APPLIED SCIENCES-BASEL, 2024, 14 (03):
  • [30] CYCLE-GANS FOR DOMAIN ADAPTATION OF ACOUSTIC FEATURES FOR SPEAKER RECOGNITION
    Nidadavolu, Phani Sankar
    Villalba, Jesus
    Dehak, Najim
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 6206 - 6210