Approaches for Out-of-Domain Adaptation to Improve Speaker Recognition Performance

被引:0
|
作者
Shulipa, Andrey [1 ]
Novoselov, Sergey [1 ,2 ]
Melnikov, Aleksandr [2 ]
机构
[1] ITMO Univ, St Petersburg, Russia
[2] Speech Technol Ctr, St Petersburg, Russia
来源
SPEECH AND COMPUTER | 2016年 / 9811卷
关键词
Speaker recognition; Domain adaptation; Mismatch conditions;
D O I
10.1007/978-3-319-43958-7_14
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In last years satisfactory performance of speaker recognition (SR) systems have been achieved in evaluations provided by NIST. It was possible due to using large datasets to train system parameters and accurate speaker variability modeling. In such a cases test and train conditions are similar and it ensures good performance for the evaluations. However in practical applications when training and testing conditions are different the problem of mismatching of the optimal SR system parameters occurs. It is the main problem in the deployment of the real application systems. It leads to reducing SR systems effectiveness. This paper investigates discriminative and generative approaches for the adaptation of the parameters of the speaker recognition systems and proposes effective solutions to improve their performance.
引用
收藏
页码:124 / 130
页数:7
相关论文
共 50 条
  • [1] IN-DOMAIN AND OUT-OF-DOMAIN DATA AUGMENTATION TO IMPROVE CHILDREN'S SPEAKER VERIFICATION SYSTEM IN LIMITED DATA SCENARIO
    Shahnawazuddin, S.
    Ahmad, Waquar
    Adiga, Nagaraj
    Kumar, Avinash
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 7554 - 7558
  • [2] Improving Speaker Verification Performance in Presence of Spoofing Attacks Using Out-of-Domain Spoofed Data
    Sarkar, Achintya Kr.
    Sahidullah, Md.
    Tan, Zheng-Hua
    Kinnunen, Tomi
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 2611 - 2615
  • [3] Using out-of-domain data to improve on-domain language models
    Iyer, R
    Ostendorf, M
    Gish, H
    IEEE SIGNAL PROCESSING LETTERS, 1997, 4 (08) : 221 - 223
  • [4] Bilevel Online Adaptation for Out-of-Domain Human Mesh Reconstruction
    Guan, Shanyan
    Xu, Jingwei
    Wang, Yunbo
    Ni, Bingbing
    Yang, Xiaokang
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 10467 - 10476
  • [5] On robustness of unsupervised domain adaptation for speaker recognition
    Bousquet, Pierre-Michel
    Rouvier, Mickael
    INTERSPEECH 2019, 2019, : 2958 - 2962
  • [6] DOMAIN AND SPEAKER ADAPTATION FOR CORTANA SPEECH RECOGNITION
    Zhao, Yong
    Li, Jinyu
    Zhang, Shixiong
    Chen, Liping
    Gong, Yifan
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5984 - 5988
  • [7] A simple baseline for domain generalization of action recognition and a realistic out-of-domain scenario
    Kim, Hyungmin
    Jeon, Hobeum
    Kim, Dohyung
    Kim, Jaehong
    2023 20TH INTERNATIONAL CONFERENCE ON UBIQUITOUS ROBOTS, UR, 2023, : 515 - 520
  • [8] Confidence sharing adaptation for out-of-domain human pose and shape estimation
    Yue, Tianyi
    Ren, Keyan
    Shi, Yu
    Zhao, Hu
    Bian, Qingyun
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2024, 246
  • [9] On Calibration and Out-of-domain Generalization
    Wald, Yoav
    Feder, Amir
    Greenfeld, Daniel
    Shalit, Uri
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [10] Combining in-domain and out-of-domain speech data for automatic recognition of disordered speech
    Christensen, H.
    Aniol, M. B.
    Bell, P.
    Green, P.
    Hain, T.
    King, S.
    Swietojanski, P.
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 3609 - 3612