Approaches for Out-of-Domain Adaptation to Improve Speaker Recognition Performance

被引:0
|
作者
Shulipa, Andrey [1 ]
Novoselov, Sergey [1 ,2 ]
Melnikov, Aleksandr [2 ]
机构
[1] ITMO Univ, St Petersburg, Russia
[2] Speech Technol Ctr, St Petersburg, Russia
来源
SPEECH AND COMPUTER | 2016年 / 9811卷
关键词
Speaker recognition; Domain adaptation; Mismatch conditions;
D O I
10.1007/978-3-319-43958-7_14
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In last years satisfactory performance of speaker recognition (SR) systems have been achieved in evaluations provided by NIST. It was possible due to using large datasets to train system parameters and accurate speaker variability modeling. In such a cases test and train conditions are similar and it ensures good performance for the evaluations. However in practical applications when training and testing conditions are different the problem of mismatching of the optimal SR system parameters occurs. It is the main problem in the deployment of the real application systems. It leads to reducing SR systems effectiveness. This paper investigates discriminative and generative approaches for the adaptation of the parameters of the speaker recognition systems and proposes effective solutions to improve their performance.
引用
收藏
页码:124 / 130
页数:7
相关论文
共 50 条
  • [31] Speaker to Emotion: Domain Adaptation for Speech Emotion Recognition with Residual Adapters
    Xi, Yuxuan
    Li, Pengcheng
    Song, Yan
    Jiang, Yiheng
    Dai, Lirong
    2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2019, : 513 - 518
  • [32] THE CORAL plus plus ALGORITHM FOR UNSUPERVISED DOMAIN ADAPTATION OF SPEAKER RECOGNITION
    Li, Rongjin
    Zhang, Weibin
    Chen, Dongpeng
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 7172 - 7176
  • [33] SUPERVISED DOMAIN ADAPTATION FOR I-VECTOR BASED SPEAKER RECOGNITION
    Garcia-Romero, Daniel
    McCree, Alan
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [34] An Empirical Study on Explanations in Out-of-Domain Settings
    Chrysostomou, George
    Aletras, Nikolaos
    PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), 2022, : 6920 - 6938
  • [35] Out-of-domain FrameNet Semantic Role Labeling
    Hartmann, Silvana
    Kuznetsov, Ilia
    Martin, Teresa
    Gurevych, Iryna
    15TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EACL 2017), VOL 1: LONG PAPERS, 2017, : 471 - 482
  • [36] Dialogue Act Recognition for Chinese Out-of-Domain Utterances using Hybrid CNN-RF
    Wang, Jundong
    Huang, Peijie
    Huang, Qiangjia
    Ke, Zixuan
    Lin, Piyuan
    PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP), 2016, : 14 - 17
  • [37] Comparing CTC and LFMMI for out-of-domain adaptation of wav2vec 2.0 acoustic model
    Vyas, Apoorv
    Madikeri, Srikanth
    Bourlard, Herve
    INTERSPEECH 2021, 2021, : 2861 - 2865
  • [38] Practical and Efficient Out-of-Domain Detection with Adversarial Learning
    Wang, Bo
    Mine, Tsunenori
    37TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING, 2022, : 853 - 862
  • [39] Efficient Out-of-Domain Detection for Sequence to Sequence Models
    Vazhentsev, Artem
    Tsvigun, Akim
    Vashurin, Roman
    Petrakov, Sergey
    Vasilev, Daniil
    Panov, Maxim
    Panchenko, Alexander
    Shelmanov, Artem
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, 2023, : 1430 - 1454
  • [40] The effects of appetitive stimuli on out-of-domain consumption impatience
    Li, Xiuping
    JOURNAL OF CONSUMER RESEARCH, 2008, 34 (05) : 649 - 656