Effective speaker adaptations for speaker verification

被引:0
|
作者
Ahn, S [1 ]
Kang, S [1 ]
Ko, H [1 ]
机构
[1] Korea Univ, Dept Elect Engn, Sungbuk Ku, Seoul 136701, South Korea
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper concerns effective speaker adaptation methods to solve the over-training problem in speaker verification, which frequently occurs when modeling a speaker with sparse training data. While various speaker adaptations have already been applied to speech recognition, these methods have not yet been formally considered in speaker verification. This paper proposes speaker adaptation methods using a combination of MAP and MLLR adaptations, which are successfully used in speech recognition, and applies to speaker verification. Our aim is to remedy the small training data problem by investigating effective speaker adaptations for speaker modeling. Experimental results show that the speaker verification system using a weighted MAP and MLLR adaptation outperforms that of the conventional speaker models without adaptation by a factor of up to 5 times. From these results, we show that the speaker adaptation method achieves significantly better performance even when only small training data is available for speaker verification.
引用
收藏
页码:1081 / 1084
页数:4
相关论文
共 50 条
  • [1] Speaker adaptations in sparse training data for improved speaker verification
    Ahn, S
    Ko, H
    ELECTRONICS LETTERS, 2000, 36 (04) : 371 - 373
  • [2] On Deep Speaker Embeddings for Speaker Verification
    Jakubec, Maros
    Jarina, Roman
    Lieskovska, Eva
    Chmulik, Michal
    2021 44TH INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS AND SIGNAL PROCESSING (TSP), 2021, : 162 - 166
  • [3] SPEAKER VERIFICATION
    CHAPMAN, WD
    LI, KP
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1966, 40 (05): : 1282 - &
  • [4] Speaker verification
    Atkins, Wendy
    Biometric Technology Today, 2001, 9 (03) : 8 - 11
  • [5] Deep Speaker Embeddings for Speaker Verification of Children
    Abed, Mohammed Hamzah
    Sztaho, David
    TEXT, SPEECH, AND DIALOGUE, TSD 2024, PT II, 2024, 15049 : 58 - 69
  • [6] Disentangling speaker and channel effects in speaker verification
    Kenny, P
    Dumouchel, P
    2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING, 2004, : 37 - 40
  • [7] DISENTANGLED SPEAKER EMBEDDING FOR ROBUST SPEAKER VERIFICATION
    Yi, Lu
    Mak, Man-Wai
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 7662 - 7666
  • [8] Speaker verification without background speaker models
    Hsu, CN
    Yu, HC
    Yang, BH
    2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL II, PROCEEDINGS: SPEECH II; INDUSTRY TECHNOLOGY TRACKS; DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS; NEURAL NETWORKS FOR SIGNAL PROCESSING, 2003, : 233 - 236
  • [9] Effective speaker verification via dynamic mismatch compensation
    Pillay, S.
    Ariyaeeinia, A.
    Sivakumaran, P.
    Pawlewski, M.
    IET BIOMETRICS, 2012, 1 (02) : 130 - 135
  • [10] An Effective Deep Embedding Learning Architecture for Speaker Verification
    Jiang, Yiheng
    Song, Yan
    McLoughlin, Ian
    Gao, Zhifu
    Dai, Lirong
    INTERSPEECH 2019, 2019, : 4040 - 4044