An analysis of data fusion methods for speaker verification

被引:0
|
作者
Farrell, KR [1 ]
Ramachandran, RP [1 ]
Mammone, RJ [1 ]
机构
[1] T NETIX SpeakEZ Inc, Englewood, CO 80112 USA
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, we analyze the diversity of information as provided by several modeling approaches for speaker verification. This information is used to facilitate the fusion of the individual results into an overall result that provides advantages in accuracy over the individual models. The modeling methods that are evaluated consist of the neural tree network (NTN), Gaussian mixture model (GMM), hidden Markov model (HMM), and dynastic time warping (DTW). With the exception of DTW, all methods utilize subword-based approaches. The phrase-level scores for each modeling approach are used for combination, Several data fusion methods are evaluated for combining the model results, including the linear and log opinion pool approaches along with voting. The results of the above analysis have been integrated into a system that has been tested with several databases collected within landline and cellular environments. We have found the Linear and log opinion pool methods to consistently reduce the error rate from that obtained when the models are need individually.
引用
收藏
页码:1129 / 1132
页数:4
相关论文
共 50 条
  • [41] Simplified factor analysis in speaker verification
    Guo, Wu
    Li, Yijie
    Dai, Lirong
    2008 INTERNATIONAL CONFERENCE ON AUDIO, LANGUAGE AND IMAGE PROCESSING, VOLS 1 AND 2, PROCEEDINGS, 2008, : 1316 - 1319
  • [42] Speaker-Aware Linear Discriminant Analysis in Speaker Verification
    Zheng, Naijun
    Wu, Xixin
    Zhong, Jinghua
    Liu, Xunying
    Meng, Helen
    INTERSPEECH 2020, 2020, : 3012 - 3016
  • [43] Multi-Source Domain Adaptation and Fusion for Speaker Verification
    Zhu, Donghui
    Chen, Ning
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 30 : 2103 - 2116
  • [44] Robust speaker verification via fusion of speech and lip modalities
    Wark, T.
    Sridharan, S.
    Chandran, V.
    ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 1999, 6 : 3061 - 3064
  • [45] Fusion of MFCC and MPEG-7 attributes for speaker verification
    Altincay, Hakan
    Ergun, Cem
    Ciloglu, Tolga
    2006 IEEE 14th Signal Processing and Communications Applications, Vols 1 and 2, 2006, : 551 - 554
  • [46] FUSION OF SPECTRAL FEATURE SETS FOR IMPROVING SPEAKER VERIFICATION PERFORMANCES
    Rastoceanu, Florin
    Lazar, Marilena
    17TH INTERNATIONAL CONFERENCE - THE KNOWLEDGE-BASED ORGANIZATION: APPLIED TECHNICAL SCIENCES AND ADVANCED MILITARY TECHNOLOGIES, CONFERENCE PROCEEDING 3, 2011, : 338 - 343
  • [47] Sample-specific late classifier fusion for speaker verification
    Hasheminejad, Mohammad
    Farsi, Hassan
    MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (12) : 15273 - 15289
  • [48] Countermeasures for Automatic Speaker Verification Replay Spoofing Attack : On Data Augmentation, Feature Representation, Classification and Fusion
    Cai, Weicheng
    Cai, Danwei
    Liu, Wenbo
    Li, Gang
    Li, Ming
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 17 - 21
  • [49] Robust speaker verification via fusion of speech and lip modalities
    Wark, T
    Sridharan, S
    Chandran, V
    ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI, 1999, : 3061 - 3064
  • [50] Progressive channel fusion for more efficient TDNN on speaker verification
    Zhao, Zhenduo
    Li, Zhuo
    Wang, Wenchao
    Xu, Ji
    SPEECH COMMUNICATION, 2024, 163