An analysis of data fusion methods for speaker verification

Times cited: 0
Authors
Farrell, KR [1]
Ramachandran, RP [1]
Mammone, RJ [1]
Affiliations
[1] T NETIX SpeakEZ Inc, Englewood, CO 80112 USA
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
O42 [Acoustics];
Discipline classification code
070206 ; 082403 ;
Abstract
In this paper, we analyze the diversity of information provided by several modeling approaches for speaker verification. This information is used to facilitate the fusion of the individual results into an overall result that is more accurate than any of the individual models. The modeling methods evaluated are the neural tree network (NTN), Gaussian mixture model (GMM), hidden Markov model (HMM), and dynamic time warping (DTW). With the exception of DTW, all methods use subword-based approaches. The phrase-level scores for each modeling approach are used for combination. Several data fusion methods are evaluated for combining the model results, including the linear and log opinion pool approaches along with voting. The results of this analysis have been integrated into a system that has been tested with several databases collected in landline and cellular environments. We have found the linear and log opinion pool methods to consistently reduce the error rate relative to that obtained when the models are used individually.
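The three fusion rules named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the equal weights, the 0.5 voting threshold, and the example scores are all hypothetical, and the sketch assumes each model's phrase-level score has already been normalized to a probability-like value in (0, 1].

```python
import math

def linear_opinion_pool(scores, weights):
    """Weighted arithmetic mean of the per-model scores."""
    return sum(w * s for w, s in zip(weights, scores))

def log_opinion_pool(scores, weights):
    """Weighted geometric mean: exp(sum_i w_i * log(s_i)).

    Left unnormalized here; assumes every score is strictly positive.
    """
    return math.exp(sum(w * math.log(s) for w, s in zip(weights, scores)))

def majority_vote(scores, threshold=0.5):
    """Accept only if more than half of the models score above the threshold."""
    accepts = sum(1 for s in scores if s > threshold)
    return 2 * accepts > len(scores)

# Hypothetical normalized scores for one test phrase (NTN, GMM, HMM, DTW).
scores = [0.9, 0.6, 0.4, 0.7]
# Assumed equal weights that sum to 1.
weights = [0.25, 0.25, 0.25, 0.25]

linear_score = linear_opinion_pool(scores, weights)
log_score = log_opinion_pool(scores, weights)
vote_accept = majority_vote(scores)

print("linear pool:", linear_score)
print("log pool:", log_score)
print("majority vote accepts:", vote_accept)
```

With weights that sum to 1, the log opinion pool never exceeds the linear pool (arithmetic-geometric mean inequality), so a single low-scoring model pulls the log pool down more strongly. Voting discards the score magnitudes and keeps only each model's accept/reject decision.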
Pages: 1129-1132
Number of pages: 4