An analysis of data fusion methods for speaker verification

被引:0
|
作者
Farrell, KR [1 ]
Ramachandran, RP [1 ]
Mammone, RJ [1 ]
机构
[1] T NETIX SpeakEZ Inc, Englewood, CO 80112 USA
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, we analyze the diversity of information as provided by several modeling approaches for speaker verification. This information is used to facilitate the fusion of the individual results into an overall result that provides advantages in accuracy over the individual models. The modeling methods that are evaluated consist of the neural tree network (NTN), Gaussian mixture model (GMM), hidden Markov model (HMM), and dynastic time warping (DTW). With the exception of DTW, all methods utilize subword-based approaches. The phrase-level scores for each modeling approach are used for combination, Several data fusion methods are evaluated for combining the model results, including the linear and log opinion pool approaches along with voting. The results of the above analysis have been integrated into a system that has been tested with several databases collected within landline and cellular environments. We have found the Linear and log opinion pool methods to consistently reduce the error rate from that obtained when the models are need individually.
引用
收藏
页码:1129 / 1132
页数:4
相关论文
共 50 条
  • [21] i-Vector-Based Speaker Verification on Limited Data Using Fusion Techniques
    Kumari, T. R. Jayanthi
    Jayanna, H. S.
    JOURNAL OF INTELLIGENT SYSTEMS, 2020, 29 (01) : 565 - 582
  • [22] Speaker Verification with the Constraint of Limited Data
    Kumari, Thyamagondlu Renukamurthy Jayanthi
    Jayanna, Haradagere Siddaramaiah
    JOURNAL OF INFORMATION PROCESSING SYSTEMS, 2018, 14 (04): : 807 - 823
  • [23] Speaker Verification in Multi-Speaker Environments Using Temporal Feature Fusion
    Aloradi, Ahmad
    Mack, Wolfgang
    Elminshawi, Mohamed
    Habets, EmanuM A. P.
    2022 30TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2022), 2022, : 354 - 358
  • [24] Speaker adaptations in sparse training data for improved speaker verification
    Ahn, S
    Ko, H
    ELECTRONICS LETTERS, 2000, 36 (04) : 371 - 373
  • [25] Confidence based multiple classifier fusion in speaker verification
    Huenupan, Fernando
    Yoma, Nestor Becerra
    Molina, Carlos
    Garreton, Claudio
    PATTERN RECOGNITION LETTERS, 2008, 29 (07) : 957 - 966
  • [26] Speaker verification based on fusion of acoustic and articulatory information
    Li, Ming
    Kim, Jangwon
    Ghosh, Prasanta
    Ramanarayanan, Vikram
    Narayanan, Shrikanth
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 1613 - 1617
  • [27] Comparison of discriminative training methods for speaker verification
    Ma, CY
    Chang, E
    2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING I, 2003, : 192 - 195
  • [28] Exploring kernel discriminant analysis for speaker verification with limited test data
    Das, Rohan Kumar
    Manam, Akhil Babu
    Prasanna, S. R. Mahadeva
    PATTERN RECOGNITION LETTERS, 2017, 98 : 26 - 31
  • [29] Evaluation of kernel methods for speaker verification and identification
    Wan, V
    Renals, S
    2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 669 - 672
  • [30] Fusion of TEO Phase with MFCC Features for Speaker Verification
    Agrawal, Purvi
    Patil, Hemant A.
    PERCEPTION AND MACHINE INTELLIGENCE, 2015, 2015, : 161 - 166