Sub-word speaker verification using data fusion methods

Cited by: 2
Authors
Farrell, KR
Ramachandran, RP
Sharma, M
Mammone, RJ
DOI
10.1109/NNSP.1997.622435
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Speaker verification is a rapidly maturing technology that is becoming available for commercial applications. In this paper, we investigate the application of data fusion methods to sub-word implementations of speaker verification. At a sub-word level, we utilize the diversity of the information provided by the neural tree network and Gaussian mixture model to provide a more robust sub-word model. The phrase-level scores for each modeling approach are obtained and then combined. The data fusion method we use for combining the model scores is the linear opinion pool. In addition to using the diversity of the model scores, we also apply the concept of redundancy by using a leave-one-out approach to partition the input data. This allows us to generate several models and accommodate the small training sample issues imposed by our specific applications. The theoretical results of the above analysis have been integrated into a system that has been tested with several databases that were collected within landline and cellular environments. These results are included in this paper. We have found that the proper data fusion techniques will typically reduce the error rate by a factor of two.
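The two ideas named in the abstract, the linear opinion pool and leave-one-out partitioning, can be illustrated with a minimal Python sketch. This is not the authors' implementation. It assumes each model emits a phrase-level score in [0, 1] and that the pool weights are non-negative and sum to one. The abstract gives neither the weights nor how the leave-one-out models are combined, so the function names `linear_opinion_pool` and `leave_one_out_partitions` and all numeric values below are hypothetical.

```python
from typing import List, Sequence, Tuple


def linear_opinion_pool(scores: Sequence[float], weights: Sequence[float]) -> float:
    """Combine model scores as a weighted sum (linear opinion pool).

    Assumption: weights are non-negative and sum to 1, so the fused
    score stays in the same range as the input scores.
    """
    if len(scores) != len(weights):
        raise ValueError("scores and weights must have the same length")
    if any(w < 0 for w in weights) or abs(sum(weights) - 1.0) > 1e-9:
        raise ValueError("weights must be non-negative and sum to 1")
    return sum(w * s for w, s in zip(weights, scores))


def leave_one_out_partitions(items: Sequence[str]) -> List[Tuple[List[str], str]]:
    """Return one (training subset, held-out item) pair per input item."""
    data = list(items)
    return [(data[:i] + data[i + 1:], data[i]) for i in range(len(data))]


if __name__ == "__main__":
    # Hypothetical phrase-level scores from the two model types.
    ntn_score, gmm_score = 0.8, 0.6
    fused = linear_opinion_pool([ntn_score, gmm_score], [0.5, 0.5])
    print(f"fused score: {fused:.3f}")

    # Hypothetical enrollment utterances; each partition trains one model.
    for train, held_out in leave_one_out_partitions(["utt1", "utt2", "utt3"]):
        print(f"train on {train}, hold out {held_out}")
```

With N enrollment utterances, the leave-one-out step yields N training subsets of size N - 1, which is one way to obtain several models from a small training sample, as the abstract describes.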
Pages: 531-540
Number of pages: 10