Sub-word speaker verification using data fusion methods

被引:2
|
作者
Farrell, KR
Ramachandran, RP
Sharma, M
Mammone, RJ
机构
关键词
D O I
10.1109/NNSP.1997.622435
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Speaker verification is a rapidly maturing technology that is becoming available for commercial applications. In this paper, we investigate the application of data fusion methods to sub-word implementations of speaker verification. At a sub-word level, we utilize the diversity of the information provided by the neural tree network and Gaussian mixture model to provide a more robust sub-word model. The phrase-level scores for each modeling approach are obtained and then combined. The data fusion method we use for combining the model scores is the linear opinion pool. In addition to using the diversity of the model scores, we also apply the concept of redundancy by using a leave-one-out approach to partition the input data. This allows us to generate several models and accommodate the small training sample issues imposed by our specific applications. The theoretical results of the above analysis have been integrated into a system that has been tested with several databases that were collected within landline and cellular environments. These results are included in this paper. We have found that the proper data fusion techniques will typically reduce the error rate by a factor of two.
引用
收藏
页码:531 / 540
页数:10
相关论文
共 50 条
  • [11] A neural network for 500 vocabulary word spotting using acoustic sub-word units
    Yu, HJ
    Oh, YH
    1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 3277 - 3280
  • [12] New Approach for Jawi Sub-word Segmentation using Histogram Projection
    Saddami, Khairun
    Munadi, Khairul
    Arnia, Fitri
    PROCEEDINGS OF THE 2016 IEEE REGION 10 CONFERENCE (TENCON), 2016, : 1619 - 1623
  • [13] Optimal Matrix Computing Using Vector Division with Sub-word Parallel
    Gan, Xin-Biao
    Dai, Kui
    Shen, Li
    Wang, Zhi-Ying
    INTERNATIONAL SYMPOSIUM ON UBIQUITOUS MULTIMEDIA COMPUTING, PROCEEDINGS, 2008, : 3 - 6
  • [14] Sub-word parallelism in digital signal processing
    Fridman, J
    IEEE SIGNAL PROCESSING MAGAZINE, 2000, 17 (02) : 27 - 35
  • [15] Systematic design of programs with sub-word parallelism
    Schaffer, R
    Merker, R
    Catthoor, F
    PAR ELEC 2002: INTERNATIONAL CONFERENCE ON PARALLEL COMPUTING IN ELECTRICAL ENGINEERING, 2002, : 393 - 398
  • [16] Sub-word Language Modeling for Russian LVCSR
    Zablotskiy, Sergey
    Minker, Wolfgang
    SPEECH AND COMPUTER (SPECOM 2015), 2015, 9319 : 413 - 421
  • [17] Combined approach to dysarthric speaker verification using data augmentation and feature fusion
    Salim, Shinimol
    Shahnawazuddin, Syed
    Ahmad, Waquar
    SPEECH COMMUNICATION, 2024, 160
  • [18] Sub-word Based Offline Handwritten Farsi Word Recognition Using Recurrent Neural Network
    Ghadikolaie, Mohammad Fazel Younessy
    Kabir, Ehsanolah
    Razzazi, Farbod
    ETRI JOURNAL, 2016, 38 (04) : 703 - 713
  • [19] A neural network using acoustic sub-word units for continuous speech recognition
    Yu, HJ
    Oh, YH
    ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 506 - 509
  • [20] Sub-word Image Clustering in Farsi Printed Books
    Soheili, Mohammad Reza
    Kabir, Ehsanollah
    Stricker, Didier
    SEVENTH INTERNATIONAL CONFERENCE ON MACHINE VISION (ICMV 2014), 2015, 9445