Sub-word speaker verification using data fusion methods

Cited by: 2

Authors:

Farrell, KR

Ramachandran, RP

Sharma, M

Mammone, RJ


Source:

NEURAL NETWORKS FOR SIGNAL PROCESSING VII | 1997


DOI:

10.1109/NNSP.1997.622435

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Speaker verification is a rapidly maturing technology that is becoming available for commercial applications. In this paper, we investigate the application of data fusion methods to sub-word implementations of speaker verification. At a sub-word level, we utilize the diversity of the information provided by the neural tree network and Gaussian mixture model to provide a more robust sub-word model. The phrase-level scores for each modeling approach are obtained and then combined. The data fusion method we use for combining the model scores is the linear opinion pool. In addition to using the diversity of the model scores, we also apply the concept of redundancy by using a leave-one-out approach to partition the input data. This allows us to generate several models and accommodate the small training sample issues imposed by our specific applications. The theoretical results of the above analysis have been integrated into a system that has been tested with several databases that were collected within landline and cellular environments. These results are included in this paper. We have found that the proper data fusion techniques will typically reduce the error rate by a factor of two.
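The two ideas named in the abstract, the linear opinion pool and leave-one-out partitioning, can be illustrated with a minimal sketch. The function names, the equal weights, the example scores, and the repetition labels below are all assumptions made for illustration; the abstract does not state the weights or score ranges the authors used.

```python
from typing import List, Sequence


def linear_opinion_pool(scores: Sequence[float], weights: Sequence[float]) -> float:
    """Weighted sum of per-model scores; weights are non-negative and sum to 1."""
    if len(scores) != len(weights):
        raise ValueError("scores and weights must have the same length")
    if any(w < 0 for w in weights) or abs(sum(weights) - 1.0) > 1e-9:
        raise ValueError("weights must be non-negative and sum to 1")
    return sum(w * s for w, s in zip(weights, scores))


def leave_one_out_partitions(items: Sequence[str]) -> List[List[str]]:
    """Return one training subset per item, each omitting exactly that item."""
    return [[x for j, x in enumerate(items) if j != i] for i in range(len(items))]


# Hypothetical phrase-level scores from the two model types (assumed values).
ntn_score, gmm_score = 0.80, 0.60
# Equal weights are an assumption, not taken from the paper.
fused = linear_opinion_pool([ntn_score, gmm_score], [0.5, 0.5])

# Hypothetical enrollment repetitions; each subset would train one model.
subsets = leave_one_out_partitions(["rep1", "rep2", "rep3"])
```

With N enrollment repetitions, this partitioning yields N training subsets of N - 1 repetitions each, which is one way to obtain several models from a small training sample.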


Pages: 531-540

Page count: 10

50 entries in total

[11] A neural network for 500 vocabulary word spotting using acoustic sub-word units
Yu, HJ
Oh, YH
1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 3277 - 3280
[12] New Approach for Jawi Sub-word Segmentation using Histogram Projection
Saddami, Khairun
Munadi, Khairul
Arnia, Fitri
PROCEEDINGS OF THE 2016 IEEE REGION 10 CONFERENCE (TENCON), 2016, : 1619 - 1623
[13] Optimal Matrix Computing Using Vector Division with Sub-word Parallel
Gan, Xin-Biao
Dai, Kui
Shen, Li
Wang, Zhi-Ying
INTERNATIONAL SYMPOSIUM ON UBIQUITOUS MULTIMEDIA COMPUTING, PROCEEDINGS, 2008, : 3 - 6
[14] Sub-word parallelism in digital signal processing
Fridman, J
IEEE SIGNAL PROCESSING MAGAZINE, 2000, 17 (02) : 27 - 35
[15] Systematic design of programs with sub-word parallelism
Schaffer, R
Merker, R
Catthoor, F
PAR ELEC 2002: INTERNATIONAL CONFERENCE ON PARALLEL COMPUTING IN ELECTRICAL ENGINEERING, 2002, : 393 - 398
[16] Sub-word Language Modeling for Russian LVCSR
Zablotskiy, Sergey
Minker, Wolfgang
SPEECH AND COMPUTER (SPECOM 2015), 2015, 9319 : 413 - 421
[17] Combined approach to dysarthric speaker verification using data augmentation and feature fusion
Salim, Shinimol
Shahnawazuddin, Syed
Ahmad, Waquar
SPEECH COMMUNICATION, 2024, 160
[18] Sub-word Based Offline Handwritten Farsi Word Recognition Using Recurrent Neural Network
Ghadikolaie, Mohammad Fazel Younessy
Kabir, Ehsanolah
Razzazi, Farbod
ETRI JOURNAL, 2016, 38 (04) : 703 - 713
[19] A neural network using acoustic sub-word units for continuous speech recognition
Yu, HJ
Oh, YH
ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 506 - 509
[20] Sub-word Image Clustering in Farsi Printed Books
Soheili, Mohammad Reza
Kabir, Ehsanollah
Stricker, Didier
SEVENTH INTERNATIONAL CONFERENCE ON MACHINE VISION (ICMV 2014), 2015, 9445
