An analysis of data fusion methods for speaker verification

Cited by: 0

Authors:

Farrell, KR [1]

Ramachandran, RP [1]

Mammone, RJ [1]

Affiliations:

[1] T NETIX SpeakEZ Inc, Englewood, CO 80112 USA

Source:

PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6 | 1998

Keywords:

DOI:

Not available

Chinese Library Classification:

O42 [Acoustics];

Discipline classification codes:

070206; 082403;

Abstract:

In this paper, we analyze the diversity of information provided by several modeling approaches for speaker verification. This information is used to facilitate the fusion of the individual results into an overall result that is more accurate than the individual models. The modeling methods evaluated are the neural tree network (NTN), Gaussian mixture model (GMM), hidden Markov model (HMM), and dynamic time warping (DTW). With the exception of DTW, all methods use subword-based approaches. The phrase-level scores for each modeling approach are used for combination. Several data fusion methods are evaluated for combining the model results, including the linear and log opinion pool approaches along with voting. The results of the above analysis have been integrated into a system that has been tested with several databases collected within landline and cellular environments. We have found the linear and log opinion pool methods to consistently reduce the error rate from that obtained when the models are used individually.
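The three fusion rules named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes each model's phrase-level score has already been mapped to a probability in (0, 1) that the claimed speaker is genuine, that the weights are non-negative and sum to 1, and that voting uses a simple majority at a 0.5 threshold. The function names, weights, and example scores are hypothetical.

```python
import math

def linear_opinion_pool(probs, weights):
    """Weighted arithmetic mean of the per-model probabilities."""
    return sum(w * p for w, p in zip(weights, probs))

def log_opinion_pool(probs, weights):
    """Weighted geometric mean, renormalized over accept/reject."""
    log_accept = sum(w * math.log(p) for w, p in zip(weights, probs))
    log_reject = sum(w * math.log(1.0 - p) for w, p in zip(weights, probs))
    accept = math.exp(log_accept)
    reject = math.exp(log_reject)
    return accept / (accept + reject)

def majority_vote(probs, threshold=0.5):
    """Accept only if more than half of the models accept individually."""
    votes = sum(1 for p in probs if p > threshold)
    return votes * 2 > len(probs)

# Hypothetical probabilities for four models (e.g. NTN, GMM, HMM, DTW)
# with equal weights; these numbers are illustrative only.
example_probs = [0.9, 0.8, 0.6, 0.3]
example_weights = [0.25, 0.25, 0.25, 0.25]

fused_linear = linear_opinion_pool(example_probs, example_weights)
fused_log = log_opinion_pool(example_probs, example_weights)
fused_vote = majority_vote(example_probs)
```

In this sketch the two opinion pools return a fused probability that still has to be compared against a decision threshold, while voting returns the accept/reject decision directly.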


Pages: 1129-1132

Page count: 4

