Speaker discriminative weighting method for VQ-based speaker identification

Cited by: 0
Authors
Kinnunen, T [1]
Fränti, P [1]
Affiliation
[1] Univ Joensuu, Dept Comp Sci, FIN-80101 Joensuu, Finland
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
We consider the matching function in a vector quantization (VQ) based speaker identification system. The model of a speaker is a codebook generated from the set of feature vectors from the speaker's voice sample. Matching is performed by evaluating the similarity between the unknown speaker and the models in the database. In this paper, we propose a weighted matching method that takes into account the correlations between the known models in the database. Larger weights are assigned to vectors that have high discriminating power between the speakers, and smaller weights to vectors that have low discriminating power. Experiments show that the new method provides significantly higher identification accuracy and that it can detect the correct speaker from shorter speech samples more reliably than the unweighted matching method.
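The abstract does not give the weighting formula, so the following is only a minimal sketch of the general idea, under stated assumptions. The weight of a code vector is assumed here to be its mean squared distance to the nearest code vector in every other speaker's codebook, normalized to average 1 within each codebook, and the similarity is assumed to be the mean of weight over distance to the nearest code vector. All function names, the toy codebooks, and the `eps` guard are hypothetical and are not taken from the paper.

```python
def squared_distance(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def nearest_code_vector(vec, codebook):
    """Return (index, squared distance) of the code vector closest to vec."""
    distances = [squared_distance(vec, c) for c in codebook]
    j = min(range(len(codebook)), key=distances.__getitem__)
    return j, distances[j]


def discriminative_weights(codebooks):
    """Assumed weighting (not the paper's formula): a code vector that lies
    far from all other speakers' codebooks gets a large weight."""
    if len(codebooks) < 2:
        # Nothing to discriminate against: fall back to uniform weights.
        return {spk: [1.0] * len(cb) for spk, cb in codebooks.items()}
    weights = {}
    for spk, cb in codebooks.items():
        raw = []
        for c in cb:
            gaps = [nearest_code_vector(c, other_cb)[1]
                    for other, other_cb in codebooks.items() if other != spk]
            raw.append(sum(gaps) / len(gaps))
        mean = sum(raw) / len(raw)
        # Normalize so the weights of one codebook average to 1.
        weights[spk] = [r / mean for r in raw] if mean > 0 else [1.0] * len(cb)
    return weights


def similarity(features, codebook, weights, eps=1e-9):
    """Assumed similarity: mean of weight / distance to the nearest code vector."""
    total = 0.0
    for x in features:
        j, d = nearest_code_vector(x, codebook)
        total += weights[j] / (d + eps)  # eps avoids division by zero
    return total / len(features)


def identify(features, codebooks, weights):
    """Return the best-scoring speaker and the score of every speaker."""
    scores = {spk: similarity(features, cb, weights[spk])
              for spk, cb in codebooks.items()}
    return max(scores, key=scores.get), scores


# Toy two-speaker database: both speakers share a region near the origin,
# so the code vectors there carry little discriminating power.
codebooks = {
    "A": [(0.0, 0.0), (5.0, 5.0)],
    "B": [(0.0, 0.5), (-5.0, 5.0)],
}
weights = discriminative_weights(codebooks)
unknown = [(4.8, 5.1), (5.2, 4.9)]  # feature vectors of the unknown speaker
best, scores = identify(unknown, codebooks, weights)
```

In this toy setup the code vectors near the origin receive small weights because both speakers have a code vector there, while the code vectors that are unique to one speaker dominate the score. Passing uniform weights to `identify` gives the unweighted matching for comparison.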
Pages: 150 - 156
Page count: 7
Related papers
50 items in total
  • [1] Speaker identification using the VQ-based discriminative kernels
    Lei, ZC
    Yang, YC
    Wu, ZH
    AUDIO AND VIDEO BASED BIOMETRIC PERSON AUTHENTICATION, PROCEEDINGS, 2005, 3546 : 797 - 803
  • [2] A discriminative training algorithm for VQ-based speaker identification
    He, JL
    Liu, L
    Palm, G
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1999, 7 (03): : 353 - 356
  • [3] A fast search method of VQ-based speaker identification for large population using discriminative factor and hierarchical matching
    Pan, ZB
    2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING - VOL IV: SIGNAL PROCESSING FOR COMMUNICATIONS; VOL V: SIGNAL PROCESSING EDUCATION SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO & ELECTROACOUSTICS; VOL VI: SIGNAL PROCESSING THEORY & METHODS STUDENT FORUM, 2001, : 4017 - 4017
  • [4] Temporal decomposition: A promising approach to VQ-based speaker identification
    Nguyen, PC
    Akagi, M
    Ho, TB
    2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING I, 2003, : 184 - 187
  • [5] Enhanced VQ-based algorithms for speech independent speaker identification
    Fan, NP
    Rosca, J
    AUDIO-BASED AND VIDEO-BASED BIOMETRIC PERSON AUTHENTICATION, PROCEEDINGS, 2003, 2688 : 470 - 477
  • [6] Temporal decomposition: A promising approach to VQ-based speaker identification
    Nguyen, PC
    Akagi, M
    Ho, TB
    2003 INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOL III, PROCEEDINGS, 2003, : 617 - 620
  • [7] A computationally efficient modeling method for text dependent VQ-based speaker identification system
    Gu, QR
    Shibata, T
    6TH WORLD MULTICONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL III, PROCEEDINGS: IMAGE, ACOUSTIC, SPEECH AND SIGNAL PROCESSING I, 2002, : 245 - 248
  • [8] Parallel implementation of a VQ-based text-independent speaker identification
    Soǧanci, Ruhsar
    Gürgen, Fikret
    Topcuoǧlu, Haluk
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2004, 3261 : 291 - 300
  • [9] Parallel implementation of a VQ-based text-independent speaker identification
    Soganci, R
    Gürgen, F
    Topcuoglu, H
    ADVANCES IN INFORMATION SYSTEMS, PROCEEDINGS, 2004, 3261 : 291 - 300
  • [10] Weighted Distortion Measure on Standard Deviation for VQ-Based Speaker Identification
    Luo, Xiao-ting
    Ji, Li-xin
    Li, Shao-mei
    2010 2ND INTERNATIONAL CONFERENCE ON E-BUSINESS AND INFORMATION SYSTEM SECURITY (EBISS 2010), 2010, : 422 - 425