An approach of binary isomorphic quantization for speaker identification

被引:0
|
作者
Junsod, S [1 ]
Surarerks, A [1 ]
机构
[1] Chulalongkorn Univ, ELITE, Bangkok 10330, Thailand
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Binary isomorphic quantization is a technique for reducing amount of feature vectors by determining their similar form. The feature vectors are extracted from speech. This method is based on a function that measures internal changing of feature vectors to produce binary vectors. The binary vectors are partitioned and then clustered the same binary vectors together. A Set of clusters with the maximum frequency will be chosen to generate a codebook instead of using all binary vectors. An experimental results show the efficiency in speaker identification which gives high accuracy especially in the continuous speech. Moreover, we also investigate its performance by comparing it with other methods.
引用
收藏
页码:761 / 764
页数:4
相关论文
共 50 条
  • [21] New approach for short utterance speaker identification
    Chakroun, Rania
    Frikha, Mondher
    Zouari, Leila Beltaifa
    IET SIGNAL PROCESSING, 2018, 12 (07) : 873 - 880
  • [22] An approach to speaker identification using multiple classifiers
    Radova, V
    Psutka, J
    1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS, 1997, : 1135 - 1138
  • [23] A hybrid GMM/SVM approach to speaker identification
    Fine, S
    Navrátil, J
    Gopinath, RA
    2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING - VOL IV: SIGNAL PROCESSING FOR COMMUNICATIONS; VOL V: SIGNAL PROCESSING EDUCATION SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO & ELECTROACOUSTICS; VOL VI: SIGNAL PROCESSING THEORY & METHODS STUDENT FORUM, 2001, : 417 - 420
  • [24] Collaborative personal speaker identification: A generalized approach
    Rossi, Mirco
    Amft, Oliver
    Troester, Gerhard
    PERVASIVE AND MOBILE COMPUTING, 2012, 8 (03) : 415 - 428
  • [25] Dynamic Bayesian network approach to speaker identification
    Sang, LF
    Yang, YC
    Wu, ZH
    Zhang, WF
    ELECTRONICS LETTERS, 2003, 39 (03) : 329 - 330
  • [26] Multimodal approach for speaker identification in news programs
    Martone, AF
    Taskiran, CM
    Delp, EJ
    STORAGE AND RETRIEVAL METHODS AND APPLICATIONS FOR MULTIMEDIA 2005, 2005, 5682 : 308 - 316
  • [27] Affinity Preserving Quantization for Hashing: A Vector Quantization Approach to Learning Compact Binary Codes
    Wang, Zhe
    Duan, Ling-Yu
    Huang, Tiejun
    Gao, Wen
    THIRTIETH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, : 1102 - 1108
  • [28] Speaker Identification Using Vector Quantization and I-vector with Reference to Assamese Language
    Bharali, Sruti Sruba
    Kalita, Sanjib Kr.
    2017 2ND IEEE INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, SIGNAL PROCESSING AND NETWORKING (WISPNET), 2017, : 164 - 168
  • [29] Robust speaker identification system based on multilayer eigen-codebook vector quantization
    Hsieh, CT
    Lai, E
    Chen, WC
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2004, E87D (05): : 1185 - 1193
  • [30] Robust speaker identification system based on multilayer eigen-codebook vector quantization
    Hsieh, Ching-Tang
    Lai, Eugene
    Chen, Wan-Chen
    IEICE Transactions on Information and Systems, 2004, E87-D (05) : 1185 - 1193