An approach of binary isomorphic quantization for speaker identification

被引：0

作者：

Junsod, S ^{[1
]}

Surarerks, A ^{[1
]}

机构：

[1] Chulalongkorn Univ, ELITE, Bangkok 10330, Thailand

来源：

PROCEEDINGS OF THE 3RD IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY | 2003年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Binary isomorphic quantization is a technique for reducing amount of feature vectors by determining their similar form. The feature vectors are extracted from speech. This method is based on a function that measures internal changing of feature vectors to produce binary vectors. The binary vectors are partitioned and then clustered the same binary vectors together. A Set of clusters with the maximum frequency will be chosen to generate a codebook instead of using all binary vectors. An experimental results show the efficiency in speaker identification which gives high accuracy especially in the continuous speech. Moreover, we also investigate its performance by comparing it with other methods.

引用

页码：761 / 764

页数：4

共 50 条

[21] New approach for short utterance speaker identification
Chakroun, Rania
Frikha, Mondher
Zouari, Leila Beltaifa
IET SIGNAL PROCESSING, 2018, 12 (07) : 873 - 880
[22] An approach to speaker identification using multiple classifiers
Radova, V
Psutka, J
1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS, 1997, : 1135 - 1138
[23] A hybrid GMM/SVM approach to speaker identification
Fine, S
Navrátil, J
Gopinath, RA
2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING - VOL IV: SIGNAL PROCESSING FOR COMMUNICATIONS; VOL V: SIGNAL PROCESSING EDUCATION SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO & ELECTROACOUSTICS; VOL VI: SIGNAL PROCESSING THEORY & METHODS STUDENT FORUM, 2001, : 417 - 420
[24] Collaborative personal speaker identification: A generalized approach
Rossi, Mirco
Amft, Oliver
Troester, Gerhard
PERVASIVE AND MOBILE COMPUTING, 2012, 8 (03) : 415 - 428
[25] Dynamic Bayesian network approach to speaker identification
Sang, LF
Yang, YC
Wu, ZH
Zhang, WF
ELECTRONICS LETTERS, 2003, 39 (03) : 329 - 330
[26] Multimodal approach for speaker identification in news programs
Martone, AF
Taskiran, CM
Delp, EJ
STORAGE AND RETRIEVAL METHODS AND APPLICATIONS FOR MULTIMEDIA 2005, 2005, 5682 : 308 - 316
[27] Affinity Preserving Quantization for Hashing: A Vector Quantization Approach to Learning Compact Binary Codes
Wang, Zhe
Duan, Ling-Yu
Huang, Tiejun
Gao, Wen
THIRTIETH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, : 1102 - 1108
[28] Speaker Identification Using Vector Quantization and I-vector with Reference to Assamese Language
Bharali, Sruti Sruba
Kalita, Sanjib Kr.
2017 2ND IEEE INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, SIGNAL PROCESSING AND NETWORKING (WISPNET), 2017, : 164 - 168
[29] Robust speaker identification system based on multilayer eigen-codebook vector quantization
Hsieh, CT
Lai, E
Chen, WC
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2004, E87D (05): : 1185 - 1193
[30] Robust speaker identification system based on multilayer eigen-codebook vector quantization
Hsieh, Ching-Tang
Lai, Eugene
Chen, Wan-Chen
IEICE Transactions on Information and Systems, 2004, E87-D (05) : 1185 - 1193

← 1 2 3 4 5 →