Speaker identification using orthogonal and discriminative features

被引：0

作者：

Davarpanah, SH

Mirzaei, A

Ziaei, A

机构：

来源：

IWSSIP 2005: Proceedings of the 12th International Worshop on Systems, Signals & Image Processing | 2005年

关键词：

speaker identification; multivariate analysis of variance algorithm; vector quantization;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

It is desirable that acoustic vectors would form separable clusters in the feature space; however analysis of the common feature vectors does not support this assumption. This paper proposes a new method that manipulates the original features to produce a new feature set in which classes have more convex shape. The proposed methodology uses the idea that according to it, different features have unequal discrimination properties between speakers; So an automatic weighting function based on Multivariate Analysis Of Variance Algorithm (MANOVA) is proposed. MANOVA searches for a linear combination of original features with the largest separation among the speakers. The Vector Quantization (VQ) algorithm is used to detect speakers in the next stage. Although this algorithm is faster and has fewer complexes than the other classification algorithms in this content, promising results are achieved.

引用

页码：293 / 296

页数：4

共 50 条

[1] Learning Discriminative Features for Speaker Identification and Verification
Yadav, Sarthak
Rai, Atul
[J]. 19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 2237 - 2241
[2] Discriminative speaker adaptation using articulatory features
Metze, Florian
[J]. SPEECH COMMUNICATION, 2007, 49 (05) : 348 - 360
[3] Discriminative lip-motion features for biometric speaker identification
Cetingül, HE
Yemez, Y
Erzin, E
Tekalp, AM
[J]. ICIP: 2004 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1- 5, 2004, : 2023 - 2026
[4] Discriminative training for speaker identification
Hong, QY
Kwong, S
[J]. ELECTRONICS LETTERS, 2004, 40 (04) : 280 - 281
[5] Speaker Identification Using Discriminative Learning of Large Margin GMM
Daoudi, Khalid
Jourani, Reda
Andre-Obrecht, Regine
Aboutajdine, Driss
[J]. NEURAL INFORMATION PROCESSING, PT II, 2011, 7063 : 300 - +
[6] Speaker identification using the VQ-based discriminative kernels
Lei, ZC
Yang, YC
Wu, ZH
[J]. AUDIO AND VIDEO BASED BIOMETRIC PERSON AUTHENTICATION, PROCEEDINGS, 2005, 3546 : 797 - 803
[7] Discriminative analysis of lip motion features for speaker identification and speech-reading
Cetinguel, H. Ertan
Yemez, Yuecel
Erzin, Engin
Tekalp, A. Murat
[J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2006, 15 (10) : 2879 - 2891
[8] Discriminative training of GMM for speaker identification
delAlamo, CM
Gil, FJC
Munilla, CDL
Gomez, LH
[J]. 1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 89 - 92
[9] Speaker identification using nonlinear dynamical features
Petry, A
Barone, DAC
[J]. CHAOS SOLITONS & FRACTALS, 2002, 13 (02) : 221 - 231
[10] Speaker identification using speech and lip features
Ou, GB
Li, X
Yao, XC
Jia, HB
Murphey, YL
[J]. PROCEEDINGS OF THE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), VOLS 1-5, 2005, : 2565 - 2570

← 1 2 3 4 5 →