Speaker identification using orthogonal and discriminative features

被引:0
|
作者
Davarpanah, SH
Mirzaei, A
Ziaei, A
机构
关键词
speaker identification; multivariate analysis of variance algorithm; vector quantization;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
It is desirable that acoustic vectors would form separable clusters in the feature space; however analysis of the common feature vectors does not support this assumption. This paper proposes a new method that manipulates the original features to produce a new feature set in which classes have more convex shape. The proposed methodology uses the idea that according to it, different features have unequal discrimination properties between speakers; So an automatic weighting function based on Multivariate Analysis Of Variance Algorithm (MANOVA) is proposed. MANOVA searches for a linear combination of original features with the largest separation among the speakers. The Vector Quantization (VQ) algorithm is used to detect speakers in the next stage. Although this algorithm is faster and has fewer complexes than the other classification algorithms in this content, promising results are achieved.
引用
收藏
页码:293 / 296
页数:4
相关论文
共 50 条
  • [1] Learning Discriminative Features for Speaker Identification and Verification
    Yadav, Sarthak
    Rai, Atul
    [J]. 19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 2237 - 2241
  • [2] Discriminative speaker adaptation using articulatory features
    Metze, Florian
    [J]. SPEECH COMMUNICATION, 2007, 49 (05) : 348 - 360
  • [3] Discriminative lip-motion features for biometric speaker identification
    Cetingül, HE
    Yemez, Y
    Erzin, E
    Tekalp, AM
    [J]. ICIP: 2004 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1- 5, 2004, : 2023 - 2026
  • [4] Discriminative training for speaker identification
    Hong, QY
    Kwong, S
    [J]. ELECTRONICS LETTERS, 2004, 40 (04) : 280 - 281
  • [5] Speaker Identification Using Discriminative Learning of Large Margin GMM
    Daoudi, Khalid
    Jourani, Reda
    Andre-Obrecht, Regine
    Aboutajdine, Driss
    [J]. NEURAL INFORMATION PROCESSING, PT II, 2011, 7063 : 300 - +
  • [6] Speaker identification using the VQ-based discriminative kernels
    Lei, ZC
    Yang, YC
    Wu, ZH
    [J]. AUDIO AND VIDEO BASED BIOMETRIC PERSON AUTHENTICATION, PROCEEDINGS, 2005, 3546 : 797 - 803
  • [7] Discriminative analysis of lip motion features for speaker identification and speech-reading
    Cetinguel, H. Ertan
    Yemez, Yuecel
    Erzin, Engin
    Tekalp, A. Murat
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2006, 15 (10) : 2879 - 2891
  • [8] Discriminative training of GMM for speaker identification
    delAlamo, CM
    Gil, FJC
    Munilla, CDL
    Gomez, LH
    [J]. 1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 89 - 92
  • [9] Speaker identification using nonlinear dynamical features
    Petry, A
    Barone, DAC
    [J]. CHAOS SOLITONS & FRACTALS, 2002, 13 (02) : 221 - 231
  • [10] Speaker identification using speech and lip features
    Ou, GB
    Li, X
    Yao, XC
    Jia, HB
    Murphey, YL
    [J]. PROCEEDINGS OF THE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), VOLS 1-5, 2005, : 2565 - 2570