Unsupervised speaker recognition based on competition between self-organizing maps

被引:49
|
作者
Lapidot, I [1 ]
Guterman, H
Cohen, A
机构
[1] Negev Acad Coll Engn, Dept Software Engn, IL-84100 Beer Sheva, Israel
[2] Ben Gurion Univ Negev, Dept Elect & Comp Engn, IL-84105 Beer Sheva, Israel
来源
IEEE TRANSACTIONS ON NEURAL NETWORKS | 2002年 / 13卷 / 04期
关键词
competitive learning; segmentation; self-organizing maps (SOMs); speaker recognition; temporal data clustering; vector quantization (VQ);
D O I
10.1109/TNN.2002.1021888
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a method for clustering the speakers from unlabeled and unsegmented conversation (with known number of speakers), when no a priori knowledge about the identity of the participants is given. Each speaker was modeled by a self-organizing map (SOM). The SOMs were randomly initiated. An iterative algorithm allows the data move from one model to another and adjust the SOMs. The restriction that the data can move only in small groups but not by moving each and every feature vector separately force the SOMs to adjust to speakers (instead of phonemes or other vocal events). This method was applied to high-quality conversations with two to five participants and to two-speaker telephone-quality conversations. The results for two (both high- and telephone-quality) and three speakers were over 80% correct segmentation. The problem becomes even harder when the number of participants is also unknown. Based on the iterative clustering algorithm a validity criterion was also developed to estimate the number of speakers. In 16 out of 17 conversations of high-quality conversations between two and three participants, the estimation of the number of the participants was correct. In telephone-quality the results were poorer.
引用
收藏
页码:877 / 887
页数:11
相关论文
共 50 条
  • [1] Unsupervised speaker classification using self-organizing maps (SOM)
    Voitovetsky, I
    Guterman, H
    Cohen, A
    [J]. NEURAL NETWORKS FOR SIGNAL PROCESSING VII, 1997, : 578 - 587
  • [2] Experiment with multiple parametric self-organizing maps for speaker recognition
    Gomez, P.
    Yen, K.
    Caballero, A.
    [J]. Proceedings of the Sixth International Conference on Information and Management Sciences, 2007, 6 : 851 - 856
  • [3] Text independent automatic speaker recognition using self-organizing maps
    Mafra, AT
    Simoes, MG
    [J]. CONFERENCE RECORD OF THE 2004 IEEE INDUSTRY APPLICATIONS CONFERENCE, VOLS 1-4: COVERING THEORY TO PRACTICE, 2004, : 1503 - 1510
  • [4] PARALLEL SELF-ORGANIZING FEATURE MAPS FOR UNSUPERVISED PATTERN-RECOGNITION
    HUNTSBERGER, TL
    AJJIMARANGSEE, P
    [J]. INTERNATIONAL JOURNAL OF GENERAL SYSTEMS, 1990, 16 (04) : 357 - 372
  • [5] Deep Self-Organizing Maps for Unsupervised Image Classification
    Wickramasinghe, Chathurika S.
    Amarasinghe, Kasun
    Manic, Milos
    [J]. IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2019, 15 (11) : 5837 - 5845
  • [6] Gird pattern recognition based on clustering of self-organizing maps
    Tian, Jing
    Zhang, Boyu
    Yang, Wenyu
    [J]. Tian, J. (yutaka-2010@163.com), 1600, Editorial Board of Medical Journal of Wuhan University (38): : 1330 - 1334
  • [7] Unsupervised pattern recognition in continuous seismic wavefield records using Self-Organizing Maps
    Kohler, Andreas
    Ohrnberger, Matthias
    Scherbaum, Frank
    [J]. GEOPHYSICAL JOURNAL INTERNATIONAL, 2010, 182 (03) : 1619 - 1630
  • [8] Hierarchies of Self-Organizing Maps for action recognition
    Buonamente, Miriam
    Dindo, Haris
    Johnsson, Magnus
    [J]. COGNITIVE SYSTEMS RESEARCH, 2016, 39 : 33 - 41
  • [9] SELF-ORGANIZING MAPS - LOCAL COMPETITION AND EVOLUTIONARY OPTIMIZATION
    JOCKUSCH, S
    RITTER, H
    [J]. NEURAL NETWORKS, 1994, 7 (08) : 1229 - 1239
  • [10] A Survey on the Development of Self-Organizing Maps for Unsupervised Intrusion Detection
    Xiaofei Qu
    Lin Yang
    Kai Guo
    Linru Ma
    Meng Sun
    Mingxing Ke
    Mu Li
    [J]. Mobile Networks and Applications, 2021, 26 : 808 - 829