Speaker Clustering Based on Non-negative Matrix Factorization

被引:0
|
作者
Nishida, Masafumi [1 ]
Yamamoto, Seiichi [1 ]
机构
[1] Doshisha Univ, Dept Informat Syst Design, Kyoto 6100321, Japan
关键词
unsupervised speaker clustering; non-negative matrix factorization; agglomerative hierarchical clustering; multi-party conversation;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper addresses unsupervised speaker clustering for multi-party conversations. Hierarchical clustering methods were mainly used in previous studies. However, these methods require many processes, such as distance calculation and cluster merging, when there are many utterances in conversation data. We propose a clustering method based on non-negative matrix factorization. The proposed method can perform fast and robust clustering by decomposing a matrix consisting of distances between models. We conducted speaker clustering experiments using a Bayesian information criterion based method, a method based on the likelihood ratio between Gaussian mixture models, and the proposed method. Experimental results showed that the proposed method achieves higher clustering accuracy than these conventional methods.
引用
收藏
页码:956 / 959
页数:4
相关论文
共 50 条
  • [1] Document clustering based on spectral clustering and non-negative matrix factorization
    Bao, Lei
    Tang, Sheng
    Li, Jintao
    Zhang, Yongdong
    Ye, Wei-Ping
    NEW FRONTIERS IN APPLIED ARTIFICIAL INTELLIGENCE, 2008, 5027 : 149 - +
  • [2] Non-negative Matrix Factorization Based on Clustering and Its Application
    Li M.
    Liang L.
    Chen Y.
    Xu G.
    He K.
    Zhongguo Jixie Gongcheng/China Mechanical Engineering, 2018, 29 (06): : 720 - 725
  • [3] Clustering-based initialization for non-negative matrix factorization
    Xue, Yun
    Tong, Chong Sze
    Chen, Ying
    Chen, Wen-Sheng
    APPLIED MATHEMATICS AND COMPUTATION, 2008, 205 (02) : 525 - 536
  • [4] Speaker Clustering Based on Non-Negative Matrix Factorization Using Gaussian Mixture Model in Complementary Subspace
    Nishida, Masafumi
    Yamamoto, Seiichi
    PROCEEDINGS OF THE 15TH INTERNATIONAL WORKSHOP ON CONTENT-BASED MULTIMEDIA INDEXING (CBMI), 2017,
  • [5] Speaker conversion using kernel non-negative matrix factorization
    Xu Qinyu
    Lu Guanming
    Yan Jingjie
    Li Haibo
    Cheng Xiao
    The Journal of China Universities of Posts and Telecommunications, 2017, (05) : 60 - 67
  • [6] Speaker conversion using kernel non-negative matrix factorization
    Xu Qinyu
    Lu Guanming
    Yan Jingjie
    Li Haibo
    Cheng Xiao
    The Journal of China Universities of Posts and Telecommunications, 2017, 24 (05) : 60 - 67
  • [7] Multiview Clustering Based on Non-Negative Matrix Factorization and Pairwise Measurements
    Wang, Xiumei
    Zhang, Tianzhen
    Gao, Xinbo
    IEEE TRANSACTIONS ON CYBERNETICS, 2019, 49 (09) : 3333 - 3346
  • [8] Speaker conversion using kernel non-negative matrix factorization
    Qinyu X.
    Guanming L.
    Jingjie Y.
    Haibo L.
    Xiao C.
    Guanming, Lu (lugm@njupt.edu.cn), 2017, Beijing University of Posts and Telecommunications (24): : 60 - 67
  • [9] Binary Codes Based on Non-Negative Matrix Factorization for Clustering and Retrieval
    Xiong, Jiang
    Tao, Yingyin
    Zhang, Meng
    Li, Huaqing
    IEEE ACCESS, 2020, 8 : 207012 - 207023
  • [10] Biased unconstrained non-negative matrix factorization for clustering
    Deng, Ping
    Zhang, Fan
    Li, Tianrui
    Wang, Hongjun
    Horng, Shi-Jinn
    KNOWLEDGE-BASED SYSTEMS, 2022, 239