Speaker Clustering Based on Non-Negative Matrix Factorization Using Gaussian Mixture Model in Complementary Subspace

Cited by: 0
Authors
Nishida, Masafumi [1]
Yamamoto, Seiichi [2]
Affiliations
[1] Shizuoka Univ, Dept Informat, Shizuoka, Japan
[2] Doshisha Univ, Dept Informat & Comp Sci, Kyoto, Japan
Keywords
ACM proceedings; text tagging; DIARIZATION;
DOI
10.1145/3095713.3095721
Chinese Library Classification
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Variations in speech features are mainly attributed to variations in the phonetic and speaker information contained in speech data. If these two types of information can be separated from each other, more robust speaker clustering can be achieved. A principal component analysis transformation can separate speaker information from phonetic information, under the assumption that a space with large within-speaker variance is a "phonetic subspace" and a space with small within-speaker variance is a "speaker subspace". We propose a speaker clustering method based on non-negative matrix factorization using a Gaussian mixture model trained in the speaker subspace. We compared the proposed method with conventional methods based on the Bayesian information criterion and on a Gaussian mixture model in the observation space. The experimental results showed that the proposed method achieves higher clustering accuracy than the conventional methods.
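The subspace assumption in the abstract can be illustrated with a small sketch. This is not the authors' implementation: it only shows one way to split a feature space with PCA into a high-variance "phonetic" part and its complementary low-variance "speaker" part. The function names (`toy_data`, `split_subspaces`, `project`), the pooled within-segment covariance, the assumption that each segment contains a single speaker, and the toy data are all assumptions made for illustration. The Gaussian mixture model and the non-negative matrix factorization steps are not shown.

```python
import numpy as np


def toy_data(frames_per_segment=200, seed=0):
    """Hypothetical toy features: 4 segments, 2 speakers, 4 dimensions.

    Dimensions 0-1 vary strongly within a speaker (stand-in for phonetic
    variation); dimensions 2-3 vary little but carry a speaker offset.
    """
    rng = np.random.default_rng(seed)
    scale = np.array([3.0, 3.0, 0.1, 0.1])
    frames, segment_ids = [], []
    for seg, speaker in enumerate([0, 0, 1, 1]):
        offset = np.array([0.0, 0.0, 2.0 * speaker, -2.0 * speaker])
        frames.append(rng.normal(size=(frames_per_segment, 4)) * scale + offset)
        segment_ids.append(np.full(frames_per_segment, seg))
    return np.vstack(frames), np.concatenate(segment_ids)


def split_subspaces(frames, segment_ids, n_phonetic):
    """Return (phonetic_basis, speaker_basis) as column-orthonormal matrices.

    Assumption: each segment holds one speaker, so the pooled within-segment
    covariance approximates the within-speaker covariance.
    """
    frames = np.asarray(frames, dtype=float)
    segment_ids = np.asarray(segment_ids)
    dim = frames.shape[1]
    scatter = np.zeros((dim, dim))
    for seg in np.unique(segment_ids):
        x = frames[segment_ids == seg]
        centered = x - x.mean(axis=0)
        scatter += centered.T @ centered
    cov = scatter / len(frames)

    # eigh returns eigenvalues in ascending order; reorder to descending.
    eigvals, eigvecs = np.linalg.eigh(cov)
    eigvecs = eigvecs[:, np.argsort(eigvals)[::-1]]

    phonetic_basis = eigvecs[:, :n_phonetic]  # large within-speaker variance
    speaker_basis = eigvecs[:, n_phonetic:]   # complementary subspace
    return phonetic_basis, speaker_basis


def project(frames, basis):
    """Project frames onto the subspace spanned by the basis columns."""
    return np.asarray(frames, dtype=float) @ basis


frames, segment_ids = toy_data()
phonetic_basis, speaker_basis = split_subspaces(frames, segment_ids, n_phonetic=2)
speaker_features = project(frames, speaker_basis)
```

In this sketch a speaker model would then be trained on `speaker_features` rather than on the original frames. The choice of `n_phonetic` is a free parameter here; the abstract does not state how the split is chosen.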
Pages: 5
Related papers
50 in total
  • [41] Graph regularized sparse non-negative matrix factorization for clustering
    Deng, Ping
    Wang, Hongjun
    Li, Tianrui
    Zhao, Hui
    Wu, Yanping
    DEVELOPMENTS OF ARTIFICIAL INTELLIGENCE TECHNOLOGIES IN COMPUTATION AND ROBOTICS, 2020, 12 : 987 - 994
  • [42] Sparse non-negative matrix factorization for uncertain data clustering
    Chen, Danyang
    Wang, Xiangyu
    Xu, Xiu
    Zhong, Cheng
    Xu, Jinhui
    INTELLIGENT DATA ANALYSIS, 2022, 26 (03) : 615 - 636
  • [43] Non-negative Matrix Factorization based on γ-Divergence
    Machida, Kohei
    Takenouchi, Takashi
    2015 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2015,
  • [44] Curvature-Aware Non-negative Matrix Factorization for Clustering
    Lv, Jiaren
    2017 INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS, ELECTRONICS AND CONTROL (ICCSEC), 2017, : 115 - 120
  • [45] Non-negative matrix factorization by maximizing correntropy for cancer clustering
    Jim Jing-Yan Wang
    Xiaolei Wang
    Xin Gao
    BMC Bioinformatics, 14
  • [46] Regularized non-negative matrix factorization with Gaussian mixtures and masking model for speech enhancement
    Chung, Hanwook
    Plourde, Eric
    Champagne, Benoit
    SPEECH COMMUNICATION, 2017, 87 : 18 - 30
  • [47] Improvement of non-negative matrix factorization based language model using exponential models
    Novak, M
    Mammone, R
    ASRU 2001: IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, CONFERENCE PROCEEDINGS, 2001, : 190 - 193
  • [48] Fast intrusion detection based on a non-negative matrix factorization model
    Guan, Xiaohong
    Wang, Wei
    Zhang, Xiangliang
    JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2009, 32 (01) : 31 - 44
  • [49] Fault detection method for non-Gaussian processes based on non-negative matrix factorization
    Li, Xiang-bao
    Yang, Yu-pu
    Zhang, Wei-dong
    ASIA-PACIFIC JOURNAL OF CHEMICAL ENGINEERING, 2013, 8 (03) : 362 - 370
  • [50] Document clustering of MEDLINE abstracts based on non-negative matrix factorization using local confidence assessment
    Kang, Byeong-Chul
    Sur, Zee-Won
    Park, Chulhwan
    Cho, Man-gi
    BIOCHIP JOURNAL, 2010, 4 (04) : 336 - 349