Sparse non-negative matrix factorization for uncertain data clustering

被引:0
|
作者
Chen, Danyang [1 ]
Wang, Xiangyu [2 ]
Xu, Xiu [3 ]
Zhong, Cheng [1 ]
Xu, Jinhui [4 ]
机构
[1] Guangxi Univ, Sch Comp Elect & Informat, Nanning, Guangxi, Peoples R China
[2] Cloud & Smart Ind Grp, Tencent, Guangdong, Peoples R China
[3] China Univ Min & Technol, Sch Comp Sci & Technol, Xuzhou, Jiangsu, Peoples R China
[4] Univ Buffalo, Dept Comp Sci & Engn, Buffalo, NY USA
基金
中国国家自然科学基金;
关键词
Uncertain data clustering; sparse non-negative matrix factorization; data analysis; machine learning; ALGORITHMS;
D O I
10.3233/IDA-205622
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We consider the problem of clustering a set of uncertain data, where each data consists of a point-set indicating its possible locations. The objective is to identify the representative for each uncertain data and group them into k clusters so as to minimize the total clustering cost. Different from other models, our model does not assume that there is a probability distribution for each uncertain data. Thus, all possible locations need to be considered to determine the representative. Existing methods for this problem are either impractical or have difficulty to handle large-scale datasets due to their pairwise-distance based global search strategy and expensive optimization computation. In this paper, we propose a novel sparse Non-negative Matrix Factorization (NMF) method which measures the similarity of uncertain data by their most commonly shared features. A divide-and-conquer approach is adopted to remarkably improve the efficiency. A novel diagonal l(0)-constraint and its l(1) relaxation are proposed to overcome the challenge of determining the representatives. We give a detailed analysis to show the correctness of our method, and provide an effective initialization and peeling strategy to enhance the ability of processing large-scale datasets. Experimental results on some benchmark datasets confirm the effectiveness of our method.
引用
收藏
页码:615 / 636
页数:22
相关论文
共 50 条
  • [1] Graph Regularized Sparse Non-Negative Matrix Factorization for Clustering
    Deng, Ping
    Li, Tianrui
    Wang, Hongjun
    Wang, Dexian
    Horng, Shi-Jinn
    Liu, Rui
    [J]. IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2023, 10 (03) : 910 - 921
  • [2] Graph regularized sparse non-negative matrix factorization for clustering
    Deng, Ping
    Wang, Hongjun
    Li, Tianrui
    Zhao, Hui
    Wu, Yanping
    [J]. DEVELOPMENTS OF ARTIFICIAL INTELLIGENCE TECHNOLOGIES IN COMPUTATION AND ROBOTICS, 2020, 12 : 987 - 994
  • [3] Multiobjective Sparse Non-Negative Matrix Factorization
    Gong, Maoguo
    Jiang, Xiangming
    Li, Hao
    Tan, Kay Chen
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2019, 49 (08) : 2941 - 2954
  • [4] Enforced Sparse Non-Negative Matrix Factorization
    Gavin, Brendan
    Gadepally, Vijay
    Kepner, Jeremy
    [J]. 2016 IEEE 30TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW), 2016, : 902 - 911
  • [5] Probabilistic Sparse Non-negative Matrix Factorization
    Hinrich, Jesper Love
    Morup, Morten
    [J]. LATENT VARIABLE ANALYSIS AND SIGNAL SEPARATION (LVA/ICA 2018), 2018, 10891 : 488 - 498
  • [6] Non-negative matrix factorization for semi-supervised data clustering
    Chen, Yanhua
    Rege, Manjeet
    Dong, Ming
    Hua, Jing
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2008, 17 (03) : 355 - 379
  • [7] Non-negative matrix factorization for semi-supervised data clustering
    Yanhua Chen
    Manjeet Rege
    Ming Dong
    Jing Hua
    [J]. Knowledge and Information Systems, 2008, 17 : 355 - 379
  • [8] Comparison of Non-negative Matrix Factorization Methods for Clustering Genomic Data
    Hou, Mi-Xiao
    Gao, Ying-Lian
    Liu, Jin-Xing
    Shang, Jun-Liang
    Zheng, Chun-Hou
    [J]. INTELLIGENT COMPUTING THEORIES AND APPLICATION, ICIC 2016, PT II, 2016, 9772 : 290 - 299
  • [9] Sparse Non-Negative Matrix Factorization for Mesh Segmentation
    McGraw, Tim
    Kang, Jisun
    Herring, Donald
    [J]. INTERNATIONAL JOURNAL OF IMAGE AND GRAPHICS, 2016, 16 (01)
  • [10] Co-sparse Non-negative Matrix Factorization
    Wu, Fan
    Cai, Jiahui
    Wen, Canhong
    Tan, Haizhu
    [J]. FRONTIERS IN NEUROSCIENCE, 2022, 15