Parallel Non-Negative Matrix Tri-Factorization for Text Data Co-Clustering

被引:14
|
作者
Chen, Yufu [1 ]
Lei, Zhiqi [1 ]
Rao, Yanghui [1 ]
Xie, Haoran [2 ]
Wang, Fu Lee [3 ]
Yin, Jian [4 ,5 ]
Li, Qing [6 ]
机构
[1] Sun Yat sen Univ, Sch Comp Sci & Engn, Guangzhou 510006, Peoples R China
[2] Lingnan Univ, Dept Comp & Decis Sci, Hong Kong, Peoples R China
[3] Hong Kong Metropolitan Univ, Sch Sci & Technol, Kowloon, Hong Kong, Peoples R China
[4] Sun Yat sen Univ, Sch Artificial Intelligence, Zhuhai 519082, Peoples R China
[5] Sun Yat sen Univ, Guangdong Key Lab Big Data Anal & Proc, Guangzhou 510006, Peoples R China
[6] Hong Kong Polytech Univ, Dept Comp, Kowloon, Hong Kong, Peoples R China
基金
中国国家自然科学基金;
关键词
Matrix decomposition; Computational modeling; Data models; Convergence; Optimization; Scalability; Partitioning algorithms; Non-negative matrix tri-factorization; parallel computing; message passing; Newton iteration; FRAMEWORK; MODEL; ALGORITHMS;
D O I
10.1109/TKDE.2022.3145489
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As a novel paradigm for data mining and dimensionality reduction, Non-negative Matrix Tri-Factorization (NMTF) has attracted much attention due to its notable performance and elegant mathematical derivation, and it has been applied to a plethora of real-world applications, such as text data co-clustering. However, the existing NMTF-based methods usually involve intensive matrix multiplications, which exhibits a major limitation of high computational complexity. With the explosion at both the size and the feature dimension of texts, there is a growing need to develop a parallel and scalable NMTF-based algorithm for text data co-clustering. To this end, we first show in this paper how to theoretically derive the original optimization problem of NMTF by introducing the Lagrangian multipliers. Then, we propose to solve the Lagrange dual objective function in parallel through an efficient distributed implementation. Extensive experiments on five benchmark corpora validate the effectiveness, efficiency, and scalability of our distributed parallel update algorithm for an NMTF-based text data co-clustering method.
引用
收藏
页码:5132 / 5146
页数:15
相关论文
共 50 条
  • [31] A novel regularized asymmetric non-negative matrix factorization for text clustering
    Aghdam, Mehdi Hosseinzadeh
    Zanjani, Mohammad Daryaie
    INFORMATION PROCESSING & MANAGEMENT, 2021, 58 (06)
  • [32] Sparse non-negative matrix factorization for uncertain data clustering
    Chen, Danyang
    Wang, Xiangyu
    Xu, Xiu
    Zhong, Cheng
    Xu, Jinhui
    INTELLIGENT DATA ANALYSIS, 2022, 26 (03) : 615 - 636
  • [33] Multi-Type Co-clustering of General Heterogeneous Information Networks via Nonnegative Matrix Tri-factorization
    Zhang, Xianchao
    Li, Haixin
    Liang, Wenxin
    Luo, Jiebo
    2016 IEEE 16TH INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2016, : 1353 - 1358
  • [34] Non-negative Matrix Factorization and Co-clustering: A Promising Tool for Multi-tasks Bearing Fault Diagnosis
    Shen, Fei
    Chen, Chao
    Yan, Ruqiang
    12TH INTERNATIONAL CONFERENCE ON DAMAGE ASSESSMENT OF STRUCTURES, 2017, 842
  • [35] Learning Dual Preferences with Non-negative Matrix Tri-Factorization for Top-N Recommender System
    Li, Xiangsheng
    Rao, Yanghui
    Xie, Haoran
    Chen, Yufu
    Lau, Raymond Y. K.
    Wang, Fu Lee
    Yin, Jian
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, DASFAA 2018, PT I, 2018, 10827 : 133 - 149
  • [36] Four algorithms to solve symmetric multi-type non-negative matrix tri-factorization problem
    Hribar, Rok
    Hrga, Timotej
    Papa, Gregor
    Petelin, Gasper
    Povh, Janez
    Przulj, Natasa
    Vukasinovic, Vida
    JOURNAL OF GLOBAL OPTIMIZATION, 2022, 82 (02) : 283 - 312
  • [37] Improved Computational Drug-Repositioning by Self-Paced Non-Negative Matrix Tri-Factorization
    Dang, Qi
    Liang, Yong
    Ouyang, Dong
    Miao, Rui
    Ling, Caijin
    Liu, Xiaoying
    Xie, Shengli
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2023, 20 (03) : 1953 - 1962
  • [38] Robust capped norm dual hyper-graph regularized non-negative matrix tri-factorization
    Yu, Jiyang
    Pan, Baicheng
    Yu, Shanshan
    Leung, Man-Fai
    MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2023, 20 (07) : 12486 - 12509
  • [39] Four algorithms to solve symmetric multi-type non-negative matrix tri-factorization problem
    Rok Hribar
    Timotej Hrga
    Gregor Papa
    Gašper Petelin
    Janez Povh
    Nataša Pržulj
    Vida Vukašinović
    Journal of Global Optimization, 2022, 82 : 283 - 312
  • [40] Non-negative matrix factorization for semi-supervised data clustering
    Chen, Yanhua
    Rege, Manjeet
    Dong, Ming
    Hua, Jing
    KNOWLEDGE AND INFORMATION SYSTEMS, 2008, 17 (03) : 355 - 379