OvNMTF Algorithm: an Overlapping Non-Negative Matrix Tri-Factorization for Coclustering

被引:1
|
作者
de Freitas Junior, Waldyr L. [1 ]
Peres, Sarajane M. [1 ]
Freire, Valdinei [1 ]
Brunialti, Lucas Fernandes [2 ]
机构
[1] Univ Sao Paulo, Escola Artes Ciencias & Humanidades, Sao Paulo, Brazil
[2] Cobli, Sao Paulo, Brazil
关键词
coclustering; matrix factorization;
D O I
10.1109/ijcnn48605.2020.9207364
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Coclustering algorithms are an alternative to classic one-sided clustering algorithms. Because of its ability to simultaneously cluster rows and columns of a dyadic data matrix, coclustering offers a higher value-added information: it offers column clusters besides row clusters, and the relationship between them in terms of coclusters. Different structures of coclusters are possible, and those that overlap in terms of rows or columns still represent an open question with room for improvements. In addition, while most related literature cites coclustering as a means of producing better results from one-side clustering, few initiatives study it as a tool capable of providing higher quality descriptive information about this clustering. In this paper, we present a new coclustering algorithm - OvNMTF, based on triple matrix factorization, which properly handle overlapped coclusters, by adding degrees of freedom for matrix factorization that enable the discovery of specialized column clusters for each row cluster. As a proof of concept, we modeled text analysis as a coclustering problem with column overlaps, assuming that given words (data matrix columns) are associated with over one document cluster (row cluster) because they can assume different semantic relationships in each association. Experiments on synthetic data sets show the OvNMTF algorithm reasonableness; experiments on real-world text data show its power for extracting high quality information.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] Scalable non-negative matrix tri-factorization
    Copar, Andrej
    Zitnik, Marinka
    Zupan, Blaz
    BIODATA MINING, 2017, 10
  • [2] Scalable non-negative matrix tri-factorization
    Andrej Čopar
    Marinka žitnik
    Blaž Zupan
    BioData Mining, 10
  • [3] The BinOvNMTF Algorithm: Overlapping Columns Co-clustering based on Non-negative Matrix Tri-Factorization
    Brunialti, Lucas F.
    Peres, Sarajane M.
    da Silva, Valdinei Freire
    de Moraes Lima, Clodoaldo A.
    2017 6TH BRAZILIAN CONFERENCE ON INTELLIGENT SYSTEMS (BRACIS), 2017, : 330 - 335
  • [4] Predicting Drug Synergism by Means of Non-Negative Matrix Tri-Factorization
    Pinoli, Pietro
    Ceddia, Gaia
    Ceri, Stefano
    Masseroli, Marco
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2022, 19 (04) : 1956 - 1967
  • [5] Two-dimensional data partitioning for non-negative matrix tri-factorization
    Yan, Jiaxing
    Liu, Hai
    Lei, Zhiqi
    Rao, Yanghui
    Liu, Guan
    Xie, Haoran
    Tao, Xiaohui
    Wang, Fu Lee
    BIG DATA RESEARCH, 2024, 37
  • [6] Improving drug repositioning accuracy using non-negative matrix tri-factorization
    Li, Qingmei
    Wang, Yangyang
    Wang, Jihan
    Zhao, Congzhe
    SCIENTIFIC REPORTS, 2025, 15 (01):
  • [7] Learning Latent Features with Infinite Non-negative Binary Matrix Tri-factorization
    Yang, Xi
    Huang, Kaizhu
    Zhang, Rui
    Hussain, Amir
    NEURAL INFORMATION PROCESSING, ICONIP 2016, PT I, 2016, 9947 : 587 - 596
  • [8] Non-negative Matrix Tri-Factorization for co-clustering: An analysis of the block matrix
    Del Buono, N.
    Pio, G.
    INFORMATION SCIENCES, 2015, 301 : 13 - 26
  • [9] Non-Negative Matrix Factorization for Semisupervised Heterogeneous Data Coclustering
    Chen, Yanhua
    Wang, Lijun
    Dong, Ming
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2010, 22 (10) : 1459 - 1474
  • [10] Robust Non-Negative Matrix Tri-Factorization with Dual Hyper-Graph Regularization
    Yu, Jiyang
    Che, Hangjun
    Leung, Man-Fai
    Liu, Cheng
    Wu, Wenhui
    Yan, Zheng
    BIG DATA MINING AND ANALYTICS, 2025, 8 (01): : 214 - 232