Selective Matrix Factorization for Multi-relational Data Fusion

被引:15
|
作者
Wang, Yuehui [1 ]
Yu, Guoxian [1 ,3 ]
Domeniconi, Carlotta [2 ]
Wang, Jun [1 ]
Zhang, Xiangliang [3 ]
Guo, Maozu [4 ]
机构
[1] Southwest Univ, Coll Comp & Informat Sci, Chongqing, Peoples R China
[2] George Mason Univ, Dept Comp Sci, Fairfax, VA 22030 USA
[3] King Abdullah Univ Sci & Technol, Thuwal, Saudi Arabia
[4] Beijing Univ Civil Engn & Architecture, Coll Elect & Informat Engn, Beijing, Peoples R China
关键词
Matrix factorization; Data fusion; Multi-relational data; Association prediction; FRAMEWORK; PREDICTION;
D O I
10.1007/978-3-030-18576-3_19
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Matrix factorization based data fusion solutions can account for the intrinsic structures of multi-relational data sources, but most solutions equally treat these sources or prefer sparse ones, which may be irrelevant for the target task. In this paper, we introduce a Selective Matrix Factorization based Data Fusion approach (SelMFDF) to collaboratively factorize multiple inter-relational data matrices into lowrank representation matrices of respective object types and optimize the weights of them. To avoid preference to sparse data matrices, it additionally regularizes these low-rank matrices by approximating them to multiple intra-relational data matrices and also optimizes the weights of them. Both weights contribute to automatically integrate relevant data sources. Finally, it reconstructs the target relational data matrix using the optimized low-rank matrices. We applied SelMFDF for predicting inter-relations (lncRNA-miRNA interactions, functional annotations of proteins) and intra-relations (protein-protein interactions). SelMFDF achieves a higher AUROC (area under the receiver operating characteristics curve) by at least 5.88%, and larger AUPRC (area under the precision-recall curve) by at least 18.23% than other related and competitive approaches. The empirical study also confirms that SelMFDF can not only differentially integrate these relational data matrices, but also has no preference toward sparse ones.
引用
收藏
页码:313 / 329
页数:17
相关论文
共 50 条
  • [1] Weighted matrix factorization on multi-relational data for LncRNA-disease association prediction
    Wang, Yuehui
    Yu, Guoxian
    Wang, Jun
    Fu, Guangyuan
    Guo, Maozu
    Domeniconi, Carlotta
    [J]. METHODS, 2020, 173 : 32 - 43
  • [2] Personalized Multi-relational Matrix Factorization Model for Predicting Student Performance
    Nedungadi, Prema
    Smruthy, T. K.
    [J]. INTELLIGENT SYSTEMS TECHNOLOGIES AND APPLICATIONS, VOL 1, 2016, 384 : 163 - 172
  • [3] Clustering Multivariate Time Series Data a via Multi-Nonnegative Matrix Factorization in Multi-Relational Networks
    Zhou, Lihua
    Du, Guowang
    Tao, Dapeng
    Chen, Hongmei
    Cheng, Jun
    Gong, Libo
    [J]. IEEE ACCESS, 2018, 6 : 74747 - 74761
  • [4] Multi-relational learning via hierarchical nonparametric Bayesian collective matrix factorization
    Yang, Hongxia
    Lozano, Aurelie
    [J]. JOURNAL OF APPLIED STATISTICS, 2015, 42 (05) : 1133 - 1147
  • [5] Large-Scale Factorization of Type-Constrained Multi-Relational Data
    Krompass, Denis
    Nickel, Maximilian
    Tresp, Volker
    [J]. 2014 INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS (DSAA), 2014, : 18 - 24
  • [6] Evolutionary Propositionalization of Multi-Relational Data
    Kassarnig, Valentin
    Wotawa, Franz
    [J]. INTERNATIONAL JOURNAL OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING, 2018, 28 (11-12) : 1739 - 1754
  • [7] Numbers in multi-relational data mining
    Knobbe, AJ
    Ho, EKY
    [J]. KNOWLEDGE DISCOVERY IN DATABASES: PKDD 2005, 2005, 3721 : 544 - 551
  • [8] Class-aware tensor factorization for multi-relational classification
    Katsimpras, Georgios
    Paliouras, Georgios
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2020, 57 (02)
  • [9] Robust Multi-Relational Clustering via l1-Norm Symmetric Nonnegative Matrix Factorization
    Liu, Kai
    Wang, Hua
    [J]. PROCEEDINGS OF THE 53RD ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL) AND THE 7TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (IJCNLP), VOL 2, 2015, : 397 - 401
  • [10] On multi-relational data mining for foundation of data mining
    Liu, Miao
    Guo, Hai-Feng
    Chen, Zhengxin
    [J]. 2007 IEEE/ACS INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS, VOLS 1 AND 2, 2007, : 389 - +