Determining the number of latent factors in statistical multi-relational learning

被引:0
|
作者
Shi, Chengchun [1 ]
Lu, Wenbin [1 ]
Song, Rui [1 ]
机构
[1] Department of Statistics, North Carolina State University, Raleigh,NC,27695, United States
关键词
Knowledge graph - Maximum likelihood estimation - Learning systems - Sampling - Statistics - Factorization;
D O I
暂无
中图分类号
学科分类号
摘要
Statistical relational learning is primarily concerned with learning and inferring relationships between entities in large-scale knowledge graphs. Nickel et al. (2011) proposed a RESCAL tensor factorization model for statistical relational learning, which achieves better or at least comparable results on common benchmark data sets when compared to other state-of-the-art methods. Given a positive integer s, RESCAL computes an s-dimensional latent vector for each entity. The latent factors can be further used for solving relational learning tasks, such as collective classification, collective entity resolution and link-based clustering. The focus of this paper is to determine the number of latent factors in the RESCAL model. Due to the structure of the RESCAL model, its log-likelihood function is not concave. As a result, the corresponding maximum likelihood estimators (MLEs) may not be consistent. Nonetheless, we design a specific pseudometric, prove the consistency of the MLEs under this pseudometric and establish its rate of convergence. Based on these results, we propose a general class of information criteria and prove their model selection consistencies when the number of relations is either bounded or diverges at a proper rate of the number of entities. Simulations and real data examples show that our proposed information criteria have good finite sample properties. © 2019 Chengchun Shi, Wenbin Lu, Rui Song.
引用
收藏
相关论文
共 50 条
  • [1] Determining the Number of Latent Factors in Statistical Multi-Relational Learning
    Shi, Chengchun
    Lu, Wenbin
    Song, Rui
    JOURNAL OF MACHINE LEARNING RESEARCH, 2019, 20
  • [2] Introduction to the special issue on multi-relational data mining and statistical relational learning
    Blockeel, H
    Jensen, D
    Kramer, S
    MACHINE LEARNING, 2006, 62 (1-2) : 3 - 5
  • [3] An Optimized Approach for Feature Extraction in Multi-Relational Statistical Learning
    Bakshi, Garima
    Shukla, Rati
    Yadav, Vikash
    Dahiya, Aman
    Anand, Rohit
    Sindhwani, Nidhi
    Singh, Harinder
    JOURNAL OF SCIENTIFIC & INDUSTRIAL RESEARCH, 2021, 80 (06): : 537 - 542
  • [4] Introduction to the special issue on multi-relational data mining and statistical relational learning
    Hendrik Blockeel
    David Jensen
    Stefan Kramer
    Machine Learning, 2006, 62 : 3 - 5
  • [5] Multi-Relational Measurement for Latent Construct Networks
    Koehly, Laura M.
    Marcum, Christopher Steven
    PSYCHOLOGICAL METHODS, 2018, 23 (01) : 42 - 57
  • [6] Statistical Analysis of Multi-Relational Network Recovery
    Wang, Zhi
    Tang, Xueying
    Liu, Jingchen
    FRONTIERS IN APPLIED MATHEMATICS AND STATISTICS, 2020, 6
  • [7] Multi-Relational Learning with Gaussian Processes
    Xu, Zhao
    Kersting, Kristian
    Tresp, Volker
    21ST INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI-09), PROCEEDINGS, 2009, : 1309 - 1314
  • [8] Multi-Relational Contrastive Learning for Recommendation
    Wei, Wei
    Xia, Lianghao
    Huang, Chao
    PROCEEDINGS OF THE 17TH ACM CONFERENCE ON RECOMMENDER SYSTEMS, RECSYS 2023, 2023, : 338 - 349
  • [9] Adaptive Convolution for Multi-Relational Learning
    Jiang, Xiaotian
    Wang, Quan
    Wang, Bin
    2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, 2019, : 978 - 987
  • [10] Metric learning with multi-relational data
    Pan, Jiajun
    Le Capitaine, Hoel
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2024,