RDF2Vec: RDF Graph Embeddings for Data Mining

被引:214
|
作者
Ristoski, Petar [1 ]
Paulheim, Heiko [1 ]
机构
[1] Univ Mannheim, Data & Web Sci Grp, Mannheim, Germany
来源
关键词
Graph embeddings; Linked open data; Data mining; KERNEL;
D O I
10.1007/978-3-319-46523-4_30
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Linked Open Data has been recognized as a valuable source for background information in data mining. However, most data mining tools require features in propositional form, i.e., a vector of nominal or numerical features associated with an instance, while Linked Open Data sources are graphs by nature. In this paper, we present RDF2Vec, an approach that uses language modeling approaches for unsupervised feature extraction from sequences of words, and adapts them to RDF graphs. We generate sequences by leveraging local information from graph substructures, harvested by Weisfeiler-Lehman Subtree RDF Graph Kernels and graph walks, and learn latent numerical representations of entities in RDF graphs. Our evaluation shows that such vector representations outperform existing techniques for the propositionalization of RDF graphs on a variety of different predictive machine learning tasks, and that feature vector representations of general knowledge graphs such as DBpedia and Wikidata can be easily reused for different tasks.
引用
收藏
页码:498 / 514
页数:17
相关论文
共 50 条
  • [21] Integrating RDF into Hypergraph-Graph (HG(2)) Data Structure
    Munshi, Shiladitya
    Chakraborty, Ayan
    Mukhopadhyay, Debajyoti
    2013 INTERNATIONAL CONFERENCE ON CLOUD & UBIQUITOUS COMPUTING & EMERGING TECHNOLOGIES (CUBE 2013), 2013, : 208 - +
  • [22] Global RDF Vector Space Embeddings
    Cochez, Michael
    Ristoski, Petar
    Ponzetto, Simone Paolo
    Paulheim, Heiko
    SEMANTIC WEB - ISWC 2017, PT I, 2017, 10587 : 190 - 207
  • [23] Embeddings of Simple Modular Extended RDF
    Damasio, Carlos Viegas
    Analyti, Anastasia
    Antoniou, Grigoris
    WEB REASONING AND RULE SYSTEMS, 2010, 6333 : 204 - +
  • [24] A Distributed Graph Engine for Web Scale RDF Data
    Zeng, Kai
    Yang, Jiacheng
    Wang, Haixun
    Shao, Bin
    Wang, Zhongyuan
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2013, 6 (04): : 265 - 276
  • [25] Querying RDF data from a graph database perspective
    Angles, R
    Gutierrez, C
    SEMANTIC WEB: RESEARCH AND APPLICATIONS, PROCEEDINGS, 2005, 3532 : 346 - 360
  • [26] Applying DAC Principles to the RDF Graph Data Model
    Kirrane, Sabrina
    Mileo, Alessandra
    Decker, Stefan
    SECURITY AND PRIVACY PROTECTION IN INFORMATION PROCESSING SYSTEMS, 2013, 405 : 69 - 82
  • [27] Indexing temporal RDF graph
    Yan, Li
    Zhao, Ping
    Ma, Zongmin
    COMPUTING, 2019, 101 (10) : 1457 - 1488
  • [28] Improving RDF Data Through Association Rule Mining
    Ziawasch Abedjan
    Felix Naumann
    Datenbank-Spektrum, 2013, 13 (2) : 111 - 120
  • [29] Mining Biomedical Ontologies and Data Using RDF Hypergraphs
    Liu, Haishan
    Dou, Dejing
    Jin, Ruoming
    LePendu, Paea
    Shah, Nigam
    2013 12TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2013), VOL 1, 2013, : 141 - 146
  • [30] Mining semantic association rules from RDF data
    Barati, Molood
    Bai, Quan
    Liu, Qing
    KNOWLEDGE-BASED SYSTEMS, 2017, 133 : 183 - 196