RDF2Vec: RDF Graph Embeddings for Data Mining

被引:214
|
作者
Ristoski, Petar [1 ]
Paulheim, Heiko [1 ]
机构
[1] Univ Mannheim, Data & Web Sci Grp, Mannheim, Germany
来源
关键词
Graph embeddings; Linked open data; Data mining; KERNEL;
D O I
10.1007/978-3-319-46523-4_30
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Linked Open Data has been recognized as a valuable source for background information in data mining. However, most data mining tools require features in propositional form, i.e., a vector of nominal or numerical features associated with an instance, while Linked Open Data sources are graphs by nature. In this paper, we present RDF2Vec, an approach that uses language modeling approaches for unsupervised feature extraction from sequences of words, and adapts them to RDF graphs. We generate sequences by leveraging local information from graph substructures, harvested by Weisfeiler-Lehman Subtree RDF Graph Kernels and graph walks, and learn latent numerical representations of entities in RDF graphs. Our evaluation shows that such vector representations outperform existing techniques for the propositionalization of RDF graphs on a variety of different predictive machine learning tasks, and that feature vector representations of general knowledge graphs such as DBpedia and Wikidata can be easily reused for different tasks.
引用
收藏
页码:498 / 514
页数:17
相关论文
共 50 条
  • [41] Graph-based Large Scale RDF Data Compression
    Zhang, Wei Emma
    SIGIR'14: PROCEEDINGS OF THE 37TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2014, : 1276 - 1276
  • [42] Exploiting RDF Open Data Using NoSQL Graph Databases
    Bouhali, Raouf
    Laurent, Anne
    ARTIFICIAL INTELLIGENCE APPLICATIONS AND INNOVATIONS, 2015, 458 : 177 - 190
  • [43] A DISTRIBUTIONAL STRUCTURED SEMANTIC SPACE FOR QUERYING RDF GRAPH DATA
    Freitas, Andre
    Curry, Edward
    Gabriel Oliveira, Joao
    O'Riain, Sean
    INTERNATIONAL JOURNAL OF SEMANTIC COMPUTING, 2011, 5 (04) : 433 - 462
  • [44] Navigation in RDF data
    Dokulil, Jiri
    Katreniakova, Jana
    PROCEEDINGS OF THE 12TH INTERNATIONAL INFORMATION VISUALISATION, 2008, : 26 - +
  • [45] RDF2PT: Generating Brazilian Portuguese Texts from RDF Data
    Moussallem, Diego
    Ferreira, Thiago Castro
    Zampieri, Marcos
    Cavalcanti, Maria Claudia
    Xexeo, Geraldo
    Neves, Mariana
    Ngomo, Axel-Cyrille Ngonga
    PROCEEDINGS OF THE ELEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2018), 2018, : 3043 - 3050
  • [46] MonetDB/RDF: Discovering and Exploiting the Emergent Schema of RDF Data
    Minh-Duc Pham
    Boncz, Peter
    ERCIM NEWS, 2014, (96): : 41 - 42
  • [47] H2RDF+ : An Efficient Data Management System for Big RDF Graphs
    Papailiou, Nikolaos
    Tsoumakos, Dimitrios
    Konstantinou, Ioannis
    Karras, Panagiotis
    Koziris, Nectarios
    SIGMOD'14: PROCEEDINGS OF THE 2014 ACM SIGMOD INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2014, : 909 - 912
  • [48] mStore: Schema Mining based-RDF Data Storage
    Zheng, Guopeng
    Ren, Tenglong
    Yang, Lulu
    Zhang, Xiaowang
    Feng, Zhiyong
    COMPANION OF THE WORLD WIDE WEB CONFERENCE, WWW 2023, 2023, : 168 - 171
  • [49] RDF Data Clustering
    Giannini, Silvia
    BUSINESS INFORMATION SYSTEMS WORKSHOPS, BIS 2013, 2013, 160 : 220 - 231
  • [50] Efficient RDF Interchange (ERI) Format for RDF Data Streams
    Fernandez, Javier D.
    Llaves, Alejandro
    Corcho, Oscar
    SEMANTIC WEB - ISWC 2014, PT II, 2014, 8797 : 244 - 259