Mining the Web of Linked Data with RapidMiner

被引:48
|
作者
Ristoski, Petar [1 ]
Bizer, Christian [1 ]
Paulheim, Heiko [1 ]
机构
[1] Univ Mannheim, Data & Web Sci Grp, B6,26, D-68159 Mannheim, Germany
来源
JOURNAL OF WEB SEMANTICS | 2015年 / 35卷
关键词
Linked Open Data; Data mining; RapidMiner;
D O I
10.1016/j.websem.2015.06.004
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Lots of data from different domains are published as Linked Open Data (LOD). While there are quite a few browsers for such data, as well as intelligent tools for particular purposes, a versatile tool for deriving additional knowledge by mining the Web of Linked Data is still missing. In this system paper, we introduce the RapidMiner Linked Open Data extension. The extension hooks into the powerful data mining and analysis platform RapidMiner, and offers operators for accessing Linked Open Data in RapidMiner, allowing for using it in sophisticated data analysis workflows without the need for expert knowledge in SPARQL or RDF. The extension allows for autonomously exploring the Web of Data by following links, thereby discovering relevant datasets on the fly, as well as for integrating overlapping data found in different datasets. As an example, we show how statistical data from the World Bank on scientific publications, published as an RDF data cube, can be automatically linked to further datasets and analyzed using additional background knowledge from ten different LOD datasets. (C) 2015 Elsevier B.V. All rights reserved.
引用
收藏
页码:142 / 151
页数:10
相关论文
共 50 条
  • [1] Mining Schema Knowledge from Linked Data on the Web
    Mehri, Razieh
    Valtchev, Petko
    [J]. KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT (KSEM 2017): 10TH INTERNATIONAL CONFERENCE, KSEM 2017, MELBOURNE, VIC, AUSTRALIA, AUGUST 19-20, 2017, PROCEEDINGS, 2017, 10412 : 261 - 273
  • [2] An Exploratory Study on Utilising the Web of Linked Data for Product Data Mining
    Zhang Z.
    Song X.
    [J]. SN Computer Science, 4 (1)
  • [3] Web + Data Mining = Web Mining
    Kilian Stoffel
    [J]. HMD Praxis der Wirtschaftsinformatik, 2009, 46 (4) : 6 - 20
  • [4] Web data mining
    Wibonele, KJ
    Zhang, YQ
    [J]. DATA MINING AND KNOWLEDGE DISCOVERY: THEORY, TOOLS AND TECHNOLOGY IV, 2002, 4730 : 241 - 244
  • [5] Data mining for the web
    Spiliopoulou, M
    [J]. PRINCIPLES OF DATA MINING AND KNOWLEDGE DISCOVERY, 1999, 1704 : 588 - 589
  • [6] Data mining on Web
    Zhang, XB
    [J]. THIRD INTERNATIONAL CONFERENCE ON ELECTRONIC COMMERCE ENGINEERING: DIGITAL ENTERPRISES AND NONTRADITIONAL INDUSTRIALIZATION, 2003, : 504 - 507
  • [7] From temporal data mining and web mining to temporal web mining
    Samia, M
    Conrad, S
    [J]. DATABASES AND INFORMATION SYSTEMS, 2005, 118 : 91 - 102
  • [8] Survivability Strategies for Emerging Wireless Networks With Data Mining Techniques: a Case Study With NetLogo and RapidMiner
    Garcia-Magarino, Ivan
    Gray, Geraldine
    Lacuesta, Raquel
    Lloret, Jaime
    [J]. IEEE ACCESS, 2018, 6 : 27958 - 27970
  • [9] Correlation review of classification algorithm using data mining tool: WEKA, Rapidminer, Tanagra, Orange and Knime
    Naik, Amrita
    Samant, Lilavati
    [J]. INTERNATIONAL CONFERENCE ON COMPUTATIONAL MODELLING AND SECURITY (CMS 2016), 2016, 85 : 662 - 668
  • [10] Data Preprocessing for Web Data Mining
    Zhang, Wei
    Chen, Tinggui
    [J]. ADVANCES IN ELECTRONIC COMMERCE, WEB APPLICATION AND COMMUNICATION, VOL 2, 2012, 149 : 303 - +