Mining the Web of Linked Data with RapidMiner

被引:48
|
作者
Ristoski, Petar [1 ]
Bizer, Christian [1 ]
Paulheim, Heiko [1 ]
机构
[1] Univ Mannheim, Data & Web Sci Grp, B6,26, D-68159 Mannheim, Germany
来源
JOURNAL OF WEB SEMANTICS | 2015年 / 35卷
关键词
Linked Open Data; Data mining; RapidMiner;
D O I
10.1016/j.websem.2015.06.004
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Lots of data from different domains are published as Linked Open Data (LOD). While there are quite a few browsers for such data, as well as intelligent tools for particular purposes, a versatile tool for deriving additional knowledge by mining the Web of Linked Data is still missing. In this system paper, we introduce the RapidMiner Linked Open Data extension. The extension hooks into the powerful data mining and analysis platform RapidMiner, and offers operators for accessing Linked Open Data in RapidMiner, allowing for using it in sophisticated data analysis workflows without the need for expert knowledge in SPARQL or RDF. The extension allows for autonomously exploring the Web of Data by following links, thereby discovering relevant datasets on the fly, as well as for integrating overlapping data found in different datasets. As an example, we show how statistical data from the World Bank on scientific publications, published as an RDF data cube, can be automatically linked to further datasets and analyzed using additional background knowledge from ten different LOD datasets. (C) 2015 Elsevier B.V. All rights reserved.
引用
收藏
页码:142 / 151
页数:10
相关论文
共 50 条
  • [31] Web Log Data Analysis and Mining
    Grace, L. K. Joshila
    Maheswari, V.
    Nagamalai, Dhinaharan
    [J]. ADVANCED COMPUTING, PT III, 2011, 133 : 459 - 469
  • [32] Mining indirect associations in Web data
    Tan, PN
    Kumar, V
    [J]. WEBKDD 2001 - MINING WEB LOG DATA ACROSS ALL CUSTOMERS TOUCH POINTS, 2002, 2356 : 145 - 166
  • [33] A web architecture for data mining in biology
    Doncescu, Andrei
    Farmer, Muhammad
    Inoue, Katsumi
    Richard, Gibes
    [J]. 20TH INTERNATIONAL CONFERENCE ON ADVANCED INFORMATION NETWORKING AND APPLICATIONS, VOL 2, PROCEEDINGS, 2006, : 607 - +
  • [34] Web data mining and reasoning model
    Li, YF
    Zhong, N
    [J]. AI 2004: ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2004, 3339 : 1128 - 1134
  • [35] Data mining in a closed Web environment
    Faba-Pérez, C
    Guerrero-Bote, VP
    De Moya-Anegón, F
    [J]. SCIENTOMETRICS, 2003, 58 (03) : 623 - 640
  • [36] Personalized Web Data Mining System
    He, Bo
    [J]. ADVANCED RESEARCH ON INFORMATION SCIENCE, AUTOMATION AND MATERIAL SYSTEM, PTS 1-6, 2011, 219-220 : 183 - 186
  • [37] Mining web data for competency management
    Zhu, J
    Gonçalves, AL
    Uren, VS
    Motta, E
    Pacheco, R
    [J]. 2005 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE, PROCEEDINGS, 2005, : 94 - 100
  • [38] Data mining for Web security: UserWatcher
    Mahoui, M
    Bhargava, B
    Mohania, M
    [J]. IC'2001: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INTERNET COMPUTING, VOLS I AND II, 2001, : 936 - 942
  • [39] Challenges concerning web data mining
    Gaul, Wolfgang
    [J]. COMPSTAT 2006: PROCEEDINGS IN COMPUTATIONAL STATISTICS, 2006, : 403 - 416
  • [40] Web log data mining analysis
    Lu Ansheng
    [J]. 2012 INTERNATIONAL CONFERENCE ON INTELLIGENCE SCIENCE AND INFORMATION ENGINEERING, 2012, 20 : 213 - 215