Mining the Web of Linked Data with RapidMiner

Cited by: 48
Authors
Ristoski, Petar [1]
Bizer, Christian [1]
Paulheim, Heiko [1]
Affiliations
[1] Univ Mannheim, Data & Web Sci Grp, B6,26, D-68159 Mannheim, Germany
Source
JOURNAL OF WEB SEMANTICS | 2015 / Vol. 35
Keywords
Linked Open Data; Data mining; RapidMiner
DOI
10.1016/j.websem.2015.06.004
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Lots of data from different domains are published as Linked Open Data (LOD). While there are quite a few browsers for such data, as well as intelligent tools for particular purposes, a versatile tool for deriving additional knowledge by mining the Web of Linked Data is still missing. In this system paper, we introduce the RapidMiner Linked Open Data extension. The extension hooks into the powerful data mining and analysis platform RapidMiner, and offers operators for accessing Linked Open Data in RapidMiner, allowing for using it in sophisticated data analysis workflows without the need for expert knowledge in SPARQL or RDF. The extension allows for autonomously exploring the Web of Data by following links, thereby discovering relevant datasets on the fly, as well as for integrating overlapping data found in different datasets. As an example, we show how statistical data from the World Bank on scientific publications, published as an RDF data cube, can be automatically linked to further datasets and analyzed using additional background knowledge from ten different LOD datasets. © 2015 Elsevier B.V. All rights reserved.
Pages: 142-151
Number of pages: 10