Extraction and Integration of Web Data by End-Users

Cited by: 2
Authors
Agarwal, Sudhir [1]
Genesereth, Michael [1]
Affiliations
[1] Stanford Univ, Stanford Comp Sci Dept, 353 Serra Mall, Stanford, CA 94305 USA
Keywords
Web Data Extraction; Web Data Integration and Search; FORM;
DOI
10.1145/2505515.2505635
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
For increasingly sophisticated use cases, end users often need to extract, combine, and aggregate information from various (often dynamically generated) web pages across multiple websites. Current search engines do not focus on combining information from various web pages to answer the user's overall information need. Semantic Web and Linked Data approaches usually take a static view of the data and rely on providers' cooperation. In this paper, we present a novel approach that enables end users to easily extract data from web pages while they browse, store it locally in their browser, and structure, integrate, and search such data. We propose Datalog rules for integrating and searching the extracted data. We show how cleaning steps and integration rules can be reused to accelerate the cleaning and integration of extracted data. The proposed approach is implemented as a browser plugin. We present its implementation details and report on our evaluation of the plugin with respect to user experience and savings in browsing time.
Pages: 2405-2410
Number of pages: 6
Related papers
50 in total
  • [1] Data on IT end-users
    Unknown
    [J]. INDUSTRIAL MANAGEMENT & DATA SYSTEMS, 2000, 100 (5-6) : 246 - 246
  • [2] Semantic Web Query Authoring for End-Users
    Moya, Diego
    Macias, Jose A.
    [J]. ENGINEERING THE USER INTERFACE: FROM RESEARCH TO PRACTICE, 2009, : 147 - 160
  • [3] Conceptual modelling of web sites for end-users
    De Troyer O.
    Decruyenaere T.
    [J]. World Wide Web, 2000, Springer (03) : 27 - 42
  • [4] CrawLogo: empowering end-users to program the Web
    McGee, K
    Nilsson, J
    [J]. 2004 IEEE SYMPOSIUM ON VISUAL LANGUAGES AND HUMAN CENTRIC COMPUTING: PROCEEDINGS, 2004, : 134 - 136
  • [5] Delivering the multiagent technology to end-users through the web
    Mitrovic, Dejan
    Ivanovic, Mirjana
    Badica, Costin
    [J]. 4TH INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE, MINING AND SEMANTICS, 2014,
  • [6] Accessibility metrics of web pages for blind end-users
    González, J
    Macías, M
    Rodríguez, R
    Sánchez, F
    [J]. WEB ENGINEERING, PROCEEDINGS, 2003, 2722 : 374 - 383
  • [7] Helping end-users "engineer" dependable web applications
    Elbaum, Sebastian
    Chilakamarri, Kalyan-Ram
    Gopal, Bhuvana
    Rothermel, Gregg
    [J]. 16TH IEEE INTERNATIONAL SYMPOSIUM ON SOFTWARE RELIABILITY ENGINEERING, PROCEEDINGS, 2005, : 31 - 40
  • [8] A Tool Suite to Enable Web designers, Web Application developers and End-Users to Handle Semantic Data
    Rico, Mariano
    Corcho, Oscar
    Antonio Macias, Jose
    Camacho, David
    [J]. INTERNATIONAL JOURNAL ON SEMANTIC WEB AND INFORMATION SYSTEMS, 2010, 6 (03) : 38 - 60
  • [9] STARTING END-USERS
    NORTON, RA
    WESTWATER, J
    [J]. ASLIB PROCEEDINGS, 1986, 38 (11-12) : 381 - 388
  • [10] Generating surrogates to make the semantic Web intelligible to end-users
    Gandon, F
    [J]. 2005 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE, PROCEEDINGS, 2005, : 352 - 358