Protection techniques from information extraction

Cited by: 0
Authors
Greco, Gianluigi [1 ]
Ianni, Giovambattista [1 ]
Lio, Vincenzino [1 ]
Palopoli, Luigi [1 ]
Affiliation
[1] Univ Calabria, Calabria, Italy
DOI
10.1109/WI.2006.138
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Information extraction technologies meet the market need for automatic tools that extract semi-structured information from web pages. However, pages may change over time for various reasons, ranging from restyling to deliberate modifications made in order to confuse Web wrappers. In this paper we deal with the latter scenario by studying the issue of deliberate wrapper spoiling and its relationship to wrapping. We present an architecture and a tool implementing a wrapper spoiling system, and discuss some practical spoiling techniques, which are also tested experimentally.
Pages: 1029 / +
Page count: 2