Protection techniques from information extraction

Authors
Greco, Gianluigi [1 ]
Ianni, Giovambattista [1 ]
Lio, Vincenzino [1 ]
Palopoli, Luigi [1 ]
Affiliation
[1] Univ Calabria, Calabria, Italy
DOI
10.1109/WI.2006.138
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Information extraction technologies meet the market need for automatic tools that extract semi-structured information from Web pages. However, pages may change over time for various reasons, ranging from restyling to deliberate modifications introduced in order to confuse Web wrappers. In this paper we deal with the latter scenario by studying the issue of deliberate wrapper spoiling and its relationship to wrapping. We present an architecture and a tool implementing a wrapper spoiling system, and discuss some practical spoiling techniques, which are also experimentally tested.
Pages: 1029 / +
Page count: 2
Related papers
50 in total
  • [1] Information Extraction from the Web: System and Techniques
    Luo Xiao
    Dieter Wissmann
    Michael Brown
    Stephan Jablonski
    Applied Intelligence, 2004, 21 : 195 - 224
  • [2] Information extraction from the Web: System and techniques
    Xiao, L
    Wissmann, D
    Brown, M
    Jablonski, S
    APPLIED INTELLIGENCE, 2004, 21 (02) : 195 - 224
  • [3] A Review: Information Extraction Techniques From Research Papers
    Jayaram, Kavitha
    Sangeeta, K.
    2017 INTERNATIONAL CONFERENCE ON INNOVATIVE MECHANISMS FOR INDUSTRY APPLICATIONS (ICIMIA), 2017, : 56 - 59
  • [4] ADVANCES IN INFORMATION EXTRACTION TECHNIQUES
    NAGY, G
    REMOTE SENSING OF ENVIRONMENT, 1984, 15 (02) : 167 - 175
  • [5] Integrating shallow and linguistic techniques for information extraction from text
    Ciravegna, F
    Cancedda, N
    TOPICS IN ARTIFICIAL INTELLIGENCE, 1995, 992 : 127 - 138
  • [6] Quantum imaging techniques for improving information extraction from images
    Fabre, Claude
    Treps, Nicolas
    Bachor, Hans A.
    Lam, Ping Koy
    QUANTUM INFORMATION WITH CONTINUOUS VARIABLES OF ATOMS AND LIGHT, 2007, : 323 - +
  • [7] Information extraction and norms of mutual protection
    Bisin, Alberto
    Guaitoli, Danilo
    JOURNAL OF ECONOMIC BEHAVIOR & ORGANIZATION, 2012, 84 (01) : 154 - 162
  • [8] Image information content and extraction techniques
    Ekblad, U
    Kinser, JM
    Atmer, J
    Zetterlund, N
    NUCLEAR INSTRUMENTS & METHODS IN PHYSICS RESEARCH SECTION A-ACCELERATORS SPECTROMETERS DETECTORS AND ASSOCIATED EQUIPMENT, 2004, 525 (1-2) : 397 - 401
  • [9] Inferencing in Information Extraction: Techniques and Applications
    Barbosa, Denilson
    Wang, Haixun
    Yu, Cong
    2015 IEEE 31ST INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2015, : 1534 - 1537
  • [10] Information Extraction from Microarray Data: A Survey of Data Mining Techniques
    Fiori, Alessandro
    Grand, Alberto
    Bruno, Giulia
    Brundu, Francesco Gavino
    Schioppa, Domenico
    Bertotti, Andrea
    JOURNAL OF DATABASE MANAGEMENT, 2014, 25 (01) : 29 - 58