A class of neural-network-based transducers for web information extraction

被引:13
|
作者
Sleiman, Hassan A. [1 ]
Corchuelo, Rafael [1 ]
机构
[1] Univ Seville, ETSI Informat, E-41012 Seville, Spain
关键词
web wrappers; web information extraction; neural networks; finite automata; machine learning; supervised method; WRAPPER INDUCTION;
D O I
10.1016/j.neucom.2013.05.057
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The Web is a huge and still growing information repository that has attracted the attention of many companies. Many such companies rely on information extractors to integrate information that is buried into semi-structured web documents into automatic business processes. Many information extractors build on extraction rules, which can be handcrafted or learned using supervised or unsupervised techniques. The literature provides a variety of techniques to learn information extraction rules that build on ad hoc machine learning techniques. In this paper, we propose a hybrid approach that explores the use of standard machine-learning techniques to extract web information. We have specifically explored using neural networks; our results show that our proposal outperforms three state-of-the-art techniques in the literature, which opens up quite a new approach to information extraction. (c) 2013 Elsevier B.V. All rights reserved.
引用
收藏
页码:61 / 68
页数:8
相关论文
共 50 条
  • [41] NEURAL-NETWORK-BASED DETECTION OF ESOPHAGEAL INTUBATION
    LEON, MA
    RASANEN, J
    MANGAR, D
    ANESTHESIA AND ANALGESIA, 1994, 78 (03): : 548 - 553
  • [42] A Neural-Network-based Sketch Recognition System
    Su, Mu-Chun
    Hsio, Ting-Huan
    Hsieh, Yi-Zeng
    Lin, Shih-Chieh
    Chou, Chien-Hsing
    IEEE INTERNATIONAL SYMPOSIUM ON INTELLIGENT SIGNAL PROCESSING AND COMMUNICATIONS SYSTEMS (ISPACS 2012), 2012,
  • [43] Neural-network-based system for monitoring the Aurora
    Newell, Patrick T., 1600, (11): : 3 - 4
  • [44] NEURAL-NETWORK-BASED BLACKBOARD DEMON SUBSYSTEMS
    HO, CS
    HSU, CC
    APPLIED INTELLIGENCE, 1993, 3 (02) : 143 - 158
  • [45] A Method to Discover Sensitive Information in Classified Network Based on Web Information Extraction
    Zhang, Jianping
    Li, Hongmin
    Lu, Min
    Ke, Mingmin
    2016 FIRST IEEE INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATION AND THE INTERNET (ICCCI 2016), 2016, : 262 - 265
  • [46] A Neural-Network-Based Approach for Routing in a Packet Switching Network
    Cavalieri, S.
    Di Stefano, A.
    Mirabella, O.
    Proceedings of the International Joint Conference on Neural Networks, 1992, 2 : 913 - 918
  • [47] Neural-Network-Based Adaptive Fault Estimation for a Class of Interconnected Nonlinear System with Triangular Forms
    Liu, Lei
    Wang, Zhanshan
    Liu, Jinhai
    Liu, Zhenwei
    ADVANCES IN NEURAL NETWORKS - ISNN 2014, 2014, 8866 : 110 - 120
  • [48] Further result on a dynamic recurrent neural-network-based adaptive observer for a class of nonlinear systems
    Huang, SN
    Tan, KK
    Lee, TH
    AUTOMATICA, 2005, 41 (12) : 2161 - 2162
  • [49] Sensing analysis of self-mixing and Michelson interferometry with neural-network-based phase extraction
    Chen, Junbao
    Wang, Xinmeng
    He, Cheng
    Wang, Ming
    JOURNAL OF MODERN OPTICS, 2024, 71 (1-3) : 34 - 41
  • [50] Symmetric All Convolutional Neural-Network-Based Unsupervised Feature Extraction for Hyperspectral Images Classification
    Zhang, Mingyang
    Gong, Maoguo
    He, Haibo
    Zhu, Shengqi
    IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (05) : 2981 - 2993