A class of neural-network-based transducers for web information extraction

Cited by: 13
Authors
Sleiman, Hassan A. [1]
Corchuelo, Rafael [1]
Affiliations
[1] Univ Seville, ETSI Informat, E-41012 Seville, Spain
Keywords
web wrappers; web information extraction; neural networks; finite automata; machine learning; supervised method; wrapper induction
DOI
10.1016/j.neucom.2013.05.057
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The Web is a huge and still growing information repository that has attracted the attention of many companies. Many such companies rely on information extractors to integrate information that is buried in semi-structured web documents into automatic business processes. Many information extractors build on extraction rules, which can be handcrafted or learned using supervised or unsupervised techniques. The literature provides a variety of techniques to learn information extraction rules that build on ad hoc machine learning techniques. In this paper, we propose a hybrid approach that explores the use of standard machine-learning techniques to extract web information. We have specifically explored using neural networks; our results show that our proposal outperforms three state-of-the-art techniques in the literature, which opens up quite a new approach to information extraction. © 2013 Elsevier B.V. All rights reserved.
Pages: 61-68
Number of pages: 8