Research on Web Information Extraction Based on XML

被引:0
|
作者
Hu, Yan [1 ]
Xuan, Yanyan [1 ]
机构
[1] Wuhan Univ Technol, Dept Comp Sci & Technol, Wuhan 430070, Peoples R China
关键词
D O I
10.1109/WGEC.2008.16
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The standard XML technology is used for Web information extraction in this paper, and a generic XML-based Web information extraction solution is proposed. In the extraction process, two key technologies are proposed and implemented: the XML-based Web data conversion technology and the DOM-based XPath generation technology, to simplify the information extraction work. XSLT is used as the description language of extraction rules, which is conductive to the unity of extraction patterns.
引用
收藏
页码:201 / 204
页数:4
相关论文
共 50 条
  • [31] Automatic Data Extraction from Lists in Web Pages Based on XML
    Xin, Zhou
    Hao, Wang
    [J]. ADVANCED TECHNOLOGY IN TEACHING - PROCEEDINGS OF THE 2009 3RD INTERNATIONAL CONFERENCE ON TEACHING AND COMPUTATIONAL SCIENCE (WTCS 2009), VOL 2: EDUCATION, PSYCHOLOGY AND COMPUTER SCIENCE, 2012, 117 : 915 - 921
  • [32] Research of in the Integrated Transportation Information Platform based on XML
    Li, RM
    Lu, HP
    Qian, Z
    Shi, QX
    [J]. 2005 IEEE Intelligent Transportation Systems Conference (ITSC), 2005, : 214 - 219
  • [33] Research on Information Hiding Based on XML Tag Attributes
    Wang, Xiaofeng
    [J]. ADVANCES IN MECHATRONICS, AUTOMATION AND APPLIED INFORMATION TECHNOLOGIES, PTS 1 AND 2, 2014, 846-847 : 1668 - 1671
  • [34] Research on describing method of process information based on XML
    Chen, Shou-Qiang
    Cai, Chang-Tao
    [J]. Shanghai Jiaotong Daxue Xuebao/Journal of Shanghai Jiaotong University, 2009, 43 (06): : 944 - 948
  • [35] Research on the integration of stamping product information based on XML
    Luo, Jin-Ping
    Su, Wen-Bin
    Wang, Chao-Ming
    Guo, Cheng
    [J]. Suxing Gongcheng Xuebao/Journal of Plasticity Engineering, 2003, 10 (04):
  • [36] Research on Web application Platform Based on XML and JS']JSP
    Luo, YueChuan
    Wang, Chen
    Yu, JiuFeng
    [J]. PROCEEDINGS OF 2009 INTERNATIONAL CONFERENCE ON INFORMATION, ELECTRONIC AND COMPUTER SCIENCE, VOLS I AND II, 2009, : 56 - 60
  • [37] The research of web-based data warehouse using XML
    Li, X
    Wang, XR
    Liu, WH
    Liao, L
    [J]. 2001 INTERNATIONAL CONFERENCES ON INFO-TECH AND INFO-NET PROCEEDINGS, CONFERENCE A-G: INFO-TECH & INFO-NET: A KEY TO BETTER LIFE, 2001, : E42 - E47
  • [38] Research and Realization of Web Services Security Based on XML Signature
    Gu Yue-sheng
    Zhang Bao-jian
    Xu Wu
    [J]. 2009 INTERNATIONAL CONFERENCE ON NETWORKING AND DIGITAL SOCIETY, VOL 2, PROCEEDINGS, 2009, : 116 - 118
  • [39] Web service research of urban geographical data based on XML
    Qiao, Gang
    Wang, Weian
    Wu, Zhangfeng
    Zhang, Jinglei
    [J]. 2006 6TH INTERNATIONAL CONFERENCE ON ITS TELECOMMUNICATIONS PROCEEDINGS, 2006, : 214 - +
  • [40] Web Information Extraction based on similar patterns
    Ye, N
    Wu, XJ
    Zhu, JB
    Chen, WL
    Yao, TS
    [J]. ADVANCES IN WEB-AGE INFORMATION MANAGEMENT: PROCEEDINGS, 2004, 3129 : 646 - 651