WEB INFORMATION EXTRACTION AND ITS APPLICATION

Cited by: 0
Authors
Peng, Yan [1]
Zhang, Chenyue [2]
Affiliations
[1] Capital Normal Univ, Sch Management, Beijing, Peoples R China
[2] ChangYou Com Ltd, Beijing, Peoples R China
Keywords
Information Extraction; XML; HTML; Extraction Rule
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Information extraction (IE) addresses the problem of extracting specific information from a collection of documents. This paper describes an approach to designing an information extraction system and puts forward a basic system architecture. It details the steps of web information extraction: web page organization, rule generation, and result display. Finally, the successfully extracted information is placed in an XML template designed to capture the information needed by a teaching-learning system. Although the work is restricted to HTML course outlines, the concepts and methods are easily applied to other domains.
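The pipeline the abstract outlines (apply extraction rules to an HTML course outline, then place the results in an XML template) can be illustrated with a minimal sketch. This is not the authors' system: the rule format (one regular expression per field), the field names, the `course` root element, and the sample page are all assumptions made for illustration.

```python
import re
import xml.etree.ElementTree as ET

# Hypothetical extraction rules: field name -> regex with one capture group.
# The paper's actual rule format is not given in the abstract.
RULES = {
    "title": r"<h1[^>]*>(.*?)</h1>",
    "instructor": r"Instructor:\s*([^<]+)",
    "credits": r"Credits:\s*(\d+)",
}


def strip_tags(fragment):
    """Remove any inline HTML tags left inside a captured value."""
    return re.sub(r"<[^>]+>", "", fragment).strip()


def extract(html, rules):
    """Apply each rule to the page; keep only the fields that matched."""
    result = {}
    for field, pattern in rules.items():
        match = re.search(pattern, html, flags=re.IGNORECASE | re.DOTALL)
        if match:
            result[field] = strip_tags(match.group(1))
    return result


def to_xml(fields, root_tag="course"):
    """Fill a flat XML template: one child element per extracted field."""
    root = ET.Element(root_tag)
    for name, value in fields.items():
        ET.SubElement(root, name).text = value
    return ET.tostring(root, encoding="unicode")


# Made-up course outline page used only to exercise the sketch.
SAMPLE = """<html><body>
<h1>Introduction to <b>Databases</b></h1>
<p>Instructor: A. Example</p>
<p>Credits: 3</p>
</body></html>"""

fields = extract(SAMPLE, RULES)
xml_text = to_xml(fields)
print(xml_text)
```

Regular expressions keep the sketch short; a real system working on arbitrary pages would more likely normalize the HTML and express rules against the parsed document tree.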
Pages: 448-451
Page count: 4
Related papers
50 in total
  • [1] Web page title extraction and its application
    Xue, Yewei
    Hu, Yunhua
    Xin, Guomao
    Song, Ruihua
    Shi, Shuming
    Cao, Yunbo
    Lin, Chin-Yew
    Li, Hang
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2007, 43 (05) : 1332 - 1347
  • [2] Web Page Segmentation and its Application for Web Information Crawling
    Feng, Hanyang
    Zhang, Wenzhe
    Wu, Hesheng
    Wang, Chong-Jun
    [J]. 2016 IEEE 28TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2016), 2016, : 598 - 605
  • [3] An Efficient Wrapper for Web Data Extraction and its Application
    Zhang, Suzhi
    Shi, Peizhong
    [J]. ICCSSE 2009: PROCEEDINGS OF 2009 4TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE & EDUCATION, 2009, : 1245 - 1250
  • [4] An information filtering model on the Web and its application in JobAgent
    Li, Y
    Zhang, C
    Swan, JR
    [J]. KNOWLEDGE-BASED SYSTEMS, 2000, 13 (05) : 285 - 296
  • [5] Research on the Application of Web Information Extraction Based On Semi Structured XML
    Yang, Guo-Jun
    [J]. 2016 INTERNATIONAL CONFERENCE ON SERVICE SCIENCE, TECHNOLOGY AND ENGINEERING (SSTE 2016), 2016, : 317 - 323
  • [6] A method for web information extraction
    Lam, Man I.
    Gong, Zhiguo
    Muyeba, Maybin
    [J]. PROGRESS IN WWW RESEARCH AND DEVELOPMENT, PROCEEDINGS, 2008, 4976 : 383 - +
  • [7] Web Services for information extraction from the Web
    Habegger, B
    Quafafou, M
    [J]. IEEE INTERNATIONAL CONFERENCE ON WEB SERVICES, PROCEEDINGS, 2004, : 279 - 286
  • [8] Information extraction for the semantic web
    Baumgartner, R
    Eiter, T
    Gottlob, G
    Herzog, M
    Koch, C
    [J]. REASONING WEB, 2005, 3564 : 275 - 289
  • [9] Rough association mining and its application in web information gathering
    Li, YF
    Zhong, N
    [J]. AI 2005: ADVANCES IN ARTIFICIAL INTELLIGENCE, 2005, 3809 : 1005 - 1008
  • [10] Mining key information of web pages: A method and its application
    Wang, Chao
    Lu, Jie
    Zhang, Guangquan
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2007, 33 (02) : 425 - 433