On extracting link information of relationship instances from a web site

被引:0
|
作者
Naing, MM [1 ]
Lim, EP [1 ]
Goh, DHL [1 ]
机构
[1] Nanyang Technol Univ, Sch Comp Engn, Ctr Adv Informat Syst, Singapore 639798, Singapore
来源
WEB SERVICES -ICWS-EUROPE 2003, PROCEEDINGS | 2003年 / 2853卷
关键词
ontology; information extraction; hyperlink structure;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Web pages from a web site can often be associated with concepts in an ontology, and pairs of web pages can also be associated with relationships between concepts. With such associations, web pages can be searched, browsed or even reorganized based on their concept and relationship labels. In this paper, we investigate the problem of extracting link information of relationship instances from a web site. We define the notion of link chain and formulate the link chain extraction problem. An extraction method based on sequential covering has been proposed to solve the problem. This paper presents the proposed method and the experiments to evaluate its performance. We have applied the method to extract link chain information from the Yahoo! Movie Web Site with very promising results.
引用
收藏
页码:213 / 226
页数:14
相关论文
共 50 条
  • [21] A strategy for extracting information from semi-structured web pages
    Shaker, Mahmoud
    Ibrahim, Hamidah
    Mustapha, Aida
    Abdullah, Lili Nurliyana
    INTERNATIONAL JOURNAL OF WEB INFORMATION SYSTEMS, 2010, 6 (04) : 304 - 318
  • [22] Extracting information from WEB tables based on abstract semantic model
    Gu, N.
    Wu, G.W.
    Wu, X.Y.
    Shi, B.L.
    Ruan Jian Xue Bao/Journal of Software, 2001, 12 (SUPPL.): : 220 - 224
  • [23] The accidental corpus: some issues in extracting linguistic information from the Web
    Renouf, A
    Kehoe, A
    Mezquiriz, D
    ADVANCES IN CORPUS LINGUISTICS, 2004, (49): : 403 - 419
  • [24] A proactive web agent for information browsing and extracting
    Lu, HE
    ADVANCED WEB TECHNOLOGIES AND APPLICATIONS, 2004, 3007 : 879 - 882
  • [25] Characterising web site link structure
    Zhou, Shi
    Cox, Ingemar
    Petricek, Vaclav
    WSE 2007: NINTH IEEE INTERNATIONAL SYMPOSIUM ON WEB SITE EVOLUTION, PROCEEDINGS, 2007, : 73 - +
  • [26] A Hybrid Method for Extracting Deep Web Information
    Zhang, Yuanpeng
    Wang, Li
    Jiang, Kui
    Qian, Danmin
    Dong, Jiancheng
    PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON AUTOMATION, MECHANICAL CONTROL AND COMPUTATIONAL ENGINEERING, 2015, 124 : 777 - 782
  • [27] Extracting, presenting and browsing of web social information
    Wang, Y
    Zhou, LZ
    ADVANCES IN WEB-AGE INFORMATION MANAGEMENT, PROCEEDINGS, 2005, 3739 : 828 - 833
  • [28] Improving the web text content by extracting significant pages into a Web Site
    Ríos, SA
    Velásquez, JD
    Vera, ES
    Yasuda, H
    Aoki, T
    5th International Conference on Intelligent Systems Design and Applications, Proceedings, 2005, : 32 - 36
  • [29] Extracting Event Temporal Information based on Web
    Yuan, Bo
    Chen, Qingcai
    Wang, Xiaolong
    Han, Liwei
    2009 SECOND INTERNATIONAL SYMPOSIUM ON KNOWLEDGE ACQUISITION AND MODELING: KAM 2009, VOL 1, 2009, : 346 - 350
  • [30] Geographic information on the Web: Extracting demographic and market research information
    Linberger, P
    White, GW
    19TH ANNUAL NATIONAL ONLINE MEETING, PROCEEDINGS-1998, 1998, : 235 - 242