On extracting link information of relationship instances from a web site

被引:0
|
作者
Naing, MM [1 ]
Lim, EP [1 ]
Goh, DHL [1 ]
机构
[1] Nanyang Technol Univ, Sch Comp Engn, Ctr Adv Informat Syst, Singapore 639798, Singapore
来源
WEB SERVICES -ICWS-EUROPE 2003, PROCEEDINGS | 2003年 / 2853卷
关键词
ontology; information extraction; hyperlink structure;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Web pages from a web site can often be associated with concepts in an ontology, and pairs of web pages can also be associated with relationships between concepts. With such associations, web pages can be searched, browsed or even reorganized based on their concept and relationship labels. In this paper, we investigate the problem of extracting link information of relationship instances from a web site. We define the notion of link chain and formulate the link chain extraction problem. An extraction method based on sequential covering has been proposed to solve the problem. This paper presents the proposed method and the experiments to evaluate its performance. We have applied the method to extract link chain information from the Yahoo! Movie Web Site with very promising results.
引用
收藏
页码:213 / 226
页数:14
相关论文
共 50 条
  • [1] Extracting link chains of relationship instances from a Web site
    Naing, Myo-Myo
    Lim, Ee-Peng
    Chiang, Roger H. L.
    JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2006, 57 (12): : 1590 - 1605
  • [2] Extracting instances of relations from Web documents using redundancy
    de Boer, Viktor
    van Someren, Maarten
    Wielinga, Bob J.
    SEMANTIC WEB: RESEARCH AND APPLICATIONS, PROCEEDINGS, 2006, 4011 : 245 - 258
  • [3] Extracting Company Information from the Web
    Lam, Man I.
    Gong, Zhiguo
    Guo, Jingzhi
    2009 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC 2009), VOLS 1-9, 2009, : 3640 - 3645
  • [4] Extracting table information from the Web
    Kim, YS
    Lee, KH
    DOCUMENT ANALYSIS SYSTEMS VI, PROCEEDINGS, 2004, 3163 : 438 - 441
  • [5] Extracting semistructured information from Web
    Huang, Yu-Qing
    Qi, Guang-Zhi
    Zhang, Fu-Yan
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design & Computer Graphics, 2000, 12 (03): : 230 - 234
  • [6] Extracting macroscopic information from Web links
    Thelwall, M
    JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2001, 52 (13): : 1157 - 1168
  • [7] Jedi: Extracting and synthesizing information from the Web
    Huck, G
    Fankhauser, P
    Aberer, K
    Neuhold, E
    3RD IFCIS INTERNATIONAL CONFERENCE ON COOPERATIVE INFORMATION SYSTEMS - PROCEEDINGS, 1998, : 32 - 41
  • [8] Extracting Topic Maps from Web Pages by Web Link Structure and Content
    Mase, Motohiro
    Yamada, Seiji
    Nitta, Katsumi
    2008 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION, VOLS 1-8, 2008, : 1232 - +
  • [10] Tuning up FOIL for extracting information from the web
    Palacios, Pablo
    Fernandez de Viana, Inaki
    INTERNATIONAL JOURNAL OF COMPUTER APPLICATIONS IN TECHNOLOGY, 2008, 33 (04) : 280 - 284