On extracting link information of relationship instances from a web site

被引:0
|
作者
Naing, MM [1 ]
Lim, EP [1 ]
Goh, DHL [1 ]
机构
[1] Nanyang Technol Univ, Sch Comp Engn, Ctr Adv Informat Syst, Singapore 639798, Singapore
来源
WEB SERVICES -ICWS-EUROPE 2003, PROCEEDINGS | 2003年 / 2853卷
关键词
ontology; information extraction; hyperlink structure;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Web pages from a web site can often be associated with concepts in an ontology, and pairs of web pages can also be associated with relationships between concepts. With such associations, web pages can be searched, browsed or even reorganized based on their concept and relationship labels. In this paper, we investigate the problem of extracting link information of relationship instances from a web site. We define the notion of link chain and formulate the link chain extraction problem. An extraction method based on sequential covering has been proposed to solve the problem. This paper presents the proposed method and the experiments to evaluate its performance. We have applied the method to extract link chain information from the Yahoo! Movie Web Site with very promising results.
引用
收藏
页码:213 / 226
页数:14
相关论文
共 50 条
  • [41] Extracting Hidden Information Based on Comparing Web with UGC
    Uchimura, Keisuke
    Nadamoto, Akiyo
    WEB INFORMATION SYSTEMS ENGINEERING - WISE 2010 WORKSHOPS, 2011, 6724 : 365 - 377
  • [42] Extracting Information Seeking Intentions for Web Search Sessions
    Mitsui, Matthew
    Shah, Chirag
    Belkin, Nicholas J.
    SIGIR'16: PROCEEDINGS OF THE 39TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2016, : 841 - 844
  • [43] A web page segmentation algorithm for extracting product information
    Wu, Changjun
    Zeng, Guosun
    Xu, Guorong
    2006 IEEE INTERNATIONAL CONFERENCE ON INFORMATION ACQUISITION, VOLS 1 AND 2, CONFERENCE PROCEEDINGS, 2006, : 1374 - 1379
  • [44] The impact of web site structure on link analysis
    Mandl, Thomas
    INTERNET RESEARCH, 2007, 17 (02) : 196 - 206
  • [46] New fish diseases web site link
    不详
    AUSTRALIAN VETERINARY JOURNAL, 2002, 80 (04) : 182 - 182
  • [47] A hidden Markov model-based approach for extracting information from web news
    Tso, Brandt
    INTERNATIONAL JOURNAL OF WEB INFORMATION SYSTEMS, 2007, 3 (1-2) : 104 - 115
  • [48] A Rule Based DFA Driven Information Extractor for Content Extracting from Web Pages
    Liu, Jin
    Chu, Danliang
    Song, Junjie
    Zhong, Bei
    Cai, Biqi
    INTELLIGENT SYSTEMS AND APPLICATIONS (ICS 2014), 2015, 274 : 482 - 488
  • [49] An open platform for collecting domain specific web pages and extracting information from them
    Karkaletsis, V
    Spyropoulos, CD
    Knowledge Mining, 2005, 185 : 147 - 157
  • [50] Extracting riches from the Web: Web mining/personalization
    Drogan, M
    Hsu, J
    7TH WORLD MULTICONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL XVI, PROCEEDINGS: SYSTEMICS AND INFORMATION SYSTEMS, TECHNOLOGIES AND APPLICATION, 2003, : 214 - 219