Extracting patterns and relations from the World Wide Web

Cited by: 0
Authors
Brin, S [1]
Affiliations
[1] Stanford Univ, Dept Comp Sci, Stanford, CA 94305 USA
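Source
WORLD WIDE WEB AND DATABASES | 1999 / Vol. 1590
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract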
The World Wide Web is a vast resource for information. At the same time, it is extremely distributed. A particular type of data, such as restaurant lists, may be scattered across thousands of independent information sources in many different formats. In this paper, we consider the problem of extracting a relation for such a data type from all of these sources automatically. We present a technique which exploits the duality between sets of patterns and relations to grow the target relation starting from a small sample. To test our technique, we use it to extract a relation of (author, title) pairs from the World Wide Web.
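The pattern/relation duality described in the abstract can be illustrated with a minimal sketch. This is not the paper's implementation: the pattern format used here (a literal prefix, middle, and suffix around a title and an author on a single line), the function names `occurrences`, `apply_patterns`, and `grow_relation`, and the toy corpus are all assumptions made for illustration, and the sketch omits any filtering of overly general patterns.

```python
import re


def occurrences(pairs, lines):
    """Find (prefix, middle, suffix) contexts where a known pair fills a whole line.

    Assumption: the title appears before the author on the same line.
    """
    contexts = set()
    for author, title in pairs:
        regex = re.compile(
            r"(.*?)" + re.escape(title) + r"(.+?)" + re.escape(author) + r"(.*)"
        )
        for line in lines:
            match = regex.fullmatch(line)
            if match:
                contexts.add(match.groups())
    return contexts


def apply_patterns(contexts, lines):
    """Use each context as a pattern and extract every (author, title) it matches."""
    found = set()
    for prefix, middle, suffix in contexts:
        regex = re.compile(
            re.escape(prefix) + r"(.+?)" + re.escape(middle) + r"(.+?)" + re.escape(suffix)
        )
        for line in lines:
            match = regex.fullmatch(line)
            if match:
                title, author = match.groups()
                found.add((author, title))
    return found


def grow_relation(seed, lines, rounds=3):
    """Alternate between deriving patterns from pairs and pairs from patterns."""
    relation = set(seed)
    for _ in range(rounds):
        new_pairs = apply_patterns(occurrences(relation, lines), lines)
        if new_pairs <= relation:  # nothing new was found, so stop early
            break
        relation |= new_pairs
    return relation


# Hypothetical toy corpus; real input would be text from crawled pages.
corpus = [
    "<li><b>Foundation</b> by Isaac Asimov</li>",
    "<li><b>Dune</b> by Frank Herbert</li>",
    "Neuromancer, written by William Gibson.",
    "Dune, written by Frank Herbert.",
    "Unrelated line about restaurants",
]
seed = {("Isaac Asimov", "Foundation")}
result = grow_relation(seed, corpus)
```

In this toy run, the seed pair yields one pattern, that pattern finds a second pair, and the second pair exposes a new pattern that reaches a third pair. Each round widens the relation through whatever patterns the current pairs reveal.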
Pages: 172-183
Page count: 12
Related papers
  • [1] Extracting knowledge from the World Wide Web
    Henzinger, M
    Lawrence, S
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2004, 101 : 5186 - 5191
  • [2] Extracting inter-firm networks from world wide web
    Jin, Yingzi
    Matsuo, Yutaka
    Ishizuka, Mitsuru
    [J]. 9TH IEEE INTERNATIONAL CONFERENCE ON E-COMMERCE TECHNOLOGY/4TH IEEE INTERNATIONAL CONFERENCE ON ENTERPRISE COMPUTING, E-COMMERCE AND E-SERVICES, 2007, : 635 - +
  • [3] Self-Adaptive Extracting Academic Entities from World Wide Web
    Yuan, Pingpeng
    Li, Yi
    Jin, Hai
    Liu, Ling
    [J]. 2015 IEEE CONFERENCE ON COLLABORATION AND INTERNET COMPUTING (CIC), 2015, : 270 - 277
  • [4] Extracting World Knowledge from the Web
    Yates, Alexander
    [J]. COMPUTER, 2009, 42 (06) : 94 - 97
  • [5] Mining linguistic browsing patterns in the world wide web
    T.-P. Hong
    K.-Y. Lin
    S.-L. Wang
    [J]. Soft Computing, 2002, 6 (5) : 329 - 336
  • [6] Discovering user access patterns on the World Wide Web
    Cheung, DW
    Kao, B
    Lee, J
    [J]. KNOWLEDGE-BASED SYSTEMS, 1998, 10 (07) : 463 - 470
  • [7] The World Wide Web and active learning in the international relations classroom
    Kuzma, LM
    [J]. PS-POLITICAL SCIENCE & POLITICS, 1998, 31 (03) : 578 - 584
  • [8] Utilizing the World Wide Web as an encyclopedia: Extracting term descriptions from semi-structured texts
    Fujii, A
    Ishikawa, T
    [J]. 38TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE CONFERENCE, 2000, : 488 - 495
  • [9] Using lexical patterns for extracting hyponyms from the web
    Ortega-Mendoza, Rosa M.
    Villasenor-Pineda, Luis
    Montes-Y-Gomez, Manuel
    [J]. MICAI 2007: ADVANCES IN ARTIFICIAL INTELLIGENCE, 2007, 4827 : 904 - +
  • [10] Extracting Usage Patterns from Web Server Log
    Jeba, J. Monisha Privthy
    Bhuvaneswari, M. S.
    Muneeswaran, K.
    [J]. 2016 2ND INTERNATIONAL CONFERENCE ON GREEN HIGH PERFORMANCE COMPUTING (ICGHPC), 2016,