Harvesting and organizing knowledge from the web

被引:0
|
作者
Weikum, Gerhard [1 ]
机构
[1] Max Planck Inst Informat, Saarbrucken, Germany
关键词
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Information organization and search on the Web is gaining structure and context awareness and more semantic flavor, for example, in the forms of faceted search, vertical search, entity search, and Deep-Web search. I envision another big leap forward by automatically harvesting and organizing knowledge from the Web, represented in terms of explicit entities and relations as well as ontological concepts. This will be made possible by the confluence of three strong trends: 1) rich Semantic-Web-style knowledge repositories like ontologies and taxonomies, 2) large-scale information extraction from high-quality text sources such as Wikipedia, and 3) social tagging in the spirit of Web 2.0. I refer to the three directions as Semantic Web, Statistical Web, and Social Web (at the risk of some oversimplification), and 1 briefly characterize each of them.
引用
收藏
页码:12 / 13
页数:2
相关论文
共 50 条
  • [1] WebChild: Harvesting and Organizing Commonsense Knowledge from the Web
    Tandon, Niket
    de Melo, Gerard
    Suchanek, Fabian
    Weikum, Gerhard
    [J]. WSDM'14: PROCEEDINGS OF THE 7TH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, 2014, : 523 - 532
  • [2] Knowledge Harvesting from Text and Web Sources
    Suchanek, Fabian
    Weikum, Gerhard
    [J]. 2013 IEEE 29TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2013, : 1250 - 1253
  • [3] Organizing knowledge in a Semantic Web for pathology
    Tolksdorf, R
    Bontas, EP
    [J]. OBJECT-ORIENTED AND INTERNET-BASED TECHNOLOGIES, PROCEEDINGS, 2004, 3263 : 39 - 54
  • [4] From Information to Knowledge: Harvesting Entities and Relationships from Web Sources
    Weikum, Gerhard
    Theobald, Martin
    [J]. PODS 2010: PROCEEDINGS OF THE TWENTY-NINTH ACM SIGMOD-SIGACT-SIGART SYMPOSIUM ON PRINCIPLES OF DATABASE SYSTEMS, 2010, : 65 - 76
  • [5] Ceres: Harvesting Knowledge from the Semi-structured Web
    Dong, Xin Luna
    [J]. CIKM '20: PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, 2020, : 1 - 1
  • [6] Knowledge Beacons: Web services for data harvesting of distributed biomedical knowledge
    Hannestad, Lance M.
    Dancik, Vlado
    Godden, Meera
    Suen, Imelda W.
    Huellas-Bruskiewicz, Kenneth C.
    Good, Benjamin M.
    Mungall, Christopher J.
    Bruskiewich, Richard M.
    [J]. PLOS ONE, 2021, 16 (03):
  • [7] Organizing information from the shelf to the web
    Tran, Lan Anh
    [J]. LIBRARY COLLECTIONS ACQUISITIONS & TECHNICAL SERVICES, 2007, 31 (3-4): : 233 - 234
  • [8] Organizing information from the shelf to the web
    Rodriguez, Sandy
    [J]. LIBRARY RESOURCES & TECHNICAL SERVICES, 2008, 52 (03): : 213 - 215
  • [9] Organizing information: from the shelf to the web
    Smith, Nicola
    [J]. PROGRAM-ELECTRONIC LIBRARY AND INFORMATION SYSTEMS, 2008, 42 (02) : 187 - 188
  • [10] Harvesting knowledge from improvement
    Berwick, DM
    [J]. JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION, 1996, 275 (11): : 877 - 878