ALLRIGHT: Automatic ontology instantiation from tabular web documents

Cited by: 0
Authors
Shchekotykhin, Kostyantyn [1]
Jannach, Dietmar [1]
Friedrich, Gerhard [1]
Kozeruk, Olga [1]
Affiliation
[1] Univ Klagenfurt, Univ Str 65, A-9020 Klagenfurt, Austria
Source
SEMANTIC WEB, PROCEEDINGS | 2007 / Vol. 4825
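Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract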
Manually instantiating an ontology with high-quality, up-to-date instance information is both time-consuming and error-prone. Automatic ontology instantiation from Web sources is one possible solution to this problem; it aims at the computer-supported population of an ontology by exploiting (redundant) information available on the Web. In this paper we present ALLRIGHT, a comprehensive ontology instantiation system. In particular, the techniques implemented in ALLRIGHT are designed for application scenarios in which the desired instance information is given in the form of tables and for which existing Information Extraction (IE) approaches based on statistical or natural language processing methods are not directly applicable. Within ALLRIGHT, we have therefore developed new techniques for dealing with tabular instance data and combined them with existing methods. The system supports all necessary steps of ontology instantiation, i.e. web crawling, name extraction, document clustering, and fact extraction and validation. ALLRIGHT has been successfully evaluated in the popular domains of digital cameras and notebooks, achieving an accuracy of about eighty percent for the extracted facts given only a very limited amount of seed knowledge.
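To make the notion of "tabular instance data" concrete, the following is a minimal sketch of reading attribute/value pairs from a two-column HTML specification table. It is not the ALLRIGHT method, which the abstract does not detail; the class name `TwoColumnTableParser`, the helper `extract_pairs`, and the sample camera table are all hypothetical and chosen only for illustration. It uses Python's standard `html.parser` module.

```python
from html.parser import HTMLParser


class TwoColumnTableParser(HTMLParser):
    """Hypothetical helper: collect (attribute, value) pairs from two-cell rows."""

    def __init__(self):
        super().__init__()
        self.pairs = []     # extracted (attribute, value) tuples
        self._row = None    # cell texts of the row being read, or None
        self._cell = None   # text fragments of the cell being read, or None

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._row = []
        elif tag in ("td", "th") and self._row is not None:
            self._cell = []

    def handle_data(self, data):
        # Only text inside an open cell is kept.
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag in ("td", "th") and self._cell is not None and self._row is not None:
            # Join fragments and normalize whitespace.
            self._row.append(" ".join("".join(self._cell).split()))
            self._cell = None
        elif tag == "tr" and self._row is not None:
            # Keep only rows with exactly two non-empty cells.
            if len(self._row) == 2 and all(self._row):
                self.pairs.append(tuple(self._row))
            self._row = None
            self._cell = None


def extract_pairs(html):
    """Return the (attribute, value) pairs found in the given HTML string."""
    parser = TwoColumnTableParser()
    parser.feed(html)
    parser.close()
    return parser.pairs


# Invented sample input, not taken from the paper's evaluation data.
SAMPLE = """
<table>
  <tr><th>Resolution</th><td>10.1 megapixels</td></tr>
  <tr><th>Optical zoom</th><td>3x</td></tr>
  <tr><td colspan="2">Specifications</td></tr>
</table>
"""

pairs = extract_pairs(SAMPLE)
```

Rows that do not have exactly two non-empty cells, such as the header row spanning both columns, are skipped. A real system would additionally need to map the extracted attribute names to ontology properties and validate the values, which this sketch does not attempt.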
Pages: 466 / +
Page count: 3
Related papers
50 in total
  • [1] Automated ontology instantiation from tabular web sources-The ALLRIGHT system
    Jannach, Dietmar
    Shchekotykhin, Kostyantyn
    Friedrich, Gerhard
    [J]. JOURNAL OF WEB SEMANTICS, 2009, 7 (03) : 136 - 153
  • [2] Automatic ontology generation from Web tabular structures
    Pivk, A
    [J]. AI COMMUNICATIONS, 2006, 19 (01) : 83 - 85
  • [3] Automatic ontology-based knowledge extraction from web documents
    Alani, H
    Kim, S
    Millard, DE
    Weal, MJ
    Hall, W
    Lewis, PH
    Shadbolt, NR
    [J]. IEEE INTELLIGENT SYSTEMS, 2003, 18 (01) : 14 - 21
  • [4] An approach of information extraction from web documents for automatic ontology generation
    Yeom, KW
    Park, JH
    [J]. COMPUTATIONAL INTELLIGENCE AND SECURITY, PT 1, PROCEEDINGS, 2005, 3801 : 450 - 457
  • [5] AUTOMATIC COMPILING OF TABULAR DOCUMENTS
    MAMIKONOVA, OA
    [J]. AUTOMATION AND REMOTE CONTROL, 1982, 43 (03) : 376 - 380
  • [6] Ontology-based automatic classification of web documents
    Song, MuHee
    Lim, SooYeon
    Kang, DongJin
    Lee, SangJo
    [J]. COMPUTATIONAL INTELLIGENCE, PT 2, PROCEEDINGS, 2006, 4114 : 690 - 700
  • [7] Ontology-based automatic classification and ranking for web documents
    Fang, Jun
    Guo, Lei
    Wang, XiaoDong
    Yang, Ning
    [J]. FOURTH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, VOL 3, PROCEEDINGS, 2007, : 627 - 631
  • [8] An automatic approach to classify web documents using a domain ontology
    Song, MH
    Lim, SY
    Park, SB
    Kang, DJ
    Lee, SJ
    [J]. PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PROCEEDINGS, 2005, 3776 : 666 - 671
  • [9] Relation instantiation for ontology population using the Web
    de Boer, Viktor
    van Someren, Maarten
    Wielinga, Bob J.
    [J]. KI 2006: ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2007, 4314 : 202 - +
  • [10] A Domain Ontology Learning from Web Documents
    Djaanfar, Ahmed Said
    Frikh, Bouchra
    Ouhbi, Brahim
    [J]. INTELLIGENT DISTRIBUTED COMPUTING IV, 2010, 315 : 201 - +