ALLRIGHT: Automatic ontology instantiation from tabular web documents

被引:0
|
作者
Shchekotykhin, Kostyantyn [1 ]
Jannach, Dietmar [1 ]
Friedrich, Gerhard [1 ]
Kozeruk, Olga [1 ]
机构
[1] Univ Klagenfurt, Univ Str 65, A-9020 Klagenfurt, Austria
来源
SEMANTIC WEB, PROCEEDINGS | 2007年 / 4825卷
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The process of instantiating an ontology with high-quality and up-to-date instance information manually is both time consuming and prone to error. Automatic ontology instantiation from Web sources is one of the possible solutions to this problem and aims at the computer supported population of an ontology through the exploitation of (redundant) information available on the Web. In this paper we present ALLRIGHT, a comprehensive ontology instantiating system. In particular, the techniques implemented in ALLRIGHT are designed for application scenarios, in which the desired instance information is given in the form of tables and for which existing Information Extraction (IE) approaches based on statistical or natural language processing methods are not directly applicable. Within ALLRIGHT, we have therefore developed new techniques for dealing with tabular instance data and combined these techniques with existing methods. The system supports all necessary steps for ontology instantiation, i.e. web crawling, name extraction, document clustering as well as fact extraction and validation. ALLRIGHT has been successfully evaluated in the popular domains of digital cameras and notebooks leading to a about eighty percent accuracy of the extracted facts given only a very limited amount of seed knowledge.
引用
收藏
页码:466 / +
页数:3
相关论文
共 50 条
  • [41] An Automatic Ontology Population with a Machine Learning Technique from Semi-Structured Documents
    Song, Hyun-Je
    Park, Seong-Bae
    Park, Se-Young
    [J]. ICIA: 2009 INTERNATIONAL CONFERENCE ON INFORMATION AND AUTOMATION, VOLS 1-3, 2009, : 519 - 524
  • [42] Automatic Building of an Ontology from a Corpus of Text Documents Using Data Mining Tools
    Toledo-Alvarado, J. I.
    Guzman-Arenas, A.
    Martinez-Luna, G. L.
    [J]. JOURNAL OF APPLIED RESEARCH AND TECHNOLOGY, 2012, 10 (03) : 398 - 404
  • [43] Automatic Stuff Relation Extraction from Scientific Documents for Natural Product Ontology Construction
    Lertsakunsomboon, Suriyasak
    Pechsiri, Chaveevan
    [J]. 2012 7TH INTERNATIONAL CONFERENCE ON COMPUTING AND CONVERGENCE TECHNOLOGY (ICCCT2012), 2012, : 273 - 277
  • [44] FROM FIAT TO WEB. WHAT A SOCIAL ONTOLOGY BASED ON DOCUMENTS PERMITS TO EXPLAIN
    Casetta, Elena
    Torrengo, Giuliano
    [J]. RIVISTA DI ESTETICA, 2015, (60) : 54 - 62
  • [45] Learning non-taxonomic relationships from web documents for domain ontology construction
    Sanchez, David
    Moreno, Antonio
    [J]. DATA & KNOWLEDGE ENGINEERING, 2008, 64 (03) : 600 - 623
  • [46] Structure recognition and information extraction from tabular documents
    Chandran, S
    Balasubramanian, S
    Gandhi, T
    Prasad, A
    Kasturi, R
    Chhabra, A
    [J]. INTERNATIONAL JOURNAL OF IMAGING SYSTEMS AND TECHNOLOGY, 1996, 7 (04) : 289 - 303
  • [47] Topic selection of web documents using specific domain ontology
    Kong, Hyunjang
    Hwang, Myunggwon
    Hwang, Gwangsu
    Shim, Jaehong
    Kim, Pankoo
    [J]. MICAI 2006: ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2006, 4293 : 1047 - +
  • [48] Rank web documents based on multi-domain ontology
    Liu J.
    Zhou M.
    Lin L.
    Kim H.-J.
    Wang J.
    [J]. J. Ambient Intell. Humanized Comput., 2 (1573-1582): : 1573 - 1582
  • [49] Ontology based semantic annotation of Urdu language web documents
    Rajput, Quratulain
    [J]. KNOWLEDGE-BASED AND INTELLIGENT INFORMATION & ENGINEERING SYSTEMS 18TH ANNUAL CONFERENCE, KES-2014, 2014, 35 : 662 - 670
  • [50] Tailoring dynamic ontology-driven web documents by demonstration
    Macías, JA
    Castells, P
    [J]. SIXTH INTERNATIONAL CONFERENCE ON INFORMATION VISUALISATION, PROCEEDINGS, 2002, : 535 - 540