ALLRIGHT: Automatic ontology instantiation from tabular web documents

被引：0

作者：

Shchekotykhin, Kostyantyn ^{[1
]}

Jannach, Dietmar ^{[1
]}

Friedrich, Gerhard ^{[1
]}

Kozeruk, Olga ^{[1
]}

机构：

[1] Univ Klagenfurt, Univ Str 65, A-9020 Klagenfurt, Austria

来源：

SEMANTIC WEB, PROCEEDINGS | 2007年 / 4825卷

关键词：

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

The process of instantiating an ontology with high-quality and up-to-date instance information manually is both time consuming and prone to error. Automatic ontology instantiation from Web sources is one of the possible solutions to this problem and aims at the computer supported population of an ontology through the exploitation of (redundant) information available on the Web. In this paper we present ALLRIGHT, a comprehensive ontology instantiating system. In particular, the techniques implemented in ALLRIGHT are designed for application scenarios, in which the desired instance information is given in the form of tables and for which existing Information Extraction (IE) approaches based on statistical or natural language processing methods are not directly applicable. Within ALLRIGHT, we have therefore developed new techniques for dealing with tabular instance data and combined these techniques with existing methods. The system supports all necessary steps for ontology instantiation, i.e. web crawling, name extraction, document clustering as well as fact extraction and validation. ALLRIGHT has been successfully evaluated in the popular domains of digital cameras and notebooks leading to a about eighty percent accuracy of the extracted facts given only a very limited amount of seed knowledge.

引用

页码：466 / +

页数：3

共 50 条

[1] Automated ontology instantiation from tabular web sources-The ALLRIGHT system
Jannach, Dietmar
Shchekotykhin, Kostyantyn
Friedrich, Gerhard
[J]. JOURNAL OF WEB SEMANTICS, 2009, 7 (03): : 136 - 153
[2] Automatic ontology generation from Web tabular structures
Pivk, A
[J]. AI COMMUNICATIONS, 2006, 19 (01) : 83 - 85
[3] Automatic ontology-based knowledge extraction from web documents
Alani, H
Kim, S
Millard, DE
Weal, MJ
Hall, W
Lewis, PH
Shadbolt, NR
[J]. IEEE INTELLIGENT SYSTEMS, 2003, 18 (01) : 14 - 21
[4] An approach of information extraction from web documents for automatic ontology generation
Yeom, KW
Park, JH
[J]. COMPUTATIONAL INTELLIGENCE AND SECURITY, PT 1, PROCEEDINGS, 2005, 3801 : 450 - 457
[5] AUTOMATIC COMPILING OF TABULAR DOCUMENTS
MAMIKONOVA, OA
[J]. AUTOMATION AND REMOTE CONTROL, 1982, 43 (03) : 376 - 380
[6] Ontology-based automatic classification of web documents
Song, MuHee
Lim, SooYeon
Kang, DongJin
Lee, SangJo
[J]. COMPUTATIONAL INTELLIGENCE, PT 2, PROCEEDINGS, 2006, 4114 : 690 - 700
[7] Ontology-based automatic classification and ranking for web documents
Fang, Jun
Guo, Lei
Wang, XiaoDong
Yang, Ning
[J]. FOURTH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, VOL 3, PROCEEDINGS, 2007, : 627 - 631
[8] An automatic approach to classify web documents using a domain ontology
Song, MH
Lim, SY
Park, SB
Kang, DJ
Lee, SJ
[J]. PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PROCEEDINGS, 2005, 3776 : 666 - 671
[9] Relation instantiation for ontology population using the Web
de Boer, Viktor
van Someren, Maarten
Wielinga, Bob J.
[J]. KI 2006: ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2007, 4314 : 202 - +
[10] A Domain Ontology Learning from Web Documents
Djaanfar, Ahmed Said
Frikh, Bouchra
Ouhbi, Brahim
[J]. INTELLIGENT DISTRIBUTED COMPUTING IV, 2010, 315 : 201 - +

← 1 2 3 4 5 →