ALLRIGHT: Automatic ontology instantiation from tabular web documents

被引:0
|
作者
Shchekotykhin, Kostyantyn [1 ]
Jannach, Dietmar [1 ]
Friedrich, Gerhard [1 ]
Kozeruk, Olga [1 ]
机构
[1] Univ Klagenfurt, Univ Str 65, A-9020 Klagenfurt, Austria
来源
SEMANTIC WEB, PROCEEDINGS | 2007年 / 4825卷
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The process of instantiating an ontology with high-quality and up-to-date instance information manually is both time consuming and prone to error. Automatic ontology instantiation from Web sources is one of the possible solutions to this problem and aims at the computer supported population of an ontology through the exploitation of (redundant) information available on the Web. In this paper we present ALLRIGHT, a comprehensive ontology instantiating system. In particular, the techniques implemented in ALLRIGHT are designed for application scenarios, in which the desired instance information is given in the form of tables and for which existing Information Extraction (IE) approaches based on statistical or natural language processing methods are not directly applicable. Within ALLRIGHT, we have therefore developed new techniques for dealing with tabular instance data and combined these techniques with existing methods. The system supports all necessary steps for ontology instantiation, i.e. web crawling, name extraction, document clustering as well as fact extraction and validation. ALLRIGHT has been successfully evaluated in the popular domains of digital cameras and notebooks leading to a about eighty percent accuracy of the extracted facts given only a very limited amount of seed knowledge.
引用
收藏
页码:466 / +
页数:3
相关论文
共 50 条
  • [21] Ontology creation: Extraction of domain knowledge from web documents
    Storey, VC
    Chiang, R
    Chen, GL
    [J]. CONCEPTUAL MODELING - ER 2005, 2005, 3716 : 256 - 269
  • [22] AUTOMATIC DOMAIN ONTOLOGY GENERATION FROM WEB SITES
    Wong, Tak-Lam
    Lam, Wai
    Chen, Enhong
    [J]. JOURNAL OF INTEGRATED DESIGN & PROCESS SCIENCE, 2005, 9 (03) : 29 - 38
  • [23] Automatic discovery of attribute words from Web documents
    Tokunaga, K
    Kazama, J
    Torisawa, K
    [J]. NATURAL LANGUAGE PROCESSING - IJCNLP 2005, PROCEEDINGS, 2005, 3651 : 106 - 118
  • [24] Ontology-Based Automatic Annotation: An Approach for Efficient Retrieval of Semantic Results of Web Documents
    Tulasi, R. Lakshmi
    Rao, Meda Sreenivasa
    Ankita, K.
    Hgoudar, R.
    [J]. PROCEEDINGS OF THE FIRST INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND INFORMATICS, ICCII 2016, 2017, 507 : 331 - 339
  • [25] Mapping documents onto Web page ontology
    Mladenic, D
    Grobelnik, M
    [J]. WEB MINING: FROM WEB TO SEMANTIC WEB, 2004, 3209 : 77 - 96
  • [26] Instantiation of the multi-viewpoints ontology from a resource
    Djama O.
    Boufaida Z.
    [J]. International Journal of Computers and Applications, 2022, 44 (02) : 154 - 165
  • [27] OPPCAT: Ontology population from tabular data
    Ozturk, Ovunc
    [J]. JOURNAL OF INFORMATION SCIENCE, 2020, 46 (02) : 161 - 175
  • [28] Individualized Automatic Classification of Web Documents
    Tsai, Yihjia
    Chen, Kaun-Yu
    [J]. PROCEEDINGS OF 2010 CROSS-STRAIT CONFERENCE ON INFORMATION SCIENCE AND TECHNOLOGY, 2010, : 410 - 412
  • [29] Automatic genre detection of Web documents
    Lim, CS
    Lee, KJ
    Kim, GC
    [J]. NATURAL LANGUAGE PROCESSING - IJCNLP 2004, 2005, 3248 : 310 - 319
  • [30] WEB2ONTO: Automatic Ontology Construction Approach from Web pages
    Elmesalmy, Naglaa
    Hadhoud, Mayada
    Fayeka, Magda
    [J]. 2019 15TH INTERNATIONAL COMPUTER ENGINEERING CONFERENCE (ICENCO 2019), 2019, : 175 - 182