Ontology-based information extraction: An introduction and a survey of current approaches

被引:163
|
作者
Wimalasuriya, Daya C. [1 ]
Dou, Dejing [1 ]
机构
[1] Univ Oregon, Dept Comp & Informat Sci, Eugene, OR 97403 USA
关键词
information extraction; ontologies; Semantic Web;
D O I
10.1177/0165551509360123
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Information extraction (IE) aims to retrieve certain types of information from natural language text by processing them automatically. For example, an IE system might retrieve information about geopolitical indicators of countries from a set of web pages while ignoring other types of information. Ontology-based information extraction (OBIE) has recently emerged as a subfield of information extraction. Here, ontologies - which provide formal and explicit specifications of conceptualizations - play a crucial role in the IE process. Because of the use of ontologies, this field is related to knowledge representation and has the potential to assist the development of the Semantic Web. In this paper, we provide an introduction to ontology-based information extraction and review the details of different OBIE systems developed so far. We attempt to identify a common architecture among these systems and classify them based on different factors, which leads to a better understanding on their operation. We also discuss the implementation details of these systems including the tools used by them and the metrics used to measure their performance. In addition, we attempt to identify the possible future directions for this field.
引用
收藏
页码:306 / 323
页数:18
相关论文
共 50 条
  • [21] Ontology-based information extraction from the World Wide Web
    Korst, Jan
    Geleijnse, Gijs
    de Jong, Nick
    Verschoor, Michael
    [J]. INTELLIGENT ALGORITHMS IN AMBIENT AND BIOMEDICAL COMPUTING, 2006, 7 : 149 - +
  • [22] Ontology-based approach to enhance medical web information extraction
    Otmani, Nassim Abdeldjallal
    Si-Mohammed, Malik
    Comparot, Catherine
    Charrel, Pierre-Jean
    [J]. INTERNATIONAL JOURNAL OF WEB INFORMATION SYSTEMS, 2019, 15 (03) : 359 - 382
  • [23] Ontology-based interactive information extraction from scientific abstracts
    Milward, D
    Bjäreland, M
    Hayes, W
    Maxwell, M
    Öberg, L
    Tilford, N
    Thomas, J
    Hale, R
    Knight, S
    Barnes, JE
    [J]. COMPARATIVE AND FUNCTIONAL GENOMICS, 2005, 6 (1-2): : 67 - 71
  • [24] ONTOLOGY-BASED INFORMATION EXTRACTION FROM PDF DOCUMENTS WITH XONTO
    Oro, Ermelinda
    Ruffolo, Massimo
    Sacca, Domenico
    [J]. INTERNATIONAL JOURNAL ON ARTIFICIAL INTELLIGENCE TOOLS, 2009, 18 (05) : 673 - 695
  • [25] Hybrid Ontology-based Information Extraction for Automated Text Grading
    Gutierrez, Fernando
    Dou, Dejing
    Martini, Adam
    Fickas, Stephen
    Zong, Hui
    [J]. 2013 12TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2013), VOL 1, 2013, : 359 - 364
  • [26] Research on a Combined Ontology-based Text Information Extraction Technology
    Gong Yiguang
    Mei Ping
    [J]. 2011 INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND MULTIMEDIA COMMUNICATION, 2011, : 115 - 118
  • [27] Ontology-Based Traffic Accident Information Extraction on Twitter In Indonesia
    Rakhmawati, Nur Aini
    Awwab, Yasin
    Najib, Ahmad Choirun
    Irsyad, Ahmad
    [J]. INTELIGENCIA ARTIFICIAL-IBEROAMERICAL JOURNAL OF ARTIFICIAL INTELLIGENCE, 2022, 25 (70): : 1 - 12
  • [28] Towards Knowledge Handling in Ontology-Based Information Extraction Systems
    Konys, Agnieszka
    [J]. KNOWLEDGE-BASED AND INTELLIGENT INFORMATION & ENGINEERING SYSTEMS (KES-2018), 2018, 126 : 2208 - 2218
  • [29] WebOMSIE: An Ontology-Based Multi Source Web Information Extraction
    Younsi, Zineb
    Quafafou, Mohamed
    Ouzegane, Redouane
    Tari, Abdelkamel
    [J]. NEW TRENDS IN DATABASES AND INFORMATION SYSTEMS, 2013, 185 : 199 - +
  • [30] An ontology-based information extraction and summarization of multiple news articles
    Venkatachalam S.
    Subbiah L.P.
    Rajendiran R.
    Venkatachalam N.
    [J]. International Journal of Information Technology, 2020, 12 (2) : 547 - 557