Ontology-Driven Semantic Analysis of Tabular Data: An Iterative Approach with Advanced Entity Recognition

Authors
Mansurova, Madina [1]
Barakhnin, Vladimir [1]
Ospan, Assel [1]
Titkov, Roman [1]
Affiliations
[1] Al Farabi Kazakh Natl Univ, Fac Informat Technol, Dept Artificial Intelligence & Big Data, Alma Ata 050040, Kazakhstan
Source
APPLIED SCIENCES-BASEL | 2023, Vol. 13, Issue 19
Keywords
semantic analysis; OWL ontology; table interpretation; knowledge triplets; entity classification; Levenshtein distance; tables
DOI
10.3390/app131910918
Chinese Library Classification
O6 [Chemistry]
Discipline classification code
0703
Abstract
This study addresses the extraction and semantic analysis of tabular data, on the premise that useful information can be obtained from a table only when its semantics are understood. The main goal was to develop an ontology-based technology for the semantic analysis of tables. An iterative algorithm is proposed that parses the contents of a table and determines cell types based on the ontology. The method automates data extraction across languages and subject areas, provided that an appropriate ontology is available. To increase efficiency, it integrates cosine distance search and neural-network-based classification of the table subject. The result is a software application that semantically classifies tabular data and facilitates the rapid transfer of information from tables to ontologies. Testing on 30 tables covering water resources and socio-economic indicators of Kazakhstan confirmed the reliability of the algorithm. The results demonstrate high accuracy, with a triple extraction recall of 99.4%. Using Levenshtein distance to match entities, and the ontology as a source of information, was key to achieving these metrics. The study offers a promising tool for efficiently extracting data from tables.
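The abstract names Levenshtein distance as the technique used to match table entities against the ontology. The sketch below illustrates that general idea only and is not the authors' implementation: the function names `levenshtein` and `match_cell`, the `max_ratio` threshold, and the sample labels are all assumptions introduced here for illustration.

```python
from typing import Iterable, Optional


def levenshtein(a: str, b: str) -> int:
    """Classic dynamic-programming edit distance (insert, delete, substitute)."""
    if len(a) < len(b):
        a, b = b, a  # keep the shorter string in the inner loop
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # deletion
                current[j - 1] + 1,      # insertion
                previous[j - 1] + cost,  # substitution or match
            ))
        previous = current
    return previous[-1]


def match_cell(cell: str, labels: Iterable[str],
               max_ratio: float = 0.25) -> Optional[str]:
    """Return the closest ontology label, or None if nothing is close enough.

    `max_ratio` is a hypothetical tolerance: the allowed edit distance as a
    fraction of the longer of the two strings being compared.
    """
    text = cell.strip().lower()
    best_label, best_dist = None, None
    for label in labels:
        dist = levenshtein(text, label.lower())
        if best_dist is None or dist < best_dist:
            best_label, best_dist = label, dist
    if best_label is None:
        return None
    limit = max_ratio * max(len(text), len(best_label))
    return best_label if best_dist <= limit else None


# Hypothetical ontology labels, loosely themed on the abstract's test domain.
labels = ["Population", "Water level", "Region"]
print(match_cell("Populaton", labels))  # misspelled header cell
print(match_cell("xyz", labels))        # no plausible match
```

A relative threshold is used here so that short labels are not matched by strings that differ in most of their characters; a real system would tune this value against labelled tables.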
Pages: 24