Ontology-Driven Semantic Analysis of Tabular Data: An Iterative Approach with Advanced Entity Recognition

被引:0
|
作者
Mansurova, Madina [1 ]
Barakhnin, Vladimir [1 ]
Ospan, Assel [1 ]
Titkov, Roman [1 ]
机构
[1] Al Farabi Kazakh Natl Univ, Fac Informat Technol, Dept Artificial Intelligence & Big Data, Alma Ata 050040, Kazakhstan
来源
APPLIED SCIENCES-BASEL | 2023年 / 13卷 / 19期
关键词
semantic analysis; OWL ontology; table interpretation; knowledge triplets; entity classification; Levenshtein distance; TABLES;
D O I
10.3390/app131910918
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
This study focuses on the extraction and semantic analysis of data from tables, emphasizing the importance of understanding the semantics of tables to obtain useful information. The main goal was to develop a technology using the ontology for the semantic analysis of tables. An iterative algorithm has been proposed that can parse the contents of a table and determine cell types based on the ontology. The study presents an automated method for extracting data in various languages in various fields, subject to the availability of an appropriate ontology. Advanced techniques such as cosine distance search and table subject classification based on a neural network have been integrated to increase efficiency. The result is a software application capable of semantically classifying tabular data, facilitating the rapid transition of information from tables to ontologies. Rigorous testing, including 30 tables in the field of water resources and socio-economic indicators of Kazakhstan, confirmed the reliability of the algorithm. The results demonstrate high accuracy with a notable triple extraction recall of 99.4%. The use of Levenshtein distance for matching entities and ontology as a source of information was key to achieving these metrics. The study offers a promising tool for efficiently extracting data from tables.
引用
收藏
页数:24
相关论文
共 50 条
  • [41] An Ontology-Driven Approach Applied to Information Security
    Vorobiev, Artem
    Bekmamedova, Nargiza
    JOURNAL OF RESEARCH AND PRACTICE IN INFORMATION TECHNOLOGY, 2010, 42 (01): : 61 - 76
  • [42] Research on ontology-driven product data management
    Hu, YJ
    Li, SP
    Yin, QW
    DCABES 2002, PROCEEDING, 2002, : 388 - 392
  • [43] Ontology-Driven Semantic Search for Brazilian Portuguese Clinical Notes
    Hasan, Sadid A.
    Zhu, Xianshu
    Liu, Joey
    Barra, Claudia M.
    Oliveira, Lucas
    Farri, Oladimeji
    MEDINFO 2015: EHEALTH-ENABLED HEALTH, 2015, 216 : 1022 - 1022
  • [44] On ontology-driven document clustering using core semantic features
    Fodeh, Samah
    Punch, Bill
    Tan, Pang-Ning
    KNOWLEDGE AND INFORMATION SYSTEMS, 2011, 28 (02) : 395 - 421
  • [45] Ontology-Driven Knowledge Graph Construction in the Mathematics Semantic Library
    Ataeva, O. M.
    Serebryakov, V. A.
    Tuchkova, N. P.
    PATTERN RECOGNITION AND IMAGE ANALYSIS, 2024, 34 (03) : 448 - 455
  • [46] Ontology-driven advanced drug-drug interaction
    Naz, Tabbasum
    Akhtar, Muhammad
    Shahzad, Syed Khuram
    Fasli, Maria
    Iqbal, Muhammad Waseem
    Naqvi, Muhammad Raza
    COMPUTERS & ELECTRICAL ENGINEERING, 2020, 86 (86)
  • [47] Domain-specific requirements analysis framework: ontology-driven approach
    Banerjee S.
    Sarkar A.
    International Journal of Computers and Applications, 2019, 44 (01) : 23 - 47
  • [48] CDAO-Store: Ontology-driven Data Integration for Phylogenetic Analysis
    Brandon Chisham
    Ben Wright
    Trung Le
    Tran Cao Son
    Enrico Pontelli
    BMC Bioinformatics, 12
  • [49] CDAO-Store: Ontology-driven Data Integration for Phylogenetic Analysis
    Chisham, Brandon
    Wright, Ben
    Le, Trung
    Tran Cao Son
    Pontelli, Enrico
    BMC BIOINFORMATICS, 2011, 12
  • [50] Checking the Semantic Correctness of Process Models An Ontology-driven Approach Using Domain Knowledge and Rules
    Fellmann, Michael
    Hogrebe, Frank
    Thomas, Oliver
    Nuttgens, Markus
    ENTERPRISE MODELLING AND INFORMATION SYSTEMS ARCHITECTURES-AN INTERNATIONAL JOURNAL, 2011, 6 (03): : 25 - 35