Algorithm for the Accelerated Calculation of Conceptual Distances in Large Knowledge Graphs

被引:2
|
作者
Quintero, Rolando [1 ]
Mendiola, Esteban [1 ]
Guzman, Giovanni [1 ]
Torres-Ruiz, Miguel [1 ]
Sanchez-Mejorada, Carlos Guzman
机构
[1] Ctr Invest Comp C, Ctr Invest Comp CIC, Unidad Profes Adolfo Lopez Mateos UPALM Zacatenco, Mexico City 07320, Mexico
关键词
conceptual distance; shortest path algorithms; accelerated calculation; computational complexity; PAIRS SHORTEST PATHS; COLONY OPTIMIZATION ALGORITHM; RUNNING TIME ANALYSIS; SEMANTIC SIMILARITY; INFORMATION-CONTENT;
D O I
10.3390/math11234806
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
Conceptual distance refers to the degree of proximity between two concepts within a conceptualization. It is closely related to semantic similarity and relationships, but its measurement strongly depends on the context of the given concepts. DIS-C represents an advancement in the computation of semantic similarity/relationships that is independent of the type of knowledge structure and semantic relations when generating a graph from a knowledge base (ontologies, semantic networks, and hierarchies, among others). This approach determines the semantic similarity between two indirectly connected concepts in an ontology by propagating local distances by applying an algorithm based on the All Pairs Shortest Path (APSP) problem. This process is implemented for each pair of concepts to establish the most effective and efficient paths to connect these concepts. The algorithm identifies the shortest path between concepts, which allows for an inference of the most relevant relationships between them. However, one of the critical issues with this process is computational complexity, combined with the design of APSP algorithms, such as Dijkstra, which is O(n(3)). This paper studies different alternatives to improve the DIS-C approach by adapting approximation algorithms, focusing on Dijkstra, pruned Dijkstra, and sketch-based methods, to compute the conceptual distance according to the need to scale DIS-C to analyze very large graphs; therefore, reducing the related computational complexity is critical. Tests were performed using different datasets to calculate the conceptual distance when using the original version of DIS-C and when using the influence area of nodes. In situations where time optimization is necessary for generating results, using the original DIS-C model is not the optimal method. Therefore, we propose a simplified version of DIS-C to calculate conceptual distances based on centrality estimation. The obtained results for the simple version of DIS-C indicated that the processing time decreased 2.381 times when compared to the original DIS-C version. Additionally, for both versions of DIS-C (normal and simple), the APSP algorithm decreased the computational cost when using a two-hop coverage-based approach.
引用
收藏
页数:30
相关论文
共 50 条
  • [1] The Visualization of Large Graphs Accelerated by the Parallel Nearest Neighbors Algorithm
    Uher, Vojtech
    Gajdos, Petr
    Snasel, Vaclav
    2016 IEEE SECOND INTERNATIONAL CONFERENCE ON MULTIMEDIA BIG DATA (BIGMM), 2016, : 9 - 16
  • [2] Estimating Pairwise Distances in Large Graphs
    Christoforaki, Maria
    Suel, Torsten
    2014 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2014, : 335 - 344
  • [3] Conceptual graphs for corporate knowledge repositories
    Gerbe, O
    CONCEPTUAL STRUCTURES: FULFILLING PEIRCE'S DREAM, 1997, 1257 : 474 - 488
  • [4] CONCEPTUAL GRAPHS FOR SEMANTICS AND KNOWLEDGE PROCESSING
    FARGUES, J
    LANDAU, MC
    DUGOURD, A
    CATACH, L
    IBM JOURNAL OF RESEARCH AND DEVELOPMENT, 1986, 30 (01) : 70 - 79
  • [5] REPRESENTING TEMPORAL KNOWLEDGE IN CONCEPTUAL GRAPHS
    MOULIN, B
    COTE, D
    KNOWLEDGE-BASED SYSTEMS, 1991, 4 (04) : 197 - 208
  • [6] CONCEPTUAL GRAPHS AS A UNIVERSAL KNOWLEDGE REPRESENTATION
    SOWA, JF
    COMPUTERS & MATHEMATICS WITH APPLICATIONS, 1992, 23 (2-5) : 75 - 93
  • [7] Extended Knowledge Graphs: A Conceptual Study
    Adrian, Weronika T.
    Adrian, Marek
    Kluza, Krzysztof
    Stachura-Terlecka, Bernadetta
    Ligeza, Antoni
    PROCEEDINGS OF THE 12TH INTERNATIONAL JOINT CONFERENCE ON KNOWLEDGE DISCOVERY, KNOWLEDGE ENGINEERING AND KNOWLEDGE MANAGEMENT (KEOD), VOL 2, 2020, : 173 - 180
  • [8] Composition norm dynamics calculation with conceptual graphs
    de Moor, A
    CONCEPTUAL STRUCTURES: LOGICAL, LINGUISTIC, AND COMPUTATIONAL ISSUES, PROCEEDINGS, 2000, 1867 : 525 - 539
  • [9] Organizing conceptual graphs for fast knowledge retrieval
    Ounis, I
    TENTH IEEE INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 1998, : 120 - 129
  • [10] ALGORITHM TO COMPUTE DISTANCES IN FINITE, DIRECTED AND NONVALUED GRAPHS
    BAUERSFELD, G
    ESSMANN, C
    LOHLE, H
    COMPUTING, 1972, 10 (1-2) : 107 - 109