Empirical study of knowledge network based on complex network theory

被引:4
|
作者
Ding Lian-Hong [1 ]
Sun Bin [1 ]
Shi Peng [2 ]
机构
[1] Beijing Wuxi Univ, Sch Informat, Beijing 101149, Peoples R China
[2] Univ Sci & Technol Beijing, Natl Ctr Mat Serv Safety, Beijing 100083, Peoples R China
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
microsoft concept graph; complex network; subnet extraction; approximate average path; INFLUENTIAL SPREADERS; EVOLUTION;
D O I
10.7498/aps.68.20190106
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
Knowledge graph is a hot topic in artificial intelligence area and has been widely adopted in intelligent search and question-and-answer system. Knowledge graph can be regarded as a complex network system and analyzed by complex network theory, which studies the interaction or relationship between various factors and basic characteristics of complex system. Its characteristics and their physical meanings are very helpful in understanding the nature of the knowledge graph. Concept graph is a large-scaled knowledge graph published by Microsoft. In this paper, we construct a huge complex network according to Microsoft's concept graph. Its complex network characteristics, such as degree distribution, average shortest distance, clustering coefficient and degree correlation, are calculated and analyzed. The concept graph is not a connected network and its scale is very large; an approach is proposed to extract its largest connected subnet. The method has obvious advantages in both time complexity and space complexity. In this paper, we also present a method of calculating the approximate average shortest path of the largest connected subnet. The method estimates the maximum and minimum value of the shortest distance between nodes according to the distance between the central node and the network layer that the node belongs to and the distance between different layers. In order to calculate the clustering coefficient, different methods are introduced for nodes with different degree values and Map/Reduce idea is adopted to reduce the time cost. The experimental results show that the largest subnet of the concept graph is an ultra-small world network with the characteristics of scale-free. The average shortest path length decreases towards 4 with the network size increasing, which can be easily explained by the diamond-shaped network structure. The concept graph is a disassortative network where low degree nodes tend to connect to high degree nodes. The subConcepts account for 99.5% of nodes in the innermost k-core after k-shell decomposition. It shows that the subConcepts play an important role in the connectivity of network. The absence of subConcept affects the complexness of concept graph most, the concept next, and the instance least. The 82% instance nodes and 40% concept nodes of the concept graph each have a degree value of 1. It is believed that compared with the concept words, the instance words do not lead to the ambiguity in the understanding of natural language, caused by polysemy.
引用
收藏
页数:15
相关论文
共 34 条
  • [1] Aiello W., 2000, Proceedings of the Thirty Second Annual ACM Symposium on Theory of Computing, P171, DOI 10.1145/335305.335326
  • [2] Random evolution in massive graphs
    Aiello, W
    Chung, F
    Lu, LY
    [J]. 42ND ANNUAL SYMPOSIUM ON FOUNDATIONS OF COMPUTER SCIENCE, PROCEEDINGS, 2001, : 510 - 519
  • [3] Internet -: Diameter of the World-Wide Web
    Albert, R
    Jeong, H
    Barabási, AL
    [J]. NATURE, 1999, 401 (6749) : 130 - 131
  • [4] Error and attack tolerance of complex networks
    Albert, R
    Jeong, H
    Barabási, AL
    [J]. NATURE, 2000, 406 (6794) : 378 - 382
  • [5] A GDP-driven model for the binary and weighted structure of the International Trade Network
    Almog, Assaf
    Squartini, Tiziano
    Garlaschellii, Diego
    [J]. NEW JOURNAL OF PHYSICS, 2015, 17
  • [6] Features and evolution of international crude oil trade relationships: A trading-based network analysis
    An, Haizhong
    Zhong, Weiqiong
    Chen, Yurong
    Li, Huajiao
    Gao, Xiangyun
    [J]. ENERGY, 2014, 74 : 254 - 259
  • [7] Emergence of scaling in random networks
    Barabási, AL
    Albert, R
    [J]. SCIENCE, 1999, 286 (5439) : 509 - 512
  • [8] Graph structure in the Web
    Broder, A
    Kumar, R
    Maghoul, F
    Raghavan, P
    Rajagopalan, S
    Stata, R
    Tomkins, A
    Wiener, J
    [J]. COMPUTER NETWORKS-THE INTERNATIONAL JOURNAL OF COMPUTER AND TELECOMMUNICATIONS NETWORKING, 2000, 33 (1-6): : 309 - 320
  • [9] Control of a City Road Network: Distributed Exact Verification of Traffic Safety
    Colombo, Alessandro
    de Campos, Gabriel Rodrigues
    Della Rossa, Fabio
    [J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2017, 62 (10) : 4933 - 4948
  • [10] Partial Mutual Information Analysis of Financial Networks
    Fiedor, P.
    [J]. ACTA PHYSICA POLONICA A, 2015, 127 (03) : 863 - 867