Self-emergence of knowledge trees: Extraction of the Wikipedia hierarchies

Cited by: 44
Authors
Muchnik, Lev
Itzhack, Royi
Solomon, Sorin
Louzoun, Yoram [1]
Affiliations
[1] Bar Ilan Univ, Dept Phys, IL-52900 Ramat Gan, Israel
[2] Bar Ilan Univ, Dept Math, IL-52900 Ramat Gan, Israel
[3] Hebrew Univ Jerusalem, Racah Inst Phys, Jerusalem, Israel
[4] ISI, I-10133 Turin, Italy
Source
PHYSICAL REVIEW E | 2007 / Vol. 76 / Issue 01
Keywords
INTERNET; DYNAMICS;
DOI
10.1103/PhysRevE.76.016106
Chinese Library Classification (CLC) number
O35 [Fluid mechanics]; O53 [Plasma physics];
Discipline classification code
070204; 080103; 080704;
Abstract
The rapid accumulation of knowledge and the recent emergence of new dynamic and practically unmoderated information repositories have rendered the classical concept of the hierarchical knowledge structure irrelevant and impossible to impose manually. This led to modern methods of data location, such as browsing or searching, which conceal the underlying information structure. We here propose methods designed to automatically construct a hierarchy from a network of related terms. We apply these methods to Wikipedia and compare the hierarchy obtained from the article network to the complementary acyclic category layer of the Wikipedia and show an excellent fit. We verify our methods in two networks with no a priori hierarchy (the E. coli genetic regulatory network and the C. elegans neural network) and a network of function libraries of modern computer operating systems that are intrinsically hierarchical and reproduce a known functional order.
Pages: 12