Clustering and Associating Method of Dual Heterogeneous Communities in Location Based Social Networks

被引:0
|
作者
Gong W.-H. [1 ]
Shen S. [1 ]
Pei X.-B. [2 ]
Yang X.-H. [1 ]
机构
[1] School of Computer Science and Technology, Zhejiang University of Technology, Hangzhou
[2] School of Software, Huazhong University of Science and Technology, Wuhan
来源
| 1909年 / Science Press卷 / 43期
基金
中国博士后科学基金; 中国国家自然科学基金;
关键词
Heterogeneous community discovery; Hypergraph clustering; Location-based social networks; Multi-dimensional relations;
D O I
10.11897/SP.J.1016.2020.01909
中图分类号
学科分类号
摘要
In recent years, community discovery in heterogeneous information networks, especially in location-based social networks(LBSN), has become an emerging research hotspot attracting more and more attentions. However, since presently most of the traditional community discovery studies in social networks only focus on homogeneous network structures, there exists one obvious drawback in these studies that they cannot effectively integrate the multi-mode entities and their multi-dimensional heterogeneity relations included in LBSN. Therefore, in order to overcome this challenging problem, this paper proposes a novel dual heterogeneous communities clustering and associating method, which is called CCAM to fully fuse with multi-mode entities and their multi-relations. The main idea of CCAM is below: firstly, in the upper social media layer of LBSN, this method measures the similarity of publishing document topics between users by means of information entropy, and further transforms the similar interests clustering problem into solving the objective function based on fuzzy clustering to identify the overlapping topic-based communities of users. Then, in the bottom geographical location layer, the bipartite graph between users and locations with check-ins relationships is converted to the hyper graph model, and the proposed hyper edge clustering approach based on graph partition is exploited to obtain the point of interest clusters about geographical features of users. Finally, the representation model for associating the upper topic-based communities and the bottom geographical location clusters is established via users' social relations in the middle user layer of LBSN, and the local optimal solution of association function for the dual heterogeneous clusters is gained by using stochastic gradient descent method. The experimental results in two real LBSN datasets such as Foursquare(NYC) and Yelp show that our proposed CCAM method can effectively fuse three types of relations in LBSN, for example, social relations of users, social media publishing relations, and users check-ins relations. Consequently, this method can accurately obtain the closely correlated dual heterogeneous clusters such as the user's interest clusters and geographical location clusters, which not only makes the external structural characteristic and the internal interest cohesive index better than some traditional community clustering algorithms, but also outperforms these algorithms at least 32% through measuring the mean average precision in the both field of online interest topics recommendation and offline point of interest recommendation. © 2020, Science Press. All right reserved.
引用
收藏
页码:1909 / 1923
页数:14
相关论文
共 28 条
  • [1] Muhammad A J, Muhammad S Y, Siddique L, Et al., Community detection in networks: A multidisciplinary review, Journal of Network and Computer Applications, 108, pp. 87-111, (2018)
  • [2] Palla G, Derenyi I, Farkas I, Et al., Uncovering the overlapping community structures of complex networks in nature and society, Nature, 435, 7043, pp. 814-818, (2005)
  • [3] Raghavan U N, Albert R, Kumara S., Near linear time algorithm to detect community structures in large-scale networks, Physical Review E, 76, 3, (2007)
  • [4] Jierui Xie, Szymanski Boleslaw K, Xiaoming Liu, SLPA: Uncovering overlapping communities in social networks via a speaker-listener interaction dynamic process, Proceedings of the 2011 IEEE 11th International Conference on Data Mining Workshops(ICDMW'11), pp. 344-349, (2011)
  • [5] Liu Shi-Chao, Zhu Fu-Xi, Gan Lin, A label-propagation-probability-based algorithm for overlapping community detection, Chinese Journal of Computers, 39, 4, pp. 717-729, (2016)
  • [6] Lu Meilian, Zhang Zhenglin, Qu Zhihe, Et al., LPANNI: Overlapping community detection using label propagation in large-scale complex networks, IEEE Transactions on Knowledge and Data Engineering, 31, 9, pp. 1736-1749, (2019)
  • [7] Shen Huawei, Cheng Xueqi, Cai Kai, Et al., Detect overlapping and hierarchical community structure in networks, Physica A: Statistical Mechanics and its Applications, 388, 8, pp. 1706-1712, (2009)
  • [8] Lee C, Reid F, McDaid A, Et al., Detecting highly overlapping community structure by greedy clique expansion, Proceedings of the 4th SNA-KDD Workshop(SNA-KDD'10), pp. 33-42, (2010)
  • [9] Zhang Zehua, Miao Duoqian, Qian Jin, Detecting overlapping communities with heuristic expansion method based on rough neighborhood, Chinese Journal of Computers, 36, 10, pp. 2078-2086, (2013)
  • [10] Ahn Y Y, Bagrow J, Lehmann S., Link communities reveal multiscale complexity in networks, Nature, 466, 7307, pp. 761-764, (2010)