A novel parallel distance metric-based approach for diversified ranking on large graphs

被引:2
|
作者
Li, Jin [1 ,2 ,3 ]
Yang, Yun [1 ,2 ,3 ]
Wang, Xiaoling [4 ]
Zhao, Zhiming [5 ]
Li, Tong [2 ,3 ]
机构
[1] Yunnan Univ, Sch Software, Kunming, Yunnan, Peoples R China
[2] Key Lab Software Engn Yunnan Prov, Kunming, Yunnan, Peoples R China
[3] Kunming Key Lab Data Sci & Intelligent Comp, Kunming, Yunnan, Peoples R China
[4] East China Normal Univ, Shanghai Key Lab Trustworthy Comp, Shanghai, Peoples R China
[5] Univ Amsterdam, Amsterdam, Netherlands
基金
中国国家自然科学基金;
关键词
Graph algorithms; Diversified ranking; Distance metric; Parallel computing; MapReduce; TEMPORAL DATA; ENSEMBLE;
D O I
10.1016/j.future.2018.05.031
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Diversified ranking on graphs (DRG) is an important and challenging issue in researching graph data mining. Traditionally, this problem is modeled by a submodular optimization objective, and solved by applying a cardinality constrained monotone submodular maximization. However, the existing submodular objectives do not directly capture the dis-similarity over pairs of nodes, while most of algorithms cannot easily take full advantage of the power of a distributed cluster computing platform, such as Spark, to significantly promote the efficiency of algorithms. To overcome the deficiencies of existing approaches, in this paper, a generalized distance metric based on a subadditive set function over the symmetry difference of neighbors of pairs of nodes is introduced to capture the pairwise dis-similarity over pairs of nodes. In our approach, DRG is formulated as a Max-Sum k-dispersion problem with metrical edge weights, which is NP-hard, in association with the proposed distance metric, a centralized linear time 2-approximation algorithm GA is then developed to significantly solve the problem of DRG. Moreover, we develop a highly parallelizable algorithm for DRG, which can be easily implemented in MapReduce style parallel computation models using GA as a basic reducer. Finally, extensive experiments are conducted on real network datasets to verify the effectiveness and efficiency of our proposed approaches. (C) 2018 Elsevier B.V. All rights reserved.
引用
收藏
页码:79 / 91
页数:13
相关论文
共 50 条
  • [31] Basic statistics for distributional symbolic variables: a new metric-based approach
    Antonio Irpino
    Rosanna Verde
    [J]. Advances in Data Analysis and Classification, 2015, 9 : 143 - 175
  • [32] Sigmoid distance metric-based spline adaptive filters for nonlinear adaptive noise cancellation
    Li, Wenqi
    Zhou, Zongtan
    Li, Hongxin
    Xu, Ming
    Tang, Jingsheng
    [J]. Information Sciences, 2024, 677
  • [33] A Metric-Based Approach to Modelling a Virtual Machine for Smart Contract Execution
    Luca, Spataru Alexe
    Pungila, Ciprian
    [J]. 2020 22ND INTERNATIONAL SYMPOSIUM ON SYMBOLIC AND NUMERIC ALGORITHMS FOR SCIENTIFIC COMPUTING (SYNASC 2020), 2020, : 302 - 309
  • [34] Distance metric-based forest cover change detection using MODIS time series
    Huang, Xiaoman
    Fried, Mark A.
    [J]. INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2014, 29 : 78 - 92
  • [35] Program Code Understandability and Authenticating Code Predicting Systems: A Metric-Based Approach
    Jha, Pooja
    Patnaik, K. Sridhar
    [J]. PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON SIGNAL, NETWORKS, COMPUTING, AND SYSTEMS (ICSNCS 2016), VOL 2, 2016, 396 : 95 - 103
  • [36] An Approach to Counteracting the Common Cyber-attacks According to the Metric-Based Model
    Geramiparvar, Mohammad Sirwan
    Modiri, Nasser
    [J]. International Journal of Computer Science and Network Security, 2016, 16 (01): : 81 - 85
  • [37] Novel Metric-Based PAPR Reduction Schemes for MC-CDMA Systems
    Huang, Wan-Jen
    Hu, Wei-Wen
    Li, Chih-Peng
    Chen, Jung-Chieh
    [J]. IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2015, 64 (09) : 3982 - 3989
  • [38] Software metric-based neural network classification models of a very large telecommunications system
    Khoshgoftaar, TM
    Allen, EB
    Hudepohl, JP
    Aud, SJ
    [J]. APPLICATIONS AND SCIENCE OF ARTIFICIAL NEURAL NETWORKS II, 1996, 2760 : 634 - 645
  • [39] A Novel Distance Metric Based on Differential Evolution
    Ertugrul, Omer Faruk
    [J]. ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2019, 44 (11) : 9641 - 9651
  • [40] An Approach for Assessing Similarity Metrics Used in Metric-based Clone Detection Techniques
    Shawky, Doaa M.
    Ali, Ahmed F.
    [J]. PROCEEDINGS 2010 3RD IEEE INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND INFORMATION TECHNOLOGY, (ICCSIT 2010), VOL 1, 2010, : 580 - 584