A novel parallel distance metric-based approach for diversified ranking on large graphs

被引:2
|
作者
Li, Jin [1 ,2 ,3 ]
Yang, Yun [1 ,2 ,3 ]
Wang, Xiaoling [4 ]
Zhao, Zhiming [5 ]
Li, Tong [2 ,3 ]
机构
[1] Yunnan Univ, Sch Software, Kunming, Yunnan, Peoples R China
[2] Key Lab Software Engn Yunnan Prov, Kunming, Yunnan, Peoples R China
[3] Kunming Key Lab Data Sci & Intelligent Comp, Kunming, Yunnan, Peoples R China
[4] East China Normal Univ, Shanghai Key Lab Trustworthy Comp, Shanghai, Peoples R China
[5] Univ Amsterdam, Amsterdam, Netherlands
基金
中国国家自然科学基金;
关键词
Graph algorithms; Diversified ranking; Distance metric; Parallel computing; MapReduce; TEMPORAL DATA; ENSEMBLE;
D O I
10.1016/j.future.2018.05.031
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Diversified ranking on graphs (DRG) is an important and challenging issue in researching graph data mining. Traditionally, this problem is modeled by a submodular optimization objective, and solved by applying a cardinality constrained monotone submodular maximization. However, the existing submodular objectives do not directly capture the dis-similarity over pairs of nodes, while most of algorithms cannot easily take full advantage of the power of a distributed cluster computing platform, such as Spark, to significantly promote the efficiency of algorithms. To overcome the deficiencies of existing approaches, in this paper, a generalized distance metric based on a subadditive set function over the symmetry difference of neighbors of pairs of nodes is introduced to capture the pairwise dis-similarity over pairs of nodes. In our approach, DRG is formulated as a Max-Sum k-dispersion problem with metrical edge weights, which is NP-hard, in association with the proposed distance metric, a centralized linear time 2-approximation algorithm GA is then developed to significantly solve the problem of DRG. Moreover, we develop a highly parallelizable algorithm for DRG, which can be easily implemented in MapReduce style parallel computation models using GA as a basic reducer. Finally, extensive experiments are conducted on real network datasets to verify the effectiveness and efficiency of our proposed approaches. (C) 2018 Elsevier B.V. All rights reserved.
引用
收藏
页码:79 / 91
页数:13
相关论文
共 50 条
  • [21] A distance measure for large graphs based on prime graphs
    Lagraa, Sofiane
    Seba, Hamida
    Khennoufa, Riadh
    M'Baya, Abir
    Kheddouci, Hamamache
    [J]. PATTERN RECOGNITION, 2014, 47 (09) : 2993 - 3005
  • [22] Ensemble-based clustering of large probabilistic graphs using neighborhood and distance metric learning
    Malihe Danesh
    Morteza Dorrigiv
    Farzin Yaghmaee
    [J]. The Journal of Supercomputing, 2021, 77 : 4107 - 4134
  • [23] Ensemble-based clustering of large probabilistic graphs using neighborhood and distance metric learning
    Danesh, Malihe
    Dorrigiv, Morteza
    Yaghmaee, Farzin
    [J]. JOURNAL OF SUPERCOMPUTING, 2021, 77 (04): : 4107 - 4134
  • [24] Metric-based approach to detect abstract data types and state encapsulations
    Girard J.-F.
    Koschke R.
    Schied G.
    [J]. Automated Software Engineering, 1999, 6 (4) : 357 - 386
  • [25] A Metric-Based Approach to Assess Risk for “On Cloud” Federated Identity Management
    Patricia Arias-Cabarcos
    Florina Almenárez-Mendoza
    Andrés Marín-López
    Daniel Díaz-Sánchez
    Rosa Sánchez-Guerrero
    [J]. Journal of Network and Systems Management, 2012, 20 : 513 - 533
  • [26] A Metric-Based Approach to Assess Risk for "On Cloud" Federated Identity Management
    Arias-Cabarcos, Patricia
    Almenarez-Mendoza, Florina
    Marin-Lopez, Andres
    Diaz-Sanchez, Daniel
    Sanchez-Guerrero, Rosa
    [J]. JOURNAL OF NETWORK AND SYSTEMS MANAGEMENT, 2012, 20 (04) : 513 - 533
  • [27] A metric-based approach to detect abstract data types and state encapsulations
    Girard, JF
    Koschke, R
    Scheid, G
    [J]. AUTOMATED SOFTWARE ENGINEERING, 12TH IEEE INTERNATIONAL CONFERENCE, PROCEEDINGS, 1997, : 82 - 89
  • [28] A Temporal Metric-Based Efficient Approach to Predict Citation Counts of Scientists
    Dewangan, Saumya Kumar
    Bhattacharjee, Shrutilipi
    Shetty, Ramya D.
    [J]. ARTIFICIAL INTELLIGENCE APPLICATIONS AND INNOVATIONS, AIAI 2023, PT I, 2023, 675 : 343 - 355
  • [29] Time Window Fréchet and Metric-Based Edit Distance for Passively Collected Trajectories
    Ding, Jiaxin
    Gao, Jie
    Skiena, Steven
    [J]. arXiv, 2022,
  • [30] A Metric-Based Approach for Anti-pattern Detection in UML Designs
    Fourati, Rahma
    Bouassida, Nadia
    Ben Abdallah, Hanene
    [J]. COMPUTER AND INFORMATION SCIENCE 2011, 2011, 364 : 17 - 33