A strong coreset algorithm to accelerate OPF as a graph-based machine learning in large-scale problems

Cited by: 1
Authors
Bostani, Hamid [1]
Sheikhan, Mansour [2]
Mahboobi, Behrad [3]
Affiliations
[1] Islamic Azad Univ, South Tehran Branch, Young Researchers & Elite Club, Tehran, Iran
[2] Islamic Azad Univ, Dept Elect Engn, South Tehran Branch, Tehran, Iran
[3] Islamic Azad Univ, Commun Comp & Ind Network Res Ctr, Dept Elect & Comp Engn, Sci & Res Branch, Tehran, Iran
Funding
U.S. National Science Foundation;
Keywords
Coreset; Optimum-path forest; Large-scale problems; Massive datasets; OPTIMUM-PATH FOREST; INTRUSION DETECTION; CLASSIFICATION; HYBRID;
DOI
10.1016/j.ins.2020.10.009
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Optimum-path forest (OPF) is an efficient graph-based framework that determines the patterns in an input dataset by encoding the data as a graph and extracting the optimal partitions of that graph. Because OPF was introduced under simple assumptions that do not account for the requirements of large-scale problems, it is effective only for input datasets of reasonable size. To provide a scalable OPF, this study introduces a strong coreset for accelerating the OPF algorithm. This approach can expedite the OPF procedure, especially on massive datasets. Accordingly, a novel algebra is developed to represent the OPF problem as an optimization problem for the proposed coreset definition. A novel coreset construction algorithm that can approximate the OPF solutions is then proposed to improve the OPF construction speed. Simulation results from diverse experiments on various benchmark datasets show the computational gain and superiority of the proposed algorithm over the original algorithm in terms of construction and classification speed, while maintaining reliably accurate performance. The presented coreset construction algorithm performs the training and testing phases of OPF up to 6.1 and 4.9 times faster than the original algorithm, respectively. (C) 2020 Elsevier Inc. All rights reserved.
Pages: 424 - 441
Page count: 18
Related papers
50 in total
  • [41] A Distributed Algorithm for Large-Scale Graph Partitioning
    Rahimian, Fatemeh
    Payberah, Amir H.
    Girdzijauskas, Sarunas
    Jelasity, Mark
    Haridi, Seif
    ACM TRANSACTIONS ON AUTONOMOUS AND ADAPTIVE SYSTEMS, 2015, 10 (02)
  • [42] Accelerating Large-Scale Graph-Based Nearest Neighbor Search on a Computational Storage Platform
    Kim, Ji-Hoon
    Park, Yeo-Reum
    Do, Jaeyoung
    Ji, Soo-Young
    Kim, Joo-Young
    IEEE TRANSACTIONS ON COMPUTERS, 2023, 72 (01) : 278 - 290
  • [43] A Survey on Large-Scale Machine Learning
    Wang, Meng
    Fu, Weijie
    He, Xiangnan
    Hao, Shijie
    Wu, Xindong
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2022, 34 (06) : 2574 - 2594
  • [44] Graph-based Topological Exploration Planning in Large-scale 3D Environments
    Yang, Fan
    Lee, Dung-Han
    Keller, John
    Scherer, Sebastian
    2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021 : 12730 - 12736
  • [45] A large-scale study of call graph-based impact prediction using mutation testing
    Vincenzo Musco
    Martin Monperrus
    Philippe Preux
    Software Quality Journal, 2017, 25 : 921 - 950
  • [46] Reflection on the energy graph-based visualisation approach to FDI of large-scale industrial systems
    Uren, Kenneth R.
    van Schoor, George
    IFAC PAPERSONLINE, 2023, 56 (02) : 4094 - 4107
  • [47] A large-scale study of call graph-based impact prediction using mutation testing
    Musco, Vincenzo
    Monperrus, Martin
    Preux, Philippe
    SOFTWARE QUALITY JOURNAL, 2017, 25 (03) : 921 - 950
  • [48] Procedural Generation of Large-Scale Forests Using a Graph-Based Neutral Landscape Model
    Li, Jiaqi
    Gu, Xiaoyan
    Li, Xinchi
    Tan, Junzhong
    She, Jiangfeng
    ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION, 2018, 7 (03)
  • [49] A Sampling-Based Graph Clustering Algorithm for Large-Scale Networks
    Zhang J.-P.
    Chen H.-C.
    Wang K.
    Zhu K.-J.
    Wang Y.-W.
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2019, 47 (08) : 1731 - 1737
  • [50] Large-scale knowledge graph representation learning
    Badrouni, Marwa
    Katar, Chaker
    Inoubli, Wissem
    KNOWLEDGE AND INFORMATION SYSTEMS, 2024, 66 (09) : 5479 - 5499