A strong coreset algorithm to accelerate OPF as a graph-based machine learning in large-scale problems

Cited by: 1
Authors
Bostani, Hamid [1 ]
Sheikhan, Mansour [2 ]
Mahboobi, Behrad [3 ]
Affiliations
[1] Islamic Azad Univ, South Tehran Branch, Young Researchers & Elite Club, Tehran, Iran
[2] Islamic Azad Univ, Dept Elect Engn, South Tehran Branch, Tehran, Iran
[3] Islamic Azad Univ, Commun Comp & Ind Network Res Ctr, Dept Elect & Comp Engn, Sci & Res Branch, Tehran, Iran
Funding
National Science Foundation (USA);
Keywords
Coreset; Optimum-path forest; Large-scale problems; Massive datasets; OPTIMUM-PATH FOREST; INTRUSION DETECTION; CLASSIFICATION; HYBRID;
DOI
10.1016/j.ins.2020.10.009
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Optimum-path forest (OPF) is an efficient graph-based framework that determines the patterns in an input dataset by encoding the data as a graph and extracting the optimal partitions of that graph. Because OPF was introduced under simple assumptions that do not consider the requirements of large-scale problems, it is effective only for input datasets of moderate size. To provide a scalable OPF, this study introduces a strong coreset for accelerating the OPF algorithm. This approach expedites the OPF procedure, especially on massive datasets. Accordingly, a novel algebra is developed to represent the OPF problem as an optimization problem for the proposed coreset definition. A novel coreset construction algorithm that approximates the OPF solutions is then proposed to improve the OPF construction speed. Simulation results from diverse experiments on various benchmark datasets illustrate the computational gain and superiority of the proposed algorithm in construction and classification speed compared with the original algorithm, while maintaining reliably accurate performance. The presented coreset construction algorithm performs the training and testing phases of OPF up to 6.1 and 4.9 times faster than the original algorithm, respectively. (C) 2020 Elsevier Inc. All rights reserved.
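For orientation, the following is a minimal sketch of a baseline supervised OPF classifier as it is commonly described in the literature: prototypes are taken from a minimum spanning tree of the complete graph, and each sample is conquered by the prototype offering the smallest maximum-edge path cost. It is not the coreset construction of this paper, whose details the abstract does not give. The function names (find_prototypes, train_opf, classify), the Euclidean distance, and the toy data are all assumptions made for illustration. The quadratic number of distance evaluations over the complete graph hints at why large datasets become costly.

```python
import heapq
import math

def find_prototypes(X, y):
    """Prim's MST on the complete graph; endpoints of MST edges that join
    different classes are taken as prototypes (assumed convention)."""
    n = len(X)
    in_tree = [False] * n
    key = [math.inf] * n
    parent = [-1] * n
    key[0] = 0.0
    prototypes = set()
    for _ in range(n):
        u = min((i for i in range(n) if not in_tree[i]), key=key.__getitem__)
        in_tree[u] = True
        if parent[u] != -1 and y[parent[u]] != y[u]:
            prototypes.update((u, parent[u]))
        for v in range(n):
            if not in_tree[v]:
                d = math.dist(X[u], X[v])
                if d < key[v]:
                    key[v], parent[v] = d, u
    return prototypes

def train_opf(X, y):
    """Dijkstra-like propagation with path cost = largest edge on the path."""
    n = len(X)
    cost = [math.inf] * n
    label = list(y)
    # Fall back to sample 0 if every sample shares one class.
    protos = find_prototypes(X, y) or {0}
    heap = []
    for p in protos:
        cost[p] = 0.0
        heap.append((0.0, p))
    heapq.heapify(heap)
    done = [False] * n
    while heap:
        c, u = heapq.heappop(heap)
        if done[u]:
            continue  # stale heap entry
        done[u] = True
        for v in range(n):
            if not done[v]:
                new_cost = max(c, math.dist(X[u], X[v]))
                if new_cost < cost[v]:
                    cost[v] = new_cost
                    label[v] = label[u]
                    heapq.heappush(heap, (new_cost, v))
    return cost, label

def classify(X, cost, label, t):
    """Assign t the label of the training sample offering the cheapest path."""
    best = min(range(len(X)), key=lambda s: max(cost[s], math.dist(X[s], t)))
    return label[best]

# Toy data (hypothetical): two well-separated clusters.
X = [(0, 0), (0, 1), (1, 0), (5, 5), (5, 6), (6, 5)]
y = [0, 0, 0, 1, 1, 1]
cost, label = train_opf(X, y)
```

Calling classify(X, cost, label, t) on a new point t compares it against every training sample, so test time grows with the training set size; shrinking that set is the kind of saving a coreset aims for.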
Pages: 424-441
Page count: 18
Related papers
50 in total
  • [21] TGE: Machine Learning Based Task Graph Embedding for Large-Scale Topology Mapping
    Choi, Jong Youl
    Logan, Jeremy
    Wolf, Matthew
    Ostrouchov, George
    Kurc, Tahsin
    Liu, Qing
    Podhorszki, Norbert
    Klasky, Scott
    Romanus, Melissa
    Sun, Qian
    Parashar, Manish
    Churchill, Randy Michael
    Chang, C. S.
    2017 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER), 2017, : 587 - 591
  • [22] A branch and bound irredundant graph algorithm for large-scale MLCS problems
    Wang, Chunyang
    Wang, Yuping
    Cheung, Yiuming
    PATTERN RECOGNITION, 2021, 119
  • [23] Graph-based multi-agent reinforcement learning for large-scale UAVs swarm system control
    Zhao, Bocheng
    Huo, Mingying
    Li, Zheng
    Yu, Ze
    Qi, Naiming
    AEROSPACE SCIENCE AND TECHNOLOGY, 2024, 150
  • [24] A Universal Machine Learning Algorithm for Large-Scale Screening of Materials
    Fanourgakis, George S.
    Gkagkas, Konstantinos
    Tylianakis, Emmanuel
    Froudakis, George E.
    JOURNAL OF THE AMERICAN CHEMICAL SOCIETY, 2020, 142 (08) : 3814 - 3822
  • [25] Large-scale Graph Representation Learning
    Leskovec, Jure
    2017 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2017, : 4 - 4
  • [26] Graph-Based Bayesian Optimization for Large-Scale Objective-Based Experimental Design
    Imani, Mahdi
    Ghoreishi, Seyede Fatemeh
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (10) : 5913 - 5925
  • [27] A k-Hop Graph-Based Observer for Large-Scale Networked Systems
    Gasparri, Andrea
    Marino, Alessandro
    2017 IEEE 56TH ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2017,
  • [28] A survey of large-scale graph-based semi-supervised classification algorithms
    Song Y.
    Zhang J.
    Zhang C.
    International Journal of Cognitive Computing in Engineering, 2022, 3 : 188 - 198
  • [29] GraphMeta: A Graph-Based Engine for Managing Large-Scale HPC Rich Metadata
    Dai, Dong
    Chen, Yong
    Carns, Philip
    Jenkins, John
    Zhang, Wei
    Ross, Robert
    2016 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER), 2016, : 298 - 307
  • [30] EGM: Enhanced Graph-based Model for Large-scale Video Advertisement Search
    Yu, Tan
    Liu, Jie
    Yang, Yi
    Li, Yi
    Fei, Hongliang
    Li, Ping
    PROCEEDINGS OF THE 28TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2022, 2022, : 4443 - 4451