Density-Based Data Selection and Management for Edge Computing

Cited by: 2
Authors
Oikawa, Hiroki [1]
Kondo, Masaaki [1]
Affiliations
[1] Univ Tokyo, Grad Sch Informat Sci & Technol, Tokyo, Japan
Keywords
edge computing; data management; REPRESENTATIVE SUBSET; INTERNET; NETWORK
DOI
10.1109/PERCOM50583.2021.9439127
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
The widespread adoption of IoT devices has made it possible to acquire enormous amounts of real-time sensor information. Because of the explosive increase in sensing data volume, it is becoming difficult to collect and process all the data in one central place. On the one hand, storing and processing data on edge devices, so-called edge computing, is becoming important. On the other hand, edge devices usually have only limited computing and memory resources, so it is not practical to process and save all the acquired data. There is therefore great demand for effectively selecting which data to process on an edge device or to transfer to a cloud server. In this paper, we propose an efficient density-based data selection and management method called O-D2M, by which edge devices store data that represent the inherent data distribution. We use a low-cost graph algorithm to analyze the trend and density of the input data. We evaluate the effectiveness of the proposed O-D2M against other methods in terms of the accuracy of machine learning models trained on the selected data. Throughout the evaluation, we confirm that O-D2M obtains higher accuracy and lower computation cost, while reducing the amount of data to be processed or transferred by up to 20 points.
Pages: 11