Cost-effective and adaptive clustering algorithm for stream processing on cloud system

被引:0
|
作者
Yue Xia
Junhua Fang
Pingfu Chao
Zhicheng Pan
Jedi S. Shang
机构
[1] Soochow University,School of Computer Science and Technology
[2] The University of Queensland,School of Information Technology and Electrical Engineering
[3] Thinvent Technology Co. LTD.,undefined
来源
GeoInformatica | 2023年 / 27卷
关键词
Real-time processing; Density-based clustering; Window model; Time interval; Cluster evolution;
D O I
暂无
中图分类号
学科分类号
摘要
Clustering is a fundamental operation that plays an essential role in data management and analysis. Clustering algorithms have been well studied over the past two decades, but the real-time clustering has yet to be maturely applied. For applications based on clustering calculations, capturing the dynamic changes of clusters and trends of moving objects in a real-time manner can maximize the value of the data. Although the DSPE (D istributed S tream P rocessing E ngine) is capable of such workloads, it still faces the problems of fixed window size and computational resources waste. In this paper, we introduce a new C ost-e ffective and A daptive C lustering method (CeAC), which can improve computational efficiency while ensuring the accuracy of the clustering result. Specifically, we design a composite window model which contains the latest data records and maintains historical states. To achieve a lightweight clustering, we propose a fully online clustering algorithm based on grid density, which can capture clusters with arbitrary shape and effectively handle outliers in parallel. We further introduce an adaptive calculation model to accelerate the clustering operation by shedding workload according to the incoming data characteristic. Experimental results show that the proposed method is accurate and efficient in real-time data stream clustering.
引用
收藏
页码:1 / 21
页数:20
相关论文
共 50 条
  • [21] Cost-efficient Stream Processing on the Cloud
    Tri Minh Truong
    Harwood, Aaron
    Sinnott, Richard O.
    Chen, Shiping
    2019 IEEE 12TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING (IEEE CLOUD 2019), 2019, : 209 - 213
  • [22] Cost-Effective Resource Provisioning for MapReduce in a Cloud
    Palanisamy, Balaji
    Singh, Aameek
    Liu, Ling
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2015, 26 (05) : 1265 - 1279
  • [23] A Cost-Effective Cloud Event Archival for SIEMs
    Serckumecka, Adriano
    Medeiros, Iberia
    Ferreira, Bernardo
    Bessani, Alysson
    2019 38TH INTERNATIONAL SYMPOSIUM ON RELIABLE DISTRIBUTED SYSTEMS WORKSHOPS (SRDSW 2019), 2019, : 31 - 36
  • [24] Toward a Cost-effective Cloud Storage Service
    Kim, Shin-gyu
    Han, Hyuck
    Eom, Hyeonsang
    Yeom, Heon Y.
    12TH INTERNATIONAL CONFERENCE ON ADVANCED COMMUNICATION TECHNOLOGY: ICT FOR GREEN GROWTH AND SUSTAINABLE DEVELOPMENT, VOLS 1 AND 2, 2010, : 99 - 102
  • [25] Scalable and cost-effective NGS genotyping in the cloud
    Yassine Souilmi
    Alex K. Lancaster
    Jae-Yoon Jung
    Ettore Rizzo
    Jared B. Hawkins
    Ryan Powles
    Saaïd Amzazi
    Hassan Ghazal
    Peter J. Tonellato
    Dennis P. Wall
    BMC Medical Genomics, 8
  • [26] Scalable and cost-effective NGS genotyping in the cloud
    Souilmi, Yassine
    Lancaster, Alex K.
    Jung, Jae-Yoon
    Rizzo, Ettore
    Hawkins, Jared B.
    Powles, Ryan
    Amzazi, Saaid
    Ghazal, Hassan
    Tonellato, Peter J.
    Wall, Dennis P.
    BMC MEDICAL GENOMICS, 2015, 8
  • [27] On Achieving Cost-Effective Adaptive Cloud Gaming in Geo-Distributed Data Centers
    Tian, Hao
    Wu, Di
    He, Jian
    Xu, Yuedong
    Chen, Min
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2015, 25 (12) : 2064 - 2077
  • [28] Effective clustering algorithm for probabilistic data stream
    Dai, Dong-Bo
    Zhao, Gang
    Sun, Sheng-Li
    Ruan Jian Xue Bao/Journal of Software, 2009, 20 (05): : 1313 - 1328
  • [29] Complementary Base Station Clustering for Cost-Effective and Energy-Efficient Cloud-RAN
    Chen, Longbiao
    Liu, Linjin
    Fan, Xiaoliang
    Li, Johnthan
    Wang, Cheng
    Pan, Gang
    Jakubowicz, Jeremie
    Thi-Mai-Trang Nguyen
    2017 IEEE SMARTWORLD, UBIQUITOUS INTELLIGENCE & COMPUTING, ADVANCED & TRUSTED COMPUTED, SCALABLE COMPUTING & COMMUNICATIONS, CLOUD & BIG DATA COMPUTING, INTERNET OF PEOPLE AND SMART CITY INNOVATION (SMARTWORLD/SCALCOM/UIC/ATC/CBDCOM/IOP/SCI), 2017,
  • [30] An Adaptive Density Data Stream Clustering Algorithm
    Shifei Ding
    Jian Zhang
    Hongjie Jia
    Jun Qian
    Cognitive Computation, 2016, 8 : 30 - 38