Cost-effective and adaptive clustering algorithm for stream processing on cloud system

Authors
Yue Xia
Junhua Fang
Pingfu Chao
Zhicheng Pan
Jedi S. Shang
Affiliations
[1] Soochow University, School of Computer Science and Technology
[2] The University of Queensland, School of Information Technology and Electrical Engineering
[3] Thinvent Technology Co. LTD.
Source
GeoInformatica | 2023 / Vol. 27
Keywords
Real-time processing; Density-based clustering; Window model; Time interval; Cluster evolution
Abstract
Clustering is a fundamental operation that plays an essential role in data management and analysis. Clustering algorithms have been well studied over the past two decades, but real-time clustering has yet to be applied maturely. For applications built on clustering, capturing the dynamic changes of clusters and the trends of moving objects in real time can maximize the value of the data. Although a DSPE (Distributed Stream Processing Engine) is capable of such workloads, it still suffers from fixed window sizes and wasted computational resources. In this paper, we introduce a new Cost-effective and Adaptive Clustering method (CeAC), which improves computational efficiency while preserving the accuracy of the clustering result. Specifically, we design a composite window model that contains the latest data records and maintains historical states. To achieve lightweight clustering, we propose a fully online clustering algorithm based on grid density, which can capture clusters of arbitrary shape and effectively handle outliers in parallel. We further introduce an adaptive calculation model that accelerates the clustering operation by shedding workload according to the characteristics of the incoming data. Experimental results show that the proposed method is accurate and efficient in real-time data stream clustering.
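The abstract names grid density as the basis of the clustering step but gives no algorithmic detail. The following is a minimal batch sketch of the general grid-density idea only, not the CeAC algorithm itself: points are binned into cells, cells holding enough points are treated as dense, and adjacent dense cells are merged into one cluster. The function name `grid_density_cluster` and the parameters `cell_size` and `min_density` are assumptions made for illustration; the composite window model and the adaptive load shedding are not modeled.

```python
from collections import defaultdict, deque


def grid_density_cluster(points, cell_size, min_density):
    """Cluster 2-D points by merging adjacent dense grid cells.

    Returns one label per point; -1 marks points in sparse cells (outliers).
    Illustrative only: names and parameters are hypothetical, not from the paper.
    """
    # Map each point index to the grid cell that contains the point.
    cells = defaultdict(list)
    for idx, (x, y) in enumerate(points):
        cells[(int(x // cell_size), int(y // cell_size))].append(idx)

    # A cell is dense when it holds at least min_density points.
    dense = {c for c, members in cells.items() if len(members) >= min_density}

    labels = [-1] * len(points)
    seen = set()
    cluster_id = 0
    for start in sorted(dense):
        if start in seen:
            continue
        # Breadth-first search over the 8-neighbourhood of dense cells.
        queue = deque([start])
        seen.add(start)
        while queue:
            cx, cy = queue.popleft()
            for idx in cells[(cx, cy)]:
                labels[idx] = cluster_id
            for dx in (-1, 0, 1):
                for dy in (-1, 0, 1):
                    nb = (cx + dx, cy + dy)
                    if nb in dense and nb not in seen:
                        seen.add(nb)
                        queue.append(nb)
        cluster_id += 1
    return labels
```

Because clusters are built from connected cells rather than from distances to a centroid, this kind of scheme can follow arbitrarily shaped regions, and points that fall in sparse cells are left unlabeled as outliers.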
Pages: 1 - 21
Number of pages: 20