GeeLytics: Geo-distributed Edge Analytics for Large Scale IoT Systems Based on Dynamic Topology

被引:0
|
作者
Cheng, Bin [1 ]
Papageorgiou, Apostolos [1 ]
Cirillo, Flavio [1 ]
Kovacs, Ernoe [1 ]
机构
[1] NEC Labs Europe, Heidelberg, Germany
关键词
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
High data rate sensors such as video cameras, audio sensors, and motion sensors are becoming ubiquitous in the Internet of Things (IoT). In large scale IoT systems like smart cities, a large number of sensors are now widely deployed at different locations, generating a huge amount of stream data. Although the generated data provide us great potential to sense our live environments, it still remains a big challenge to efficiently extract real-time results from sensor data to make fast decisions. Existing stream processing platforms, such as Storm, Spark Streaming, and S4, are well designed to process stream data within a cluster in the Cloud, but they are not suitable for highly distributed IoT systems in which data are naturally geo-distributed and low latency analytics results are expected to be shared across users and applications. To tackle this problem, we design an edge analytics platform called GeeLytics, which can perform real-time stream processing both at the network edges and in the Cloud in a dynamic and transparent manner. In this position paper we discuss its use cases, motivation, and preliminary architecture design. As compared with the start of the art, GeeLytics is designed to support dynamic stream processing topologies by taking into account the system characteristics of heterogeneous edge/Cloud nodes and also the current system workload. This shall achieve low latency analytics results while minimizing the edge-to-Cloud bandwidth consumption. In addition, using docker application containers for packaging up deployable tasks and a distributed pub/sub mechanism for inter-task stream data routing, GeeLytics shall provide better resource isolation and system efficiency to support multi-tenancy.
引用
收藏
页码:565 / 570
页数:6
相关论文
共 50 条
  • [1] Geo-Distributed IoT Data Analytics With Deadline Constraints Across Network Edge
    Chen, Yiting
    Luo, Lailong
    Ren, Bangbang
    Guo, Deke
    IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (22) : 22914 - 22929
  • [2] Adaptive Partitioning for Large-Scale Graph Analytics in Geo-Distributed Data Centers
    Zhou, Amelie Chi
    Luo, Juanyun
    Qiu, Ruibo
    Tan, Haobin
    He, Bingsheng
    Mao, Rui
    2022 IEEE 38TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2022), 2022, : 2818 - 2830
  • [3] Optimal Query Plans for Geo-distributed Data Analytics at Scale
    Pradhan, Ahana
    Karthik, Srinivas
    Subramanya, Raghunandan
    PROCEEDINGS OF 7TH JOINT INTERNATIONAL CONFERENCE ON DATA SCIENCE AND MANAGEMENT OF DATA, CODS-COMAD 2024, 2024, : 247 - 251
  • [4] Distributed Profitable Deployment of Network Services to Geo-distributed Edge Systems
    Chen, Yi-Chia
    Yen, Li-Hsing
    APNOMS 2020: 2020 21ST ASIA-PACIFIC NETWORK OPERATIONS AND MANAGEMENT SYMPOSIUM (APNOMS), 2020, : 208 - 213
  • [5] Multi-Objective Optimizations in Geo-Distributed Data Analytics Systems
    Niu, Zhaojie
    He, Bingsheng
    Zhou, Amelie Chi
    Tong, Lau Chiew
    2017 IEEE 23RD INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS (ICPADS), 2017, : 519 - 528
  • [6] Trading Cost and Throughput in Geo-Distributed Analytics With A Two Time Scale Approach
    Xu, Xinping
    Li, Wenxin
    Xu, Renhai
    Qi, Heng
    Li, Keqiu
    Zhou, Xiaobo
    Chen, Sheng
    IEEE TRANSACTIONS ON CLOUD COMPUTING, 2022, 10 (03) : 2163 - 2177
  • [7] A TTL-based Approach for Data Aggregation in Geo-distributed Streaming Analytics
    Kumar, Dhruv
    Li, Jian
    Chandra, Abhishek
    Sitaraman, Ramesh K.
    PROCEEDINGS OF THE ACM ON MEASUREMENT AND ANALYSIS OF COMPUTING SYSTEMS, 2019, 3 (02)
  • [8] Deep Reinforcement Learning based VNF Management in Geo-distributed Edge Computing
    Gu, Lin
    Zeng, Deze
    Li, Wei
    Guo, Song
    Zomaya, Albert Y.
    Jin, Hai
    2019 39TH IEEE INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS 2019), 2019, : 934 - 943
  • [9] ePulsar: Control Plane for Publish-Subscribe Systems on Geo-Distributed Edge Infrastructure
    Gupta, Harshit
    Landle, Tyler C.
    Ramachandran, Umakishore
    2021 ACM/IEEE 6TH SYMPOSIUM ON EDGE COMPUTING (SEC 2021), 2021, : 228 - 241
  • [10] Octopus: Based on Congestion-aware Scheduling on Geo-distributed Big Data Analytics Cluster
    Du, Haizhou
    Zhang, Keke
    Yang, Zhenchen
    2018 5TH INTERNATIONAL CONFERENCE ON SYSTEMS AND INFORMATICS (ICSAI), 2018, : 490 - 495