Streaming Analytics with Adaptive Near-data Processing

被引:0
|
作者
Sandur, Atul [1 ,4 ]
Park, ChanHo [2 ]
Volos, Stavros [3 ]
Agha, Gul [1 ]
Jeon, Myeongjae [2 ]
机构
[1] Univ Illinois, Urbana, IL 61820 USA
[2] UNIST, Ulsan, South Korea
[3] Microsoft Res, Cambridge, England
[4] AMD Res, Sao Paulo, Brazil
关键词
Datacenter monitoring; Streaming analytics; Wide area network; Query partitioning; Edge computing; CLOUD;
D O I
10.1145/3487553.3524858
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Streaming analytics applications need to process massive volumes of data in a timely manner, in domains ranging from datacenter telemetry and geo-distributed log analytics to Internet-of-Things systems. Such applications suffer from significant network transfer costs to transport the data to a stream processor and compute costs to analyze the data in a timely manner. Pushing the computation closer to the data source by partitioning the analytics query is an effective strategy to reduce resource costs for the stream processor. However, the partitioning strategy depends on the nature of resource bottleneck and resource variability that is encountered at the compute resources near the data source. In this paper, we investigate different issues which affect query partitioning strategies. We first study new partitioning techniques within cloud datacenters which operate under constrained compute conditions varying widely across data sources and different time slots. With insights obtained from the study, we suggest several different ways to improve the performance of stream analytics applications operating in different resource environments, by making effective partitioning decisions for a variety of use cases such as geo-distributed streaming analytics.
引用
收藏
页码:563 / 566
页数:4
相关论文
共 50 条
  • [1] Practical Near-Data Processing for In-memory Analytics Frameworks
    Gao, Mingyu
    Ayers, Grant
    Kozyrakis, Christos
    [J]. 2015 INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURE AND COMPILATION (PACT), 2015, : 113 - 124
  • [2] NEAR-DATA PROCESSING
    Balasubramonian, Rajeev
    Grot, Boris
    [J]. IEEE MICRO, 2016, 36 (01) : 4 - 5
  • [3] Overcoming Challenges to Near-Data Processing
    Jayasena, Nuwan
    [J]. IEEE MICRO, 2016, 36 (01) : 8 - 9
  • [4] Near-Data Processing of Neural Networks
    Chen, Yunji
    Tao, Jinhua
    [J]. IEEE MICRO, 2016, 36 (01) : 9 - 10
  • [5] Optimizing Near-Data Processing for Spark
    Rachuri, Sri Pramodh
    Gantasala, Arun
    Emanuel, Prajeeth
    Gandhi, Anshul
    Foley, Robert
    Puhov, Peter
    Gkountouvas, Theodoros
    Lei, Hui
    [J]. 2022 IEEE 42ND INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS 2022), 2022, : 636 - 646
  • [6] An Architecture for Near-Data Processing Systems
    Vermij, Erik
    Hagleitner, Christoph
    Fiorin, Leandro
    Jongerius, Rik
    van Lunteren, Jan
    Bertels, Koen
    [J]. PROCEEDINGS OF THE ACM INTERNATIONAL CONFERENCE ON COMPUTING FRONTIERS (CF'16), 2016, : 357 - 360
  • [7] JAFAR: Near-Data Processing for Databases
    Babarinsa, Oreoluwa
    Idreos, Stratos
    [J]. SIGMOD'15: PROCEEDINGS OF THE 2015 ACM SIGMOD INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2015, : 2069 - 2070
  • [8] Jarvis: Large-scale Server Monitoring with Adaptive Near-data Processing
    Sandur, Atul
    Park, ChanHo
    Volos, Stavros
    Agha, Gul
    Jeon, Myeongjae
    [J]. 2022 IEEE 38TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2022), 2022, : 1408 - 1422
  • [9] GraNDe: Near-Data Processing Architecture With Adaptive Matrix Mapping for Graph Convolutional Networks
    Yun, Sungmin
    Kim, Byeongho
    Park, Jaehyun
    Nam, Hwayong
    Ahn, Jung Ho
    Lee, Eojin
    [J]. IEEE COMPUTER ARCHITECTURE LETTERS, 2022, 21 (02) : 45 - 48
  • [10] Computing En-Route for Near-Data Processing
    Huang, Jiayi
    Majumder, Pritam
    Kim, Sungkeun
    Fulton, Troy
    Puli, Ramprakash Reddy
    Yum, Ki Hwan
    Kim, Eun Jung
    [J]. IEEE TRANSACTIONS ON COMPUTERS, 2021, 70 (06) : 906 - 921