Streaming Analytics with Adaptive Near-data Processing

被引:0
|
作者
Sandur, Atul [1 ,4 ]
Park, ChanHo [2 ]
Volos, Stavros [3 ]
Agha, Gul [1 ]
Jeon, Myeongjae [2 ]
机构
[1] Univ Illinois, Urbana, IL 61820 USA
[2] UNIST, Ulsan, South Korea
[3] Microsoft Res, Cambridge, England
[4] AMD Res, Sao Paulo, Brazil
关键词
Datacenter monitoring; Streaming analytics; Wide area network; Query partitioning; Edge computing; CLOUD;
D O I
10.1145/3487553.3524858
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Streaming analytics applications need to process massive volumes of data in a timely manner, in domains ranging from datacenter telemetry and geo-distributed log analytics to Internet-of-Things systems. Such applications suffer from significant network transfer costs to transport the data to a stream processor and compute costs to analyze the data in a timely manner. Pushing the computation closer to the data source by partitioning the analytics query is an effective strategy to reduce resource costs for the stream processor. However, the partitioning strategy depends on the nature of resource bottleneck and resource variability that is encountered at the compute resources near the data source. In this paper, we investigate different issues which affect query partitioning strategies. We first study new partitioning techniques within cloud datacenters which operate under constrained compute conditions varying widely across data sources and different time slots. With insights obtained from the study, we suggest several different ways to improve the performance of stream analytics applications operating in different resource environments, by making effective partitioning decisions for a variety of use cases such as geo-distributed streaming analytics.
引用
收藏
页码:563 / 566
页数:4
相关论文
共 50 条
  • [11] Sorting big data on heterogeneous near-data processing systems
    Vermij, Erik
    Fiorin, Leandro
    Hagleitner, Christoph
    Bertels, Koen
    [J]. ACM INTERNATIONAL CONFERENCE ON COMPUTING FRONTIERS 2017, 2017, : 349 - 354
  • [12] GraNDe: Near-Data Processing Architecture With Adaptive Matrix Mapping for Graph Convolutional Networks
    Yun, Sungmin
    Kim, Byeongho
    Park, Jaehyun
    Nam, Hwayong
    Ahn, Jung Ho
    Lee, Eojin
    [J]. IEEE COMPUTER ARCHITECTURE LETTERS, 2022, 21 (02) : 45 - 48
  • [13] Biscuit: A Framework for Near-Data Processing of Big Data Workloads
    Gu, Boncheol
    Yoon, Andre S.
    Bae, Duck-Ho
    Jo, Insoon
    Lee, Jinyoung
    Yoon, Jonghyun
    Kang, Jeong-Uk
    Kwon, Moonsang
    Yoon, Chanho
    Cho, Sangyeun
    Jeong, Jaeheon
    Chang, Duckhyun
    [J]. 2016 ACM/IEEE 43RD ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE (ISCA), 2016, : 153 - 165
  • [14] Near-data processing based on dynamic task offloading
    Hua, Xing-Cheng
    Liu, Peng
    [J]. Zhejiang Daxue Xuebao (Gongxue Ban)/Journal of Zhejiang University (Engineering Science), 2019, 53 (12): : 2348 - 2356
  • [15] SecNDP: Secure Near-Data Processing with Untrusted Memory
    Xiong, Wenjie
    Ke, Liu
    Jankov, Dimitrije
    Kounavis, Michael
    Wang, Xiaochen
    Northup, Eric
    Yang, Jie Amy
    Acun, Bilge
    Wu, Carole-Jean
    Tang, Ping Tak Peter
    Suh, G. Edward
    Zhang, Xuan
    Lee, Hsien-Hsin S.
    [J]. 2022 IEEE INTERNATIONAL SYMPOSIUM ON HIGH-PERFORMANCE COMPUTER ARCHITECTURE (HPCA 2022), 2022, : 244 - 258
  • [16] Efficient Machine Learning execution with Near-Data Processing
    Cordeiro, Aline S.
    dos Santos, Sairo R.
    Moreira, Francis B.
    Santos, Paulo C.
    Carro, Luigi
    Alves, Marco A. Z.
    [J]. MICROPROCESSORS AND MICROSYSTEMS, 2022, 90
  • [17] Application Codesign of Near-Data Processing for Similarity Search
    Lee, Vincent T.
    Mazumdar, Amrita
    del Mundo, Carlo C.
    Alaghi, Armin
    Ceze, Luis
    Oskin, Mark
    [J]. 2018 32ND IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS), 2018, : 896 - 907
  • [18] Machine Learning Migration for Efficient Near-Data Processing
    Cordeiro, Aline S.
    dos Santos, Sairo R.
    Moreira, Francis B.
    Santos, Paulo C.
    Carro, Luigi
    Alves, Marco A. Z.
    [J]. 2021 29TH EUROMICRO INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED AND NETWORK-BASED PROCESSING (PDP 2021), 2021, : 212 - 219
  • [19] A Survey of Near-Data Processing Architectures for Neural Networks
    Hassanpour, Mehdi
    Riera, Marc
    Gonzalez, Antonio
    [J]. MACHINE LEARNING AND KNOWLEDGE EXTRACTION, 2022, 4 (01): : 66 - 102
  • [20] GCiM: A Near-Data Processing Accelerator for Graph Construction
    He, Lei
    Liu, Cheng
    Wang, Ying
    Liang, Shengwen
    Li, Huawei
    Li, Xiaowei
    [J]. 2021 58TH ACM/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2021, : 205 - 210