FlowKV: A Semantic-Aware Store for Large-Scale State Management of Stream Processing Engines

被引:1
|
作者
Lee, Gyewon [1 ,2 ]
Maeng, Jaewoo [2 ]
Park, Jinsol [2 ]
Seo, Jangho [3 ]
Cho, Haeyoon [2 ]
Yang, Youngseok [4 ]
Um, Taegeon [5 ]
Lee, Jongsung [2 ,6 ]
Lee, Jae W. [2 ]
Chun, Byung-Gon [1 ,2 ]
机构
[1] FriendliAI, Seoul, South Korea
[2] Seoul Natl Univ, Seoul, South Korea
[3] NAVER Corp, Seongnam, South Korea
[4] Mirny Inc, Seoul, South Korea
[5] Samsung Res, Seoul, South Korea
[6] Samsung Elect, Suwon, South Korea
关键词
stream processing; KV store; state management; PERFORMANCE;
D O I
10.1145/3552326.3567493
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
We propose FlowKV, a persistent store tailored for large-scale state management of streaming applications. Unlike existing KV stores, FlowKV leverages information from stream processing engines by taking a principled approach toward exploiting information about how and when the applications access data. FlowKV categorizes data access patterns of window operations according to how window boundaries are set and how tuples inside a window are aggregated, and deploys customized in-memory and on-disk data structures optimized for each pattern. In addition, FlowKV takes window metadata as explicit arguments of read and write methods to predict the moment when a window is read, and then loads the tuples of windows in batches from storage ahead of time. Using the NEXMark benchmark as workload, our experiments show that Apache Flink on FlowKV outperforms Flink on RocksDB or Faster with up to 4.12x throughput gain.
引用
收藏
页码:768 / 783
页数:16
相关论文
共 50 条
  • [11] Optimizing data stream processing for large-scale applications
    Cappellari, Paolo
    Roantree, Mark
    Chun, Soon Ae
    SOFTWARE-PRACTICE & EXPERIENCE, 2018, 48 (09): : 1607 - 1641
  • [12] DeepDSAIR: Deep 6-DOF camera relocalization using deblurred semantic-aware image representation for large-scale outdoor environments
    Esfahani, Mandi Abolfazli
    Wu, Keyu
    Yuan, Shenghai
    Wang, Han
    IMAGE AND VISION COMPUTING, 2019, 89 : 120 - 130
  • [13] Alovera: A Fast Stream Processing System for Large-Scale Data
    Zhang, Zhen'An
    Zhang, Dongjie
    Yu, Xiaopeng
    Wang, Jing
    He, Chunjiang
    Yuan, Pingpeng
    Jin, Hai
    2013 8TH CHINAGRID ANNUAL CONFERENCE (CHINAGRID), 2013, : 74 - 79
  • [14] Processing Online News Streams for Large-Scale Semantic Analysis
    Krstajic, Milos
    Mansmann, Florian
    Stoffel, Andreas
    Atkinson, Martin
    Keim, Daniel A.
    2010 IEEE 26TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING WORKSHOPS (ICDE 2010), 2010, : 215 - 220
  • [15] Large-Scale Experiment for Topology-Aware Resource Management
    Georgiou, Yiannis
    Mercier, Guillaume
    Villiermet, Adele
    EURO-PAR 2017: PARALLEL PROCESSING WORKSHOPS, 2018, 10659 : 179 - 186
  • [16] Predicting the Stability of Large-scale Distributed Stream Processing Systems on the Cloud
    Tri Minh Truong
    Harwood, Aaron
    Sinnott, Richard O.
    CLOSER: PROCEEDINGS OF THE 7TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND SERVICES SCIENCE, 2017, : 575 - 582
  • [17] Performance Analysis of Large-scale Distributed Stream Processing Systems on the Cloud
    Tri Minh Truong
    Harwood, Aaron
    Sinnott, Richard O.
    Chen, Shiping
    PROCEEDINGS 2018 IEEE 11TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING (CLOUD), 2018, : 754 - 761
  • [18] A United Framework for Large-Scale Resource Description Framework Stream Processing
    Fang, Hong
    Zhao, Bo
    Zhang, Xiao-Wang
    Yang, Xuan-Xing
    JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2019, 34 (04) : 762 - 774
  • [19] Storage optimization for large-scale distributed stream-processing systems
    Hildrum, Kirsten
    Douglis, Fred
    Wolf, Joel L.
    Yu, Philip S.
    Fleischer, Lisa
    Katta, Akshay
    ACM Transactions on Storage, 2008, 3 (04)
  • [20] A United Framework for Large-Scale Resource Description Framework Stream Processing
    Hong Fang
    Bo Zhao
    Xiao-Wang Zhang
    Xuan-Xing Yang
    Journal of Computer Science and Technology, 2019, 34 : 762 - 774