FlowKV: A Semantic-Aware Store for Large-Scale State Management of Stream Processing Engines

Cited by: 1
Authors
Lee, Gyewon [1, 2]
Maeng, Jaewoo [2]
Park, Jinsol [2]
Seo, Jangho [3]
Cho, Haeyoon [2]
Yang, Youngseok [4]
Um, Taegeon [5]
Lee, Jongsung [2, 6]
Lee, Jae W. [2]
Chun, Byung-Gon [1, 2]
Affiliations
[1] FriendliAI, Seoul, South Korea
[2] Seoul Natl Univ, Seoul, South Korea
[3] NAVER Corp, Seongnam, South Korea
[4] Mirny Inc, Seoul, South Korea
[5] Samsung Res, Seoul, South Korea
[6] Samsung Elect, Suwon, South Korea
Keywords
stream processing; KV store; state management; performance
DOI
10.1145/3552326.3567493
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
We propose FlowKV, a persistent store tailored for large-scale state management of streaming applications. Unlike existing KV stores, FlowKV leverages information from stream processing engines by taking a principled approach toward exploiting information about how and when the applications access data. FlowKV categorizes data access patterns of window operations according to how window boundaries are set and how tuples inside a window are aggregated, and deploys customized in-memory and on-disk data structures optimized for each pattern. In addition, FlowKV takes window metadata as explicit arguments of read and write methods to predict the moment when a window is read, and then loads the tuples of windows in batches from storage ahead of time. Using the NEXMark benchmark as workload, our experiments show that Apache Flink on FlowKV outperforms Flink on RocksDB or Faster with up to 4.12x throughput gain.
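The prefetching idea in the abstract (passing window metadata to read and write calls so the store can load windows shortly before they fire) can be illustrated with a minimal sketch. This is not FlowKV's API or implementation: the class `WindowAwareStore`, its methods, the `prefetch_horizon` parameter, and the plain dictionaries standing in for on-disk and in-memory structures are all hypothetical names chosen for illustration.

```python
class WindowAwareStore:
    """Toy store whose read/write calls carry window metadata.

    Hypothetical illustration only; not the FlowKV interface.
    """

    def __init__(self, prefetch_horizon):
        # How far ahead of a window's end time its tuples are loaded.
        self.prefetch_horizon = prefetch_horizon
        self.disk = {}    # stands in for on-disk storage
        self.cache = {}   # windows already loaded into memory
        self.misses = 0   # reads that found nothing prefetched

    def write(self, key, value, window_end):
        # The window end timestamp tells the store when the tuple will be read.
        self.disk.setdefault((key, window_end), []).append(value)

    def advance_time(self, now):
        # Batch-load every stored window that ends within the horizon.
        due = [w for w in self.disk if w[1] <= now + self.prefetch_horizon]
        for w in due:
            self.cache.setdefault(w, []).extend(self.disk.pop(w))
        return len(due)

    def read(self, key, window_end):
        w = (key, window_end)
        cached = self.cache.pop(w, None)
        late = self.disk.pop(w, [])  # tuples written after the prefetch
        if cached is None:
            self.misses += 1         # window was not loaded ahead of time
            return late
        return cached + late
```

In this sketch, a caller writes tuples tagged with their window end time, calls `advance_time` as event time progresses, and then reads a window when it fires; a read counts as a miss only when the window was not loaded ahead of time.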
Pages: 768-783
Page count: 16