Effective Multi-stream Joining in Apache Samza Framework

被引:10
|
作者
Zhuang, Zhenyun [1 ]
Feng, Tao [1 ]
Pan, Yi [1 ]
Ramachandra, Haricharan [1 ]
Sridharan, Badri [1 ]
机构
[1] LinkedIn Corp, 2029 Stierlin Court, Mountain View, CA 94043 USA
关键词
Multi-stream joining; Samza; Stream processing; Big Data;
D O I
10.1109/BigDataCongress.2016.41
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Increasing adoption of Big Data in business environments have driven the needs of stream joining in realtime fashion. Multi-stream joining is an important stream processing type in todays Internet companies, and it has been used to generate higher-quality data in business pipelines. Multi-stream joining can be performed in two models: ( 1) All-In-One (AIO) Joining and (2) Step-By-Step (SBS) Joining. Both models have advantages and disadvantages with regard to memory footprint, joining latency, deployment complexity, etc. In this work, we analyze the performance tradeoffs associated with these two models using Apache Samza.
引用
收藏
页码:267 / 274
页数:8
相关论文
共 50 条
  • [31] On Multi-stream Multi-source Multicast Routing
    Chen, Yuh-Rong
    Radhakrishnan, Sridhar
    Dhall, Sudarshan K.
    Karabuk, Suleyman
    [J]. 2012 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2012,
  • [32] A spatiotemporal multi-stream learning framework based on attention mechanism for automatic modulation recognition
    Wang, Xu
    Liu, Dejun
    Zhang, Yuhao
    Li, Yang
    Wu, Shiwei
    [J]. DIGITAL SIGNAL PROCESSING, 2022, 130
  • [33] A novel optimization framework for designing multi-stream compact heat exchangers and associated network
    Wang, Zhe
    Sunden, Bengt
    Li, Yanzhong
    [J]. APPLIED THERMAL ENGINEERING, 2017, 116 : 110 - 125
  • [34] A Pronunciation Prior Assisted Vowel Reduction Detection Framework with Multi-Stream Attention Method
    Liu, Zongming
    Huang, Zhihua
    Wang, Li
    Zhang, Pengyuan
    [J]. APPLIED SCIENCES-BASEL, 2021, 11 (18):
  • [35] A spatiotemporal multi-stream learning framework based on attention mechanism for automatic modulation recognition
    College of Information Science and Engineering, China University of Petroleum-Beijing, Beijing, China
    [J]. Digital Signal Process Rev J,
  • [36] Discovering significant patterns in multi-stream sequences
    Gwadera, Robert
    Crestani, Fabio
    [J]. ICDM 2008: EIGHTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2008, : 827 - 832
  • [37] Fast Algorithms for Multi-stream Content Detection
    Huang, Yuping
    Chen, Sanfeng
    [J]. PROCEEDINGS OF THE SECOND INTERNATIONAL SYMPOSIUM ON ELECTRONIC COMMERCE AND SECURITY, VOL II, 2009, : 34 - +
  • [38] Mixing enhancement in a multi-stream injection nozzle
    Peter Vorobieff
    C. Randall Truman
    Adam M. Ragheb
    Gregory S. Elliott
    Julia K. Laystrom-Woodard
    Darren M. King
    David L. Carroll
    Wayne C. Solomon
    [J]. Experiments in Fluids, 2011, 51 : 711 - 722
  • [39] Research on Grey Modeling for Multi-stream Information
    Liu, Xin
    Dai, Jin
    Zhou, Weijie
    [J]. JOURNAL OF GREY SYSTEM, 2016, 28 (04): : 127 - 137
  • [40] Freezing Frozen Pages with Multi-Stream SSDs
    Park, Hyun-Woo
    Choi, Soyee
    An, Mijin
    Lee, Sang-Won
    [J]. 15TH INTERNATIONAL WORKSHOP ON DATA MANAGEMENT ON NEW HARDWARE (DAMON 2019), 2019,