Improvement Design for Distributed Real-Time Stream Processing Systems

被引:0
|
作者
Wei Jiang [1 ]
LiuGen Xu [1 ]
HaiBo Hu [1 ]
Yue Ma [2 ]
机构
[1] the School of Information and Software Engineering, University of Electronic Science and Technology of China
[2] the Department of Computer Science and Engineering, University of Notre
关键词
D O I
暂无
中图分类号
学科分类号
摘要
In the era of Big Data, typical architecture of distributed real-time stream processing systems is the combination of Flume, Kafka, and Storm. As a kind of distributed message system, Kafka has the characteristics of horizontal scalability and high throughput, which is manly deployed in many areas in order to address the problem of speed mismatch between message producers and consumers. When using Kafka, we need to quickly receive data sent by producers. In addition, we need to send data to consumers quickly. Therefore, the performance of Kafka is of critical importance to the performance of the whole stream processing system. In this paper, we propose the improved design of real-time stream processing systems, and focus on improving the Kafka’s data loading process.We use Kafka cat to transfer data from the source to Kafka topic directly, which can reduce the network transmission. We also utilize the memory file system to accelerate the process of data loading, which can address the bottleneck and performance problems caused by disk I/O. Extensive experiments are conducted to evaluate the performance, which show the superiority of our improved design.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] Improvement design for distributed real-time stream processing systems
    Jiang, Wei
    Xu, Liu-Gen
    Hu, Hai-Bo
    Ma, Yue
    [J]. Journal of Electronic Science and Technology, 2019, 17 (01) : 3 - 12
  • [2] Improvement Design for Distributed Real-Time Stream Processing Systems
    Wei Jiang
    Liu-Gen Xu
    Hai-Bo Hu
    Yue Ma
    [J]. Journal of Electronic Science and Technology, 2019, (01) : 3 - 12
  • [3] Patterns for Distributed Real-Time Stream Processing
    Basanta-Val, Pablo
    Fernandez-Garcia, Norberto
    Sanchez-Fernandez, Luis
    Arias-Fisteus, Jesus
    [J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2017, 28 (11) : 3243 - 3257
  • [4] Fine real-time processing in distributed systems
    Yakoh, T
    Sato, H
    Aoyama, T
    [J]. 2000 IEEE INTERNATIONAL WORKSHOP ON FACTORY COMMUNICATION SYSTEMS, PROCEEDINGS, 2000, : 135 - 142
  • [5] RASP: Real-time Network Analytics with Distributed NoSQL Stream Processing
    Touloupas, Georgios
    Konstantinou, Ioannis
    Koziris, Nectarios
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2017, : 2414 - 2419
  • [6] Mobile Storm: Distributed Real-time Stream Processing for Mobile Clouds
    Ning, Qian
    Chen, Chien-An
    Stoleru, Radu
    Chen, Congcong
    [J]. 2015 IEEE 4TH INTERNATIONAL CONFERENCE ON CLOUD NETWORKING (CLOUDNET), 2015, : 139 - 145
  • [7] MDDRSPF: A Model Driven Distributed Real-time Stream Processing Framework
    Wen, Yijun
    Zhang, Li
    Wang, Cheng
    [J]. 2019 IEEE INTL CONF ON PARALLEL & DISTRIBUTED PROCESSING WITH APPLICATIONS, BIG DATA & CLOUD COMPUTING, SUSTAINABLE COMPUTING & COMMUNICATIONS, SOCIAL COMPUTING & NETWORKING (ISPA/BDCLOUD/SOCIALCOM/SUSTAINCOM 2019), 2019, : 1352 - 1358
  • [8] Commit processing in distributed real-time database systems
    Gupta, R
    Haritsa, J
    Ramamritham, K
    Seshadri, S
    [J]. 17TH IEEE REAL-TIME SYSTEMS SYMPOSIUM, PROCEEDINGS, 1996, : 220 - 229
  • [9] Hierarchical design method for real-time distributed systems
    Yamane, S
    [J]. FIFTH INTERNATIONAL CONFERENCE ON REAL-TIME COMPUTING SYSTEMS AND APPLICATIONS, PROCEEDINGS, 1998, : 189 - 192
  • [10] Network conscious design of distributed real-time systems
    Park, JW
    Kim, YS
    Hong, SS
    Saksena, M
    Noh, SH
    Kwon, WH
    [J]. JOURNAL OF SYSTEMS ARCHITECTURE, 1998, 45 (02) : 131 - 156