Improvement design for distributed real-time stream processing systems

Cited by: 2
Authors
Jiang W. [1]
Xu L.-G. [1]
Hu H.-B. [1]
Ma Y. [2]
Affiliations
[1] School of Information and Software Engineering, University of Electronic Science and Technology of China, Chengdu
[2] Department of Computer Science and Engineering, University of Notre Dame, Notre Dame
Keywords
Kafka; Kafka cat; Memory file system; Message queue; Real-time stream processing system
DOI
10.11989/JEST.1674-862X.80904011
Abstract
In the era of Big Data, a typical architecture for distributed real-time stream processing systems combines Flume, Kafka, and Storm. As a distributed message system, Kafka offers horizontal scalability and high throughput, and it is mainly deployed to address the speed mismatch between message producers and consumers. Kafka must quickly receive data sent by producers and quickly deliver data to consumers, so its performance is critical to the performance of the whole stream processing system. In this paper, we propose an improved design for real-time stream processing systems that focuses on Kafka's data loading process. We use Kafka cat to transfer data from the source directly to a Kafka topic, which reduces network transmission. We also use a memory file system to accelerate data loading, which addresses the bottleneck and performance problems caused by disk I/O. Extensive experiments evaluate the performance and show the superiority of the improved design. © 2008-2016 Journal of Electronic Science and Technology.
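The abstract does not give the commands used, so the following is only a minimal Python sketch of the two ideas it names: staging the source file in a memory-backed directory and loading it into a topic with the kafkacat command-line tool. The function names, the broker address, the topic name, and the use of `/dev/shm` (a tmpfs mount on most Linux systems) are assumptions for illustration, not details taken from the paper.

```python
import shutil
from pathlib import Path
from typing import List


def stage_in_memory(source: Path, mem_dir: Path) -> Path:
    """Copy the source file into a memory-backed directory (hypothetical helper).

    On Linux, pointing mem_dir at a tmpfs mount such as /dev/shm keeps the
    staged copy in RAM, so later reads avoid disk I/O.
    """
    mem_dir.mkdir(parents=True, exist_ok=True)
    return Path(shutil.copy2(source, mem_dir / source.name))


def kafkacat_produce_cmd(broker: str, topic: str, data_file: Path) -> List[str]:
    """Build a kafkacat producer command line (hypothetical helper).

    -P selects producer mode, -b names the broker list, -t names the topic,
    and -l sends the file one message per line.
    """
    return ["kafkacat", "-P", "-b", broker, "-t", topic, "-l", str(data_file)]
```

Passing the returned list to `subprocess.run(cmd, check=True)` would start the load, provided kafkacat is installed and the broker is reachable.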
Pages: 3-12
Page count: 9
Related papers
50 in total
  • [21] Real-time Visual Tracker by Stream Processing
    Mateo Lozano, Oscar
    Otsuka, Kazuhiro
    JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2009, 57 (02): : 285 - 295
  • [22] Real-time stream processing for Big Data
    Wingerath, Wolfram
    Gessert, Felix
    Friedrich, Steffen
    Ritter, Norbert
    IT-INFORMATION TECHNOLOGY, 2016, 58 (04): : 186 - 194
  • [23] The 8 requirements of real-time stream processing
    Stonebraker, M
    Çetintemel, U
    Zdonik, S
    SIGMOD RECORD, 2005, 34 (04) : 42 - 47
  • [24] A Computing Model for Real-Time Stream Processing
    Li Zhao
    Zhang Chuang
    Xu Ke-fu
    Chen Meng-meng
    2014 INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND BIG DATA (CCBD), 2014, : 134 - 137
  • [25] The Method of Communication Quality Improvement in Distributed Systems with Real-Time Services
    Hajder, M.
    Bartczak, T.
    HUMAN-COMPUTER SYSTEMS INTERACTION: BACKGROUNDS AND APPLICATIONS 2, PT 1, 2012, 98 : 253 - 269
  • [26] AN APPROACH TO THE DESIGN OF DISTRIBUTED REAL-TIME OPERATING-SYSTEMS
    CVIJOVIC, M
    KUNC, M
    MICROPROCESSORS AND MICROSYSTEMS, 1992, 16 (02) : 81 - 89
  • [28] Design and evaluation of real-time distributed manufacturing control systems
    Agent Oriented Software Limited, Mill Lane, Cambridge CB2 1RX, United Kingdom
    Unknown
    Int. J. Manuf. Technol. Manage., 2006, 1-3 (264-282):
  • [29] Performance-based design of distributed real-time systems
    Kang, DI
    Gerber, R
    Saksena, M
    THIRD IEEE REAL-TIME TECHNOLOGY AND APPLICATIONS SYMPOSIUM, PROCEEDINGS, 1997, : 2 - 13
  • [30] Design and realization of distributed real-time controllers for mechatronic systems
    Deppe, M
    Zanella, M
    DESIGN AND ANALYSIS OF DISTRIBUTED EMBEDDED SYSTEMS, 2002, 91 : 277 - 286