A Specialized Architecture for Object Serialization with Applications to Big Data Analytics

被引:15
|
作者
Jang, Jaeyoung [1 ]
Jung, Sung Jun [2 ]
Jeong, Sunmin [2 ]
Heo, Jun [2 ]
Shin, Hoon [2 ]
Ham, Tae Jun [2 ]
Lee, Jae W. [2 ]
机构
[1] Sungkyunkwan Univ, Seoul, South Korea
[2] Seoul Natl Univ, Seoul, South Korea
基金
新加坡国家研究基金会;
关键词
Object serialization; Domain-specific architecture; Data analytics; Apache Spark; Hardware-software co-design; MEMORY;
D O I
10.1109/ISCA45697.2020.00036
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Object serialization and deserialization (S/D) is an essential feature for efficient communication between distributed computing nodes with potentially non-uniform execution environments. S/D operations are widely used in big data analytics frameworks for remote procedure calls and massive data transfers like shuffles. However, frequent S/D operations incur significant performance and energy overheads as they must traverse and process a large object graph. Prior approaches improve S/D throughput by effectively hiding disk or network I/O latency with computation, increasing compression ratio, and/or application-specific customization. However, inherent dependencies in the existing (de)serialization formats and algorithms eventually become the major performance bottleneck. Thus, we propose Cereal, a specialized hardware accelerator for memory object serialization. By co-designing the serialization format with hardware architecture, Cereal effectively utilizes abundant parallelism in the S/D process to deliver high throughput. Cereal also employs an efficient object packing scheme to compress metadata such as object reference offsets and a space-efficient bitmap representation for the object layout. Our evaluation of Cereal using both a cycle-level simulator and synthesizable Chisel RTL demonstrates that Cereal delivers 43.4x higher average S/D throughput than 88 other S/D libraries on Java Serialization Benchmark Suite. For six Spark applications Cereal achieves 7.97x and 4.81x speedups on average for S/D operations over Java built-in serializer and Kryo, respectively, while saving S/D energy by 227.75x and 136.28x.
引用
收藏
页码:322 / 334
页数:13
相关论文
共 50 条
  • [1] Big data analytics with applications
    Bi, Zhuming
    Cochran, David
    [J]. JOURNAL OF MANAGEMENT ANALYTICS, 2014, 1 (04) : 249 - 265
  • [2] The δ big data architecture for mobility analytics
    Vouros, George A.
    Glenis, Apostolos
    Doulkeridis, Christos
    [J]. 2020 IEEE SIXTH INTERNATIONAL CONFERENCE ON BIG DATA COMPUTING SERVICE AND APPLICATIONS (BIGDATASERVICE 2020), 2020, : 25 - 32
  • [3] A Dockerized Big Data Architecture for Sports Analytics
    Ozguven, Yavuz Melih
    Gonener, Utku
    Eken, Suleyman
    [J]. COMPUTER SCIENCE AND INFORMATION SYSTEMS, 2022, 19 (02) : 957 - 978
  • [4] Big Data Analytics Architecture for Security Intelligence
    Dauda, Ahmed
    Mclean, Scott
    Almehmadi, Abdulaziz
    El-Khatib, Khalil
    [J]. 11TH INTERNATIONAL CONFERENCE ON SECURITY OF INFORMATION AND NETWORKS (SIN 2018), 2018,
  • [5] A Big Data Analytics Architecture for Industry 4.0
    Santos, Maribel Yasmina
    Oliveira e Sa, Jorge
    Costa, Carlos
    Galvao, Joao
    Andrade, Carina
    Martinho, Bruno
    Lima, Francisca Vale
    Costa, Eduarda
    [J]. RECENT ADVANCES IN INFORMATION SYSTEMS AND TECHNOLOGIES, VOL 2, 2017, 570 : 175 - 184
  • [6] SwiftAnalytics: Optimizing Object Storage for Big Data Analytics
    Rupprecht, Lukas
    Zhang, Rui
    Owen, Bill
    Pietzuch, Peter
    Hildebrand, Dean
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON CLOUD ENGINEERING (IC2E 2017), 2017, : 245 - 251
  • [7] A Distributed Big Data Analytics Architecture for Vehicle Sensor Data
    Alexakis, Theodoros
    Peppes, Nikolaos
    Demestichas, Konstantinos
    Adamopoulou, Evgenia
    [J]. SENSORS, 2023, 23 (01)
  • [8] Applications of Big Data Analytics Tools for Data Management
    Jamshidi M.
    Tannahill B.
    Ezell M.
    Yetis Y.
    Kaplan H.
    [J]. Jamshidi, Mo (moj@wacong.org), 1600, Springer Science and Business Media Deutschland GmbH (18): : 177 - 199
  • [9] On Urban Data Analytics and Applications in the Big Data Era
    Tomaras, Dimitrios
    [J]. PROCEEDINGS OF THE 2024 25TH IEEE INTERNATIONAL CONFERENCE ON MOBILE DATA MANAGEMENT, MDM 2024, 2024, : 328 - 330
  • [10] Intelligent technologies and applications for big data analytics
    You, Ilsun
    Ogiela, Marek R.
    Hwang, Myunggwon
    [J]. SOFTWARE-PRACTICE & EXPERIENCE, 2015, 45 (08): : 1019 - 1021