A Specialized Architecture for Object Serialization with Applications to Big Data Analytics

被引:15
|
作者
Jang, Jaeyoung [1 ]
Jung, Sung Jun [2 ]
Jeong, Sunmin [2 ]
Heo, Jun [2 ]
Shin, Hoon [2 ]
Ham, Tae Jun [2 ]
Lee, Jae W. [2 ]
机构
[1] Sungkyunkwan Univ, Seoul, South Korea
[2] Seoul Natl Univ, Seoul, South Korea
基金
新加坡国家研究基金会;
关键词
Object serialization; Domain-specific architecture; Data analytics; Apache Spark; Hardware-software co-design; MEMORY;
D O I
10.1109/ISCA45697.2020.00036
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Object serialization and deserialization (S/D) is an essential feature for efficient communication between distributed computing nodes with potentially non-uniform execution environments. S/D operations are widely used in big data analytics frameworks for remote procedure calls and massive data transfers like shuffles. However, frequent S/D operations incur significant performance and energy overheads as they must traverse and process a large object graph. Prior approaches improve S/D throughput by effectively hiding disk or network I/O latency with computation, increasing compression ratio, and/or application-specific customization. However, inherent dependencies in the existing (de)serialization formats and algorithms eventually become the major performance bottleneck. Thus, we propose Cereal, a specialized hardware accelerator for memory object serialization. By co-designing the serialization format with hardware architecture, Cereal effectively utilizes abundant parallelism in the S/D process to deliver high throughput. Cereal also employs an efficient object packing scheme to compress metadata such as object reference offsets and a space-efficient bitmap representation for the object layout. Our evaluation of Cereal using both a cycle-level simulator and synthesizable Chisel RTL demonstrates that Cereal delivers 43.4x higher average S/D throughput than 88 other S/D libraries on Java Serialization Benchmark Suite. For six Spark applications Cereal achieves 7.97x and 4.81x speedups on average for S/D operations over Java built-in serializer and Kryo, respectively, while saving S/D energy by 227.75x and 136.28x.
引用
收藏
页码:322 / 334
页数:13
相关论文
共 50 条
  • [41] Big data and predictive analytics: A sytematic review of applications
    Jamarani, Amirhossein
    Haddadi, Saeid
    Sarvizadeh, Raheleh
    Kashani, Mostafa Haghi
    Akbari, Mohammad
    Moradi, Saeed
    [J]. ARTIFICIAL INTELLIGENCE REVIEW, 2024, 57 (07)
  • [42] Advanced Machine Learning Applications in Big Data Analytics
    Li, Taiyong
    Deng, Wu
    Wu, Jiang
    [J]. ELECTRONICS, 2023, 12 (13)
  • [43] Big data analytics opportunities for applications in process engineering
    Sadat Lavasani, Mitra
    Raeisi Ardali, Nahid
    Sotudeh-Gharebagh, Rahmat
    Zarghami, Reza
    Abonyi, Janos
    Mostoufi, Navid
    [J]. REVIEWS IN CHEMICAL ENGINEERING, 2023, 39 (03) : 479 - 511
  • [44] Tutorial: Big Data Analytics: Concepts, Technologies, and Applications
    Watson, Hugh J.
    [J]. COMMUNICATIONS OF THE ASSOCIATION FOR INFORMATION SYSTEMS, 2014, 34 : 1247 - 1268
  • [45] Big Data and Predictive Analytics Applications in the Care of Children
    Suresh, Srinivasan
    [J]. PEDIATRIC CLINICS OF NORTH AMERICA, 2016, 63 (02) : 357 - +
  • [46] Big Data And Analytics: Concepts, Techniques, Methods, and Applications
    Kaisler, Stephen H.
    Armour, Frank J.
    Espinosa, Alberto J.
    [J]. PROCEEDINGS OF THE 49TH ANNUAL HAWAII INTERNATIONAL CONFERENCE ON SYSTEM SCIENCES (HICSS 2016), 2016, : 1059 - 1060
  • [47] A SURVEY ON BIG DATA: INFRASTRUCTURE, ANALYTICS, VISUALIZATION AND APPLICATIONS
    Saraswathi, S.
    Deepa, G.
    Vennila, G.
    Parthasarathy, S.
    Ramadoss, B.
    [J]. INTERNATIONAL JOURNAL OF INDUSTRIAL ENGINEERING-THEORY APPLICATIONS AND PRACTICE, 2022, 29 (05): : 618 - 648
  • [48] Investigation and Characterization of MapReduce Applications for Big Data Analytics
    Li, Y.
    Lam, T. B. V.
    Do, T. V. Van
    Chakka, R.
    Rotter, C.
    [J]. JOURNAL OF SCIENTIFIC & INDUSTRIAL RESEARCH, 2018, 77 (09): : 493 - 498
  • [49] Explore Big Data Analytics Applications and Opportunities: A Review
    Al-Sai, Zaher Ali
    Husin, Mohd Heikal
    Syed-Mohamad, Sharifah Mashita
    Abdin, Rasha Moh'd Sadeq
    Damer, Nour
    Abualigah, Laith
    Gandomi, Amir H. H.
    [J]. BIG DATA AND COGNITIVE COMPUTING, 2022, 6 (04)
  • [50] AI Empowered Big Data Analytics for Industrial Applications
    Kumar, V. D. Ambeth
    Varadarajan, Vijayakumar
    Gupta, Mukesh Kumar
    Rodrigues, Joel J. P. C.
    Janu, Neha
    [J]. JOURNAL OF UNIVERSAL COMPUTER SCIENCE, 2022, 28 (08) : 877 - 881