Performance Analysis of Java']Java Virtual Machine for Machine Learning Workloads using Apache Spark

被引:0
|
作者
Hema, N. [1 ]
Srinivasa, K. G. [1 ]
Chidambaram, Saravanan [2 ]
Saraswat, Sandeep [2 ]
Saraswati, Sujoy [2 ]
Ramachandra, Ranganath [2 ]
Huttanagoudar, Jayashree B. [3 ]
机构
[1] MSRIT, Dept CSE, Bangalore 54, Karnataka, India
[2] Hewlett Packard Enterprise, Bangalore 560048, Karnataka, India
[3] RVCE, Dept CSE, Bangalore 59, Karnataka, India
关键词
Big data; Machine Learning (ML); Apache Spark; Hadoop; !text type='Java']Java[!/text] Virtual Machine (JVM);
D O I
10.1145/2980258.2982117
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Now a day's data is growing very rapidly, where processing and analyzing data to get useful information is the main task. There are many big data processing tools and framework such as Hadoop, Hive, Cassandra etc. Spark is one of the fastest big data processing framework in cluster computation. Basic Idea is to analyze the performance of java virtual machine (JVM) [1], by characterizing java virtual machine using SparkBench benchmark on Apache Spark (TM) [2]. Java virtual machine is a core execution platform for spark application. When we run the spark application on java virtual machine, its behavior is affected, which needs to be monitored to analyze the JVM performance. Here we are considering Machine Learning workloads like K-Means, Matrix Factorization and Logistic Regression. Main goal here is to analyze the machine learning workloads end to end across the cluster, with respect to following parameters such as garbage collection, memory such as heap usage, CPU process time. Characterization of JVM is done with spark cluster setup and HDFS is used as storage with distributed Hadoop cluster setup.
引用
收藏
页数:7
相关论文
共 50 条
  • [31] On a new method for dataflow analysis of Java']Java Virtual Machine subroutines
    Hagiya, M
    Tozawa, A
    STATIC ANALYSIS, 1998, 1503 : 17 - 32
  • [32] Preservation of proof obligations from Java']Java to the Java']Java virtual machine
    Barthe, Gilles
    Gregoire, Benjamin
    Pavlova, Mariela
    AUTOMATED REASONING, PROCEEDINGS, 2008, 5195 : 83 - +
  • [33] Inside the Java virtual machine
    Lindholm, Tim
    Yellin, Frank
    Performance Computing/Unix Review, 1997, 15 (01):
  • [34] Hardware realization of a Java']Java Virtual Machine for high performance multimedia applications
    Berekovic, M
    Kloos, H
    Pirsch, P
    SIPS 97 - 1997 IEEE WORKSHOP ON SIGNAL PROCESSING SYSTEMS: DESIGN AND IMPLEMENTATION, 1997, : 479 - 488
  • [35] Cross-architectural performance Portability of a Java']Java virtual machine implementation
    Jacob, M
    Randall, K
    USENIX ASSOCIATION PROCEEDINGS OF THE 2ND JAVA(TM) VIRTUAL MACHINE RESEARCH AND TECHNOLOGY SYMPOSIUM, 2002, : 79 - 89
  • [36] Hardware realization of a java']java virtual machine for high performance multimedia applications
    Berekovic, M
    Kloos, H
    Pirsch, P
    JOURNAL OF VLSI SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 1999, 22 (01): : 31 - 43
  • [37] A fault-tolerant Java']Java virtual machine
    Napper, J
    Alvisi, L
    Vin, H
    2003 INTERNATIONAL CONFERENCE ON DEPENDABLE SYSTEMS AND NETWORKS, PROCEEDINGS, 2003, : 425 - 434
  • [38] Auto-tuning the Java']Java Virtual Machine
    Jayasena, Sanath
    Fernando, Milinda
    Rusira, Tharindu
    Perera, Chalitha
    Philips, Chamara
    2015 IEEE 29TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS, 2015, : 1261 - 1270
  • [39] Garbage collection in an embedded Java']Java virtual machine
    Cha, Chang-Il
    Kim, Hyung-Jun
    Hwang, Kyu-Jeong
    Kim, Sang-Wook
    Lee, Sang-Yun
    Won, Hee-Sun
    KNOWLEDGE-BASED INTELLIGENT INFORMATION AND ENGINEERING SYSTEMS, PT 1, PROCEEDINGS, 2006, 4251 : 443 - 450
  • [40] Operational semantics of the Java']Java Card Virtual Machine
    Siveroni, IA
    JOURNAL OF LOGIC AND ALGEBRAIC PROGRAMMING, 2004, 58 (1-2): : 3 - 25