A Machine-Learning Algorithm with Disjunctive Model for Data-Driven Program Analysis

被引:17
|
作者
Jeon, Minseok [1 ]
Jeong, Sehun [1 ]
Cha, Sungdeok [1 ]
Oh, Hakjoo [1 ]
机构
[1] Korea Univ, Dept Comp Sci & Engn, 145 Anam Ro, Seoul 02841, South Korea
关键词
Data-driven program analysis; static analysis; context-sensitivity; flow-sensitivity; POINTS-TO ANALYSIS; CONTEXT-SENSITIVITY; STRATEGY; PRECISE; OCTAGON;
D O I
10.1145/3293607
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
We present a new machine-learning algorithm with disjunctive model for data-driven program analysis. One major challenge in static program analysis is a substantial amount of manual effort required for tuning the analysis performance. Recently, data-driven program analysis has emerged to address this challenge by automatically adjusting the analysis based on data through a learning algorithm. Although this new approach has proven promising for various program analysis tasks, its effectiveness has been limited due to simple-minded learning models and algorithms that are unable to capture sophisticated, in particular disjunctive, program properties. To overcome this shortcoming, this article presents a new disjunctive model for data-driven program analysis as well as a learning algorithm to find the model parameters. Our model uses Boolean formulas over atomic features and therefore is able to express nonlinear combinations of program properties. A key technical challenge is to efficiently determine a set of good Boolean formulas, as brute-force search would simply be impractical. We present a stepwise and greedy algorithm that efficiently learns Boolean formulas. We show the effectiveness and generality of our algorithm with two static analyzers: context-sensitive points-to analysis for Java and flow-sensitive interval analysis for C. Experimental results show that our automated technique significantly improves the performance of the state-of-the-art techniques including ones hand-crafted by human experts.
引用
收藏
页数:41
相关论文
共 50 条
  • [1] A scalable learning algorithm for data-driven program analysis
    Cha, Sooyoung
    Jeong, Sehun
    Oh, Hakjoo
    INFORMATION AND SOFTWARE TECHNOLOGY, 2018, 104 : 1 - 13
  • [2] ANALYSIS OF PIEZOELECTRIC SEMICONDUCTORS VIA DATA-DRIVEN MACHINE-LEARNING TECHNIQUES
    Guo, Yu-ting
    Li, De-zhi
    Zhang, Chun-li
    PROCEEDINGS OF THE 2020 15TH SYMPOSIUM ON PIEZOELECTRCITY, ACOUSTIC WAVES AND DEVICE APPLICATIONS (SPAWDA), 2021, : 258 - 262
  • [3] DATA-DRIVEN PREDICTION OF CELLULAR NETWORKS COVERAGE: AN INTERPRETABLE MACHINE-LEARNING MODEL
    Ghasemi, Amir
    2018 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP 2018), 2018, : 604 - 608
  • [4] A Data-Driven Comparative Analysis of Machine-Learning Models for Familial Hypercholesterolemia Detection
    Kocejko, Tomasz
    Applied Sciences (Switzerland), 2024, 14 (23):
  • [5] A model for identifying the feeder-transformer relationship in distribution grids using a data-driven machine-learning algorithm
    Gao, Yongmin
    Kang, Bing
    Xiao, Hui
    Wang, Zongyao
    Ding, Guili
    Xu, Zhihao
    Liu, Chuan
    Wang, Daxing
    Li, Yutong
    FRONTIERS IN ENERGY RESEARCH, 2023, 11
  • [6] Data-Driven Machine-Learning Methods for Diabetes Risk Prediction
    Dritsas, Elias
    Trigka, Maria
    SENSORS, 2022, 22 (14)
  • [7] Personalized Tourist Recommender System: A Data-Driven and Machine-Learning Approach
    Shrestha, Deepanjal
    Tan, Wenan
    Shrestha, Deepmala
    Rajkarnikar, Neesha
    Jeong, Seung-Ryul
    COMPUTATION, 2024, 12 (03)
  • [8] Data-driven machine-learning analysis of potential embolic sources in embolic stroke of undetermined source
    Ntaios, G.
    Weng, S. F.
    Perlepe, K.
    Akyea, R.
    Condon, L.
    Lambrou, D.
    Sirimarco, G.
    Strambo, D.
    Eskandari, A.
    Karagkiozi, E.
    Vemmou, A.
    Korompoki, E.
    Manios, E.
    Makaritsis, K.
    Vemmos, K.
    Michel, P.
    EUROPEAN JOURNAL OF NEUROLOGY, 2021, 28 (01) : 192 - 201
  • [9] Machine-Learning Techniques Assist Data-Driven Well-Performance Optimization
    Carpenter, Chris
    JPT, Journal of Petroleum Technology, 2021, 73 (10): : 63 - 64
  • [10] Modeling and prediction of slug characteristics utilizing data-driven machine-learning methodology
    Kim, Tea-Woo
    Kim, Sungil
    Lim, Jung-Tek
    JOURNAL OF PETROLEUM SCIENCE AND ENGINEERING, 2020, 195