Machine Learning-Based Analysis of Program Binaries: A Comprehensive Study

Cited by: 23
Authors
Xue, Hongfa [1]
Sun, Shaowen [1]
Venkataramani, Guru [1]
Lan, Tian [1]
Affiliations
[1] George Washington Univ, Washington, DC 20052 USA
Keywords
Machine learning; program binary analysis; taxonomy; neural networks; n-grams; malware; classification; similarity; search; model
DOI
10.1109/ACCESS.2019.2917668
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Binary code analysis is crucial in various software engineering tasks, such as malware detection, code refactoring, and plagiarism detection. With the rapid growth of software complexity and the increasing number of heterogeneous computing platforms, binary analysis is particularly critical and more important than ever. Traditionally adopted techniques for binary code analysis are facing multiple challenges, such as the need for cross-platform analysis, high scalability and speed, and improved fidelity, to name a few. To meet these challenges, machine learning-based binary code analysis frameworks attract substantial attention due to their automated feature extraction and drastically reduced efforts needed on large-scale programs. In this paper, we provide the taxonomy of machine learning-based binary code analysis, describe the recent advances and key findings on the topic, and discuss the key challenges and opportunities. Finally, we present our thoughts for future directions on this topic.
Pages: 65889-65912
Page count: 24