Bad Snakes: Understanding and Improving Python']Python Package Index Malware Scanning

被引:2
|
作者
Vu, Duc-Ly [1 ,2 ]
Newman, Zachary [1 ]
Meyers, John Speed [1 ]
机构
[1] Chainguard, Ho Chi Minh City, Vietnam
[2] FPT Univ, Hanoi, Vietnam
关键词
Open-source software (OSS) Supply Chain; Malware Detection; PyPI; Qualitative Study; Quantitative Study;
D O I
10.1109/ICSE48619.2023.00052
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Open-source, community-driven package repositories see thousands of malware packages each year, but do not currently run automated malware detection systems. In this work, we explore the security goals of the repository administrators and the requirements for deploying such malware scanners via a case study of the Python ecosystem and PyPI repository, including interviews with administrators and maintainers. Further, we evaluate existing malware detection techniques for deployment in this setting by creating a benchmark dataset and comparing several existing tools: the malware checks implemented in PyPI, Bandit4Mal, and OSSGadget's OSS Detect Backdoor. We find that repository administrators have exacting requirements for such malware detection tools. Specifically, they consider a false positive rate of even 0.1% to be unacceptably high, given the large number of package releases that might trigger false alerts. Measured tools have false positive rates between 15% and 97%; increasing thresholds for detection rules to reduce this rate renders the true positive rate useless. While automated tools are far from reaching these demands, we find that a socio-technical malware detection system has emerged to meet these needs: external security researchers perform repository malware scans, filter for useful results, and report the results to repository administrators. These parties face different incentives and constraints on their time and tooling. We conclude with recommendations for improving detection capabilities and strengthening the collaboration between security researchers and software repository administrators.
引用
收藏
页码:499 / 511
页数:13
相关论文
共 8 条
  • [1] Malicious Packages Lurking in User-Friendly Python']Python Package Index
    Liang, Genpei
    Zhou, Xiangyu
    Wang, Qingyu
    Du, Yutong
    Huang, Cheng
    [J]. 2021 IEEE 20TH INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS (TRUSTCOM 2021), 2021, : 606 - 613
  • [2] Deathdaily: A Python']Python Package Index for predicting the number of daily COVID-19 deaths
    Takefuji, Yoshiyasu
    [J]. NETWORK MODELING AND ANALYSIS IN HEALTH INFORMATICS AND BIOINFORMATICS, 2022, 11 (01):
  • [3] SSHAA: A Python']Python Package Index for visualizing features of SSH attacks with text mining in classification
    Nakamura, Yuki
    Fukuda, Tsukasa
    Yang, Xuanzhou
    Takefuji, Yoshiyasu
    [J]. INFORMATION SECURITY JOURNAL, 2024, : 50 - 62
  • [4] TopoTEM: A Python']Python Package for Quantifying and Visualizing Scanning Transmission Electron Microscopy Data of Polar Topologies
    O'Connell, Eoghan N.
    Moore, Kalani
    McFall, Elora
    Hennessy, Michael
    Moynihan, Eoin
    Bangert, Ursel
    Conroy, Michele
    [J]. MICROSCOPY AND MICROANALYSIS, 2022, 28 (04) : 1444 - 1452
  • [5] Utility of the Python']Python package Geoweaver_cwl for improving workflow reusability: an illustration with multidisciplinary use cases
    Kale, Amruta
    Sun, Ziheng
    Ma, Xiaogang
    [J]. EARTH SCIENCE INFORMATICS, 2023, 16 (03) : 2955 - 2961
  • [6] Deathdaily: A Python Package Index for predicting the number of daily COVID-19 deaths
    Yoshiyasu Takefuji
    [J]. Network Modeling Analysis in Health Informatics and Bioinformatics, 2022, 11
  • [7] Utility of the Python package Geoweaver_cwl for improving workflow reusability: an illustration with multidisciplinary use cases
    Amruta Kale
    Ziheng Sun
    Xiaogang Ma
    [J]. Earth Science Informatics, 2023, 16 : 2955 - 2961
  • [8] The spectralrao-monitoring Python']Python package: A RAO's Q diversity index-based application for land-cover/land-use change detection in multifunctional agricultural areas
    Tassi, Andrea
    Massetti, Andrea
    Gil, Artur
    [J]. COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2022, 196