Logging requirement for continuous auditing of responsible machine learning-based applications

Authors
Patrick Loic Foalem [1]
Leuson Da Silva [1]
Foutse Khomh [1]
Heng Li [1]
Ettore Merlo [1]
Affiliations
[1] Polytechnique Montreal, Department of Computer Engineering and Software Engineering
Keywords
Empirical; GitHub repository; Machine learning; Responsible ML; Logging; Auditing; Transparency; Fairness; Accountability
DOI
10.1007/s10664-025-10656-8
Abstract
Machine learning (ML) is increasingly used across various industries to automate decision-making processes. However, concerns about the ethical and legal compliance of ML models have arisen due to their lack of transparency, fairness, and accountability. Monitoring, particularly through logging, is a widely used technique in traditional software systems that could be leveraged to assist in auditing ML-based applications. Logs provide a record of an application’s behavior, which can be used for continuous auditing, debugging, and analyzing both the behavior and performance of the application. In this study, we investigate the logging practices of ML practitioners to capture responsible ML-related information in ML applications. We analyzed 85 ML projects hosted on GitHub, leveraging 20 responsible ML libraries that span principles such as privacy, transparency & explainability, fairness, and security & safety. Our analysis revealed important differences in the implementation of responsible AI principles. For example, out of 5,733 function calls analyzed, privacy accounted for 89.3% (5,120 calls), while fairness represented only 2.1% (118 calls), highlighting the uneven emphasis on these principles across projects. Furthermore, our manual analysis of 44,877 issue discussions revealed that only 8.1% of the sampled issues addressed responsible AI principles, with transparency & explainability being the most frequently discussed principles (32.2% of all issues related to responsible AI principles). Additionally, a survey conducted with ML practitioners provided direct insights into their perspectives, informing our exploration of ways to enhance logging practices for more effective, responsible ML auditing. We discovered that while privacy, model interpretability & explainability, fairness, and security & safety are commonly considered, there is a gap in how metrics associated with these principles are logged. 
Specifically, crucial fairness metrics such as group and individual fairness, privacy metrics such as epsilon and delta, and explainability metrics such as SHAP values are not covered by current logging practices. The insights from this study highlight the need for ML practitioners and logging tool developers to adopt enhanced logging strategies that incorporate a broader range of responsible AI metrics. This adjustment will facilitate the development of auditable and ethically responsible ML applications, helping them meet emerging regulatory and societal expectations. These insights offer actionable guidance for improving the accountability and trustworthiness of ML systems.
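To make the logging gap described in the abstract concrete, the following is a minimal sketch of what logging responsible-ML metrics could look like, using only the Python standard library. It is not the authors' tooling. The logger name `responsible_ml.audit`, the helper functions, the record layout, and all numeric values are assumptions made for illustration. Demographic parity difference stands in for a group fairness metric, and the attribution rows stand in for precomputed SHAP values.

```python
import json
import logging
import sys

# Hypothetical logger name; any application logger would do.
audit_log = logging.getLogger("responsible_ml.audit")
audit_log.setLevel(logging.INFO)
if not audit_log.handlers:
    audit_log.addHandler(logging.StreamHandler(sys.stdout))


def demographic_parity_difference(y_pred, groups):
    """Group fairness: largest gap in positive-prediction rate across groups."""
    totals, positives = {}, {}
    for pred, group in zip(y_pred, groups):
        totals[group] = totals.get(group, 0) + 1
        positives[group] = positives.get(group, 0) + (1 if pred == 1 else 0)
    rates = [positives[g] / totals[g] for g in totals]
    return max(rates) - min(rates)


def mean_abs_attribution(attributions, feature_names):
    """Explainability summary: mean absolute attribution per feature.

    `attributions` is a list of rows of per-feature scores (for example
    precomputed SHAP values); computing them is out of scope here.
    """
    n = len(attributions)
    return {
        name: sum(abs(row[i]) for row in attributions) / n
        for i, name in enumerate(feature_names)
    }


def log_audit_record(principle, metrics, **context):
    """Emit one JSON line per audited principle and return the record."""
    record = {"principle": principle, "metrics": metrics, **context}
    audit_log.info(json.dumps(record, sort_keys=True))
    return record


# Illustrative values only; none of these come from the study.
y_pred = [1, 0, 1, 1]
groups = ["a", "a", "b", "b"]
fairness = log_audit_record(
    "fairness",
    {"demographic_parity_difference": demographic_parity_difference(y_pred, groups)},
    model_version="demo-0",
)
privacy = log_audit_record(
    "privacy", {"epsilon": 1.0, "delta": 1e-5}, model_version="demo-0"
)
explain = log_audit_record(
    "explainability",
    {"mean_abs_attribution": mean_abs_attribution(
        [[0.5, -0.25], [-0.5, 0.25]], ["age", "income"])},
    model_version="demo-0",
)
```

Writing one JSON object per line keeps each record machine-readable, so an auditor could later filter the log by principle or model version without parsing free-form messages.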