Logging requirement for continuous auditing of responsible machine learning-based applications

被引：0

作者：

Patrick Loic Foalem ^{[1
]}

Leuson Da Silva ^{[1
]}

Foutse Khomh ^{[1
]}

Heng Li ^{[1
]}

Ettore Merlo ^{[1
]}

机构：

[1] Polytechnique Montreal,Department of Computer Engineering and Software Engineering

来源：

Empirical Software Engineering | 2025年 / 30卷 / 3期

关键词：

Empirical; GitHub repository; Machine learning; Responsible ML; Logging; Auditing; Transparency; Fairness; Accountability;

D O I：

10.1007/s10664-025-10656-8

中图分类号：

学科分类号：

摘要：

Machine learning (ML) is increasingly used across various industries to automate decision-making processes. However, concerns about the ethical and legal compliance of ML models have arisen due to their lack of transparency, fairness, and accountability. Monitoring, particularly through logging, is a widely used technique in traditional software systems that could be leveraged to assist in auditing ML-based applications. Logs provide a record of an application’s behavior, which can be used for continuous auditing, debugging, and analyzing both the behavior and performance of the application. In this study, we investigate the logging practices of ML practitioners to capture responsible ML-related information in ML applications. We analyzed 85 ML projects hosted on GitHub, leveraging 20 responsible ML libraries that span principles such as privacy, transparency & explainability, fairness, and security & safety. Our analysis revealed important differences in the implementation of responsible AI principles. For example, out of 5,733 function calls analyzed, privacy accounted for 89.3% (5,120 calls), while fairness represented only 2.1% (118 calls), highlighting the uneven emphasis on these principles across projects. Furthermore, our manual analysis of 44,877 issue discussions revealed that only 8.1% of the sampled issues addressed responsible AI principles, with transparency & explainability being the most frequently discussed principles (32.2% of all issues related to responsible AI principles). Additionally, a survey conducted with ML practitioners provided direct insights into their perspectives, informing our exploration of ways to enhance logging practices for more effective, responsible ML auditing. We discovered that while privacy, model interpretability & explainability, fairness, and security & safety are commonly considered, there is a gap in how metrics associated with these principles are logged. Specifically, crucial fairness metrics like group and individual fairness, privacy metrics such as epsilon and delta, and explainability metrics like SHAP values are not considered current logging practices. The insights from this study highlight the need for ML practitioners and logging tool developers to adopt enhanced logging strategies that incorporate a broader range of responsible AI metrics. This adjustment will facilitate the development of auditable and ethically responsible ML applications, ensuring they meet emerging regulatory and societal expectations. These specific insights offer actionable guidance for improving the accountability and trustworthiness of ML systems.

引用

共 50 条

[31] Machine Learning-Based Continuous Intracranial Pressure Prediction for Traumatic Injury Patients
YE, G. U. O. C. H. A. N. G.
BALASUBRAMANIAN, V. I. G. N. E. S. H.
LI, J. O. H. N. K-J.
KAYA, M. E. H. M. E. T.
IEEE JOURNAL OF TRANSLATIONAL ENGINEERING IN HEALTH AND MEDICINE, 2022, 10
[32] Machine Learning-Based Probabilistic Seismic Demand Model of Continuous Girder Bridges
Li, Wenshan
Huang, Yong
Xie, Zikai
ADVANCES IN CIVIL ENGINEERING, 2022, 2022
[33] A Full Population Auditing Method Based on Machine Learning
Chen, Yasheng
Wu, Zhuojun
Yan, Hui
SUSTAINABILITY, 2022, 14 (24)
[34] Novel machine learning-based EOIR sensor performance modeling for naval applications
Crow, Brandon J.
Espinola, Richard L.
Owens, Saba
Wilson, Rebecca
INFRARED IMAGING SYSTEMS: DESIGN, ANALYSIS, MODELING, AND TESTING XXXIV, 2023, 12533
[35] Machine Learning-Based Modeling for Structural Engineering: A Comprehensive Survey and Applications Overview
Etim, Bassey
Al-Ghosoun, Alia
Renno, Jamil
Seaid, Mohammed
Mohamed, M. Shadi
BUILDINGS, 2024, 14 (11)
[36] Machine learning-based physical layer security: techniques, open challenges, and applications
Anil Kumar Kamboj
Poonam Jindal
Pankaj Verma
Wireless Networks, 2021, 27 : 5351 - 5383
[37] Machine Learning-Based Monitoring of DC-DC Converters in Photovoltaic Applications
Bindi, Marco
Corti, Fabio
Aizenberg, Igor
Grasso, Francesco
Lozito, Gabriele Maria
Luchetta, Antonio
Piccirilli, Maria Cristina
Reatti, Alberto
ALGORITHMS, 2022, 15 (03)
[38] Correction to: A review of machine learning-based human activity recognition for diverse applications
Farzana Kulsoom
Sanam Narejo
Zahid Mehmood
Hassan Nazeer Chaudhry
Ayesha Butt
Ali Kashif Bashir
Neural Computing and Applications, 2023, 35 : 5591 - 5591
[39] Machine learning-based physical layer security: techniques, open challenges, and applications
Kamboj, Anil Kumar
Jindal, Poonam
Verma, Pankaj
WIRELESS NETWORKS, 2021, 27 (08) : 5351 - 5383
[40] Orchestrating the Development Lifecycle of Machine Learning-based IoT Applications: A Taxonomy and Survey
Qian, Bin
Su, Jie
Wen, Zhenyu
Jha, Devki Nandan
Li, Yinhao
Guan, Yu
Puthal, Deepak
James, Philip
Yang, Renyu
Zomaya, Albert Y.
Rana, Omer
Wang, Lizhe
Koutny, Maciej
Ranjan, Rajiv
ACM COMPUTING SURVEYS, 2020, 53 (04)

← 1 2 3 4 5 →