A study on quality control using delta data with machine learning technique

被引:4
|
作者
Liang, Yufang [1 ]
Wang, Zhe [2 ]
Huang, Dawei [1 ,3 ]
Wang, Wei [4 ]
Feng, Xiang [2 ]
Han, Zewen [2 ]
Song, Biao [2 ]
Wang, Qingtao [1 ,5 ]
Zhou, Rui [1 ,5 ]
机构
[1] Capital Med Univ, Beijing Chao yang Hosp, Dept Lab Med, Beijing, Peoples R China
[2] Inner Mongolia Wesure Date Technol Co Ltd, Hohhot, Inner Mongolia, Peoples R China
[3] Beijing Longfu Hosp, Dept Lab Med, Beijing, Peoples R China
[4] Capital Med Univ, Beijing Ditan Hosp, Dept Blood Transfus, Beijing, Peoples R China
[5] Beijing Ctr Clin Labs, Beijing, Peoples R China
关键词
Delta data; Machine learning; Random forest; Data processing; Patient-based real-time quality control; PERFORMANCE; ALGORITHM;
D O I
10.1016/j.heliyon.2022.e09935
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background: In the big data era, patient-based real-time quality control (PBRTQC), as an emerging quality control (QC) method, is expanding within the clinical laboratory industry. However, the main issue of current PBRTQC methodology is data stability. Our study is aimed to explore a novel protocol for data stability by combining delta data with machine learning (ML) technique to improve the capacity of QC event detection.Methods: A data set of 423,290 laboratory results from Beijing Chao-yang Hospital 2019 patient results were used as a training set (n = 380960, 90%) and internal validation set (n = 42330, 10%). A further 22,460 results from Beijing Long-fu Hospital 2019 patient results were used as a test set. Three-type data (1) Single-type data pro-cessed by truncation limits; (2) delta-type data processed by truncation limits and (3)delta-type data processed by Isolated Forest (IF) algorithm were evaluated with accuracy, sensitivity, NPed, etc., and compared with previously published statistical methods. Results: The optimal model was based on Random Forest (RF) algorithm by using delta-type data processed by IF algorithm. The model had a better accuracy (0.99), sensitivity (0.99) specificity (0.99) and AUC (0.99) with the dependent test set, surpassing the critical bias of PBRTQC by over 50%. For the LYMPH#, HGB, and PLT, the cumulative MNPed of MLQC were reduced by 95.43%, 97.39%, and 97.97% respectively when compared to the best of the PBRTQC. Conclusion: Final results indicate that by integrating an innovative ML algorithm with the overall data processing protocol the detection of QC events is improved.
引用
收藏
页数:10
相关论文
共 50 条
  • [21] Predictive Analysis of Medical Data using a Hybrid Machine Learning Technique
    Rajawat, Pushpendra Singh
    Gupta, Deepak Kumar
    Rathore, Santosh Singh
    Singh, Avtar
    2018 FIRST INTERNATIONAL CONFERENCE ON SECURE CYBER COMPUTING AND COMMUNICATIONS (ICSCCC 2018), 2018, : 228 - 233
  • [22] Market Data Analysis by Using Support Vector Machine Learning Technique
    Reddy, Raghavendra
    Shyam, Gopal K.
    PROCEEDINGS OF INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND DATA ENGINEERING (ICCIDE 2018), 2019, 28 : 19 - 27
  • [23] A MACHINE LEARNING APPROACH FOR DATA QUALITY CONTROL OF EARTH OBSERVATION DATA MANAGEMENT SYSTEM
    Hau, Weiguo
    Jochum, Matthew
    IGARSS 2020 - 2020 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2020, : 3101 - 3103
  • [24] To Control Diabetes Using Machine Learning Algorithm and Calorie Measurement Technique
    Viveka, T.
    Columbus, C. Christopher
    Velmurugan, N. Senthil
    INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2022, 33 (01): : 535 - 547
  • [25] Machine learning and financial big data control using IoT
    Xiao, Jian
    Intelligent Decision Technologies, 2024, 18 (04) : 2657 - 2670
  • [26] A study of job involvement prediction using machine learning technique
    Choi, Youngkeun
    Choi, Jae Won
    INTERNATIONAL JOURNAL OF ORGANIZATIONAL ANALYSIS, 2021, 29 (03) : 788 - 800
  • [27] Using machine learning prediction models for quality control: a case study from the automotive industry
    Msakni M.K.
    Risan A.
    Schütz P.
    Computational Management Science, 2023, 20 (1)
  • [28] Data Quality for Machine Learning Tasks
    Gupta, Nitin
    Mujumdar, Shashank
    Patel, Hima
    Masuda, Satoshi
    Panwar, Naveen
    Bandyopadhyay, Sambaran
    Mehta, Sameep
    Guttula, Shanmukha
    Afzal, Shazia
    Mittal, Ruhi Sharma
    Munigala, Vitobha
    KDD '21: PROCEEDINGS OF THE 27TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2021, : 4040 - 4041
  • [29] Machine learning for quality control system
    San-Payo, Goncalo
    Ferreira, Joao Carlos
    Santos, Pedro
    Martins, Ana Lucia
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2020, 11 (11) : 4491 - 4500
  • [30] Machine learning for quality control system
    Gonçalo San-Payo
    João Carlos Ferreira
    Pedro Santos
    Ana Lúcia Martins
    Journal of Ambient Intelligence and Humanized Computing, 2020, 11 : 4491 - 4500