A study on quality control using delta data with machine learning technique

Cited by: 4
Authors
Liang, Yufang [1]
Wang, Zhe [2]
Huang, Dawei [1, 3]
Wang, Wei [4]
Feng, Xiang [2]
Han, Zewen [2]
Song, Biao [2]
Wang, Qingtao [1, 5]
Zhou, Rui [1, 5]
Affiliations
[1] Capital Med Univ, Beijing Chao-yang Hosp, Dept Lab Med, Beijing, Peoples R China
[2] Inner Mongolia Wesure Date Technol Co Ltd, Hohhot, Inner Mongolia, Peoples R China
[3] Beijing Longfu Hosp, Dept Lab Med, Beijing, Peoples R China
[4] Capital Med Univ, Beijing Ditan Hosp, Dept Blood Transfus, Beijing, Peoples R China
[5] Beijing Ctr Clin Labs, Beijing, Peoples R China
Keywords
Delta data; Machine learning; Random forest; Data processing; Patient-based real-time quality control; PERFORMANCE; ALGORITHM
DOI
10.1016/j.heliyon.2022.e09935
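Chinese Library Classification (CLC)
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences]
Discipline classification codes
07; 0710; 09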
Abstract
Background: In the big data era, patient-based real-time quality control (PBRTQC), an emerging quality control (QC) method, is expanding within the clinical laboratory industry. However, the main issue with current PBRTQC methodology is data stability. This study aimed to explore a novel protocol for data stability that combines delta data with machine learning (ML) techniques to improve the capacity of QC event detection.
Methods: A data set of 423,290 patient results from Beijing Chao-yang Hospital in 2019 was used as a training set (n = 380,960, 90%) and an internal validation set (n = 42,330, 10%). A further 22,460 patient results from Beijing Long-fu Hospital in 2019 were used as a test set. Three types of data were evaluated: (1) single-type data processed by truncation limits; (2) delta-type data processed by truncation limits; and (3) delta-type data processed by the Isolation Forest (IF) algorithm. They were assessed by accuracy, sensitivity, NPed, etc., and compared with previously published statistical methods.
Results: The optimal model was based on the Random Forest (RF) algorithm using delta-type data processed by the IF algorithm. The model achieved high accuracy (0.99), sensitivity (0.99), specificity (0.99), and AUC (0.99) on the independent test set, surpassing the critical bias of PBRTQC by over 50%. For LYMPH#, HGB, and PLT, the cumulative MNPed of MLQC was reduced by 95.43%, 97.39%, and 97.97%, respectively, compared with the best PBRTQC method.
Conclusion: The results indicate that integrating an innovative ML algorithm with the overall data processing protocol improves the detection of QC events.
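As a rough illustration of the first two data types named in the Methods, the sketch below computes delta data and applies truncation limits. It rests on assumptions not stated in the abstract: delta data is taken to be the difference between consecutive results in the result stream, the truncation limits are taken to be the 5th and 95th percentiles, and the function names `delta_series` and `truncate` are hypothetical. The Isolation Forest and Random Forest steps are not shown.

```python
from statistics import quantiles


def delta_series(results):
    """Return differences between consecutive results: x[i] - x[i-1].

    Assumption: "delta data" means consecutive differences in the
    result stream; the abstract does not give the exact definition.
    """
    return [b - a for a, b in zip(results, results[1:])]


def truncate(values, lower_pct=5, upper_pct=95):
    """Keep only values inside the given percentile limits.

    The 5th/95th percentile defaults are illustrative assumptions,
    not the limits used in the study.
    """
    # quantiles(..., n=100) returns the 99 percentile cut points.
    cuts = quantiles(values, n=100, method="inclusive")
    lo, hi = cuts[lower_pct - 1], cuts[upper_pct - 1]
    return [v for v in values if lo <= v <= hi]


if __name__ == "__main__":
    # Made-up HGB-like values (g/L), for illustration only.
    hgb = [135.0, 142.0, 128.0, 150.0, 90.0, 139.0, 141.0]
    deltas = delta_series(hgb)
    print("delta data:", deltas)
    print("after truncation:", truncate(deltas))
```

In a pipeline like the one the abstract describes, the truncated (or otherwise filtered) delta values would then feed the downstream classifier; how the study windows and labels those values is not specified here.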
Pages: 10