Identifying Continuous Glucose Monitoring Data Using Machine Learning

被引:6
|
作者
Herrero, Pau [1 ,3 ]
Reddy, Monika [2 ]
Georgiou, Pantelis [1 ]
Oliver, Nick S. [2 ]
机构
[1] Imperial Coll London, Ctr Bioinspired Technol, Dept Elect & Elect Engn, London, England
[2] Fac Med Imperial Coll, Dept Med, Div Diabet Endocrinol & Metab, London, England
[3] Imperial Coll London, Ctr Bioinspired Technol, Dept Elect & Elect Engn, South Kensington Campus, London SW7 2AZ, England
关键词
Type; 1; diabetes; Continuous glucose monitoring; Cybersecurity; Data privacy; Machine learning; CHALLENGES; CARE;
D O I
10.1089/dia.2021.0498
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Background and Aims: The recent increase in wearable devices for diabetes care, and in particular the use of continuous glucose monitoring (CGM), generates large data sets and associated cybersecurity challenges. In this study, we demonstrate that it is possible to identify CGM data at an individual level by using standard machine learning techniques.Methods: The publicly available REPLACE-BG data set (NCT02258373) containing 226 adult participants with type 1 diabetes (T1D) wearing CGM over 6 months was used. A support vector machine (SVM) binary classifier aiming to determine if a CGM data stream belongs to an individual participant was trained and tested for each of the subjects in the data set. To generate the feature vector used for classification, 12 standard glycemic metrics were selected and evaluated at different time periods of the day (24 h, day, night, breakfast, lunch, and dinner). Different window lengths of CGM data (3, 7, 15, and 30 days) were chosen to evaluate their impact on the classification performance. A recursive feature selection method was employed to select the minimum subset of features that did not significantly degrade performance.Results: A total of 40 features were generated as a result of evaluating the glycemic metrics over the selected time periods (24 h, day, night, breakfast, lunch, and dinner). A window length of 15 days was found to perform the best in terms of accuracy (86.8% +/- 12.8%) and F1 score (0.86 +/- 0.16). The corresponding sensitivity and specificity were 85.7% +/- 19.5% and 87.9% +/- 17.5%, respectively. Through recursive feature selection, a subset of 9 features was shown to perform similarly to the 40 features.Conclusion: It is possible to determine with a relatively high accuracy if a CGM data stream belongs to an individual. The proposed approach can be used as a digital CGM "fingerprint" or for detecting glycemic changes within an individual, for example during intercurrent illness.
引用
下载
收藏
页码:403 / 408
页数:6
相关论文
共 50 条
  • [31] Developing an Algorithm for Identifying Mortality in MarketScan Claims Data Using Machine Learning
    Xie, Fenglong
    Zhao, Hong
    Yun, Huifeng
    Bernatsky, Sasha
    Curtis, Jeffrey R.
    ARTHRITIS & RHEUMATOLOGY, 2020, 72
  • [32] Identifying multiple sclerosis subtypes using unsupervised machine learning and MRI data
    Eshaghi, Arman
    Young, Alexandra L.
    Wijeratne, Peter A.
    Prados, Ferran
    Arnold, Douglas L.
    Narayanan, Sridar
    Guttmann, Charles R. G.
    Barkhof, Frederik
    Alexander, Daniel C.
    Thompson, Alan J.
    Chard, Declan
    Ciccarelli, Olga
    NATURE COMMUNICATIONS, 2021, 12 (01)
  • [33] Data Gap Modeling in Continuous Glucose Monitoring Sensor Data
    Drecogna, Martina
    Vettoretti, Martina
    Del Favero, Simone
    Facchinetti, Andrea
    Sparacino, Giovanni
    2021 43RD ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE & BIOLOGY SOCIETY (EMBC), 2021, : 4379 - 4382
  • [34] Data Driven Network Monitoring and Intrusion Detection using Machine Learning
    Williams, Brandon
    Dong, Xishuang
    Qian, Lijun
    2020 SEVENTH INTERNATIONAL CONFERENCE ON SOCIAL NETWORK ANALYSIS, MANAGEMENT AND SECURITY (SNAMS), 2020, : 262 - 268
  • [35] Using machine learning techniques for Data Quality Monitoring in CMS and ALICE
    Deja, Kamil
    7TH ANNUAL CONFERENCE ON LARGE HADRON COLLIDER PHYSICS, LHCP2019, 2019,
  • [36] Crop Monitoring Using Satellite/UAV Data Fusion and Machine Learning
    Maimaitijiang, Maitiniyazi
    Sagan, Vasit
    Sidike, Paheding
    Daloye, Ahmad M.
    Erkbol, Hasanjan
    Fritschi, Felix B.
    REMOTE SENSING, 2020, 12 (09)
  • [37] Improving the Accuracy of Continuous Blood Glucose Measurement Using Personalized Calibration and Machine Learning
    Kumari, Ranjita
    Anand, Pradeep Kumar
    Shin, Jitae
    DIAGNOSTICS, 2023, 13 (15)
  • [38] Discrete glucose profiles identified using continuous glucose monitoring data and their association with adverse pregnancy outcomes
    Battarbee, Ashley N.
    Sauer, Sara M.
    Sanusi, Ayodeji
    Fulcher, Isabel
    AMERICAN JOURNAL OF OBSTETRICS AND GYNECOLOGY, 2024, 231 (01) : 122e1 - 122e9
  • [39] Evaluation of the Mean Absolute Glucose Change as a Measure of Glycemic Variability Using Continuous Glucose Monitoring Data
    Kohnert, Klaus-Dieter
    Heinke, Peter
    Fritzsche, Gert
    Vogt, Lutz
    Augstein, Petra
    Salzsieder, Eckhard
    DIABETES TECHNOLOGY & THERAPEUTICS, 2013, 15 (06) : 448 - 454
  • [40] Towards an Interpretable Continuous Glucose Monitoring Data Modeling
    Gaitan-Guerrero J.F.
    Lopez J.L.
    Espinilla M.
    Martinez-Cruz C.
    IEEE Internet of Things Journal, 2024, 11 (19) : 1 - 1