Sharing personal ECG time-series data privately

被引:1
|
作者
Bonomi, Luca [1 ]
Wu, Zeyun [2 ]
Fan, Liyue [3 ]
机构
[1] Vanderbilt Univ, Med Ctr, Dept Biomed Informat, Nashville, TN 37203 USA
[2] Univ Calif San Diego, Dept Elect & Comp Engn, La Jolla, CA 92093 USA
[3] Univ North Carolina Charlotte, Dept Comp Sci, Charlotte, NC USA
基金
美国国家科学基金会;
关键词
data privacy; ECG data; time-series; data sharing; predictive analytics; TELEMEDICINE; DEVICES;
D O I
10.1093/jamia/ocac047
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Objective Emerging technologies (eg, wearable devices) have made it possible to collect data directly from individuals (eg, time-series), providing new insights on the health and well-being of individual patients. Broadening the access to these data would facilitate the integration with existing data sources (eg, clinical and genomic data) and advance medical research. Compared to traditional health data, these data are collected directly from individuals, are highly unique and provide fine-grained information, posing new privacy challenges. In this work, we study the applicability of a novel privacy model to enable individual-level time-series data sharing while maintaining the usability for data analytics. Methods and materials We propose a privacy-protecting method for sharing individual-level electrocardiography (ECG) time-series data, which leverages dimensional reduction technique and random sampling to achieve provable privacy protection. We show that our solution provides strong privacy protection against an informed adversarial model while enabling useful aggregate-level analysis. Results We conduct our evaluations on 2 real-world ECG datasets. Our empirical results show that the privacy risk is significantly reduced after sanitization while the data usability is retained for a variety of clinical tasks (eg, predictive modeling and clustering). Discussion Our study investigates the privacy risk in sharing individual-level ECG time-series data. We demonstrate that individual-level data can be highly unique, requiring new privacy solutions to protect data contributors. Conclusion The results suggest our proposed privacy-protection method provides strong privacy protections while preserving the usefulness of the data.
引用
收藏
页码:1152 / 1160
页数:9
相关论文
共 50 条
  • [1] Privacy-Aware Time-Series Data Sharing With Deep Reinforcement Learning
    Erdemir, Ecenaz
    Dragotti, Pier Luigi
    Gunduz, Deniz
    [J]. IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2021, 16 : 389 - 401
  • [2] ANALYZING TIME-SERIES FOR FORECASTING (A PERSONAL VIEW)
    ANDERSON, OD
    [J]. RAIRO-RECHERCHE OPERATIONNELLE-OPERATIONS RESEARCH, 1989, 23 (02): : 113 - 150
  • [3] Time-Series Data Mining
    Esling, Philippe
    Agon, Carlos
    [J]. ACM COMPUTING SURVEYS, 2012, 45 (01)
  • [4] Fuzzy data mining for time-series data
    Chen, Chun-Hao
    Hong, Tzung-Pei
    Tseng, Vincent S.
    [J]. APPLIED SOFT COMPUTING, 2012, 12 (01) : 536 - 542
  • [5] THE ROLE OF TIME-SERIES ANALYSIS IN FORECASTING - A PERSONAL VIEW
    DEGOOIJER, JG
    [J]. INTERNATIONAL JOURNAL OF FORECASTING, 1990, 6 (04) : 449 - 451
  • [6] Spectral analysis of time-series data
    Gregson, RAM
    [J]. CONTEMPORARY PSYCHOLOGY-APA REVIEW OF BOOKS, 1999, 44 (04): : 306 - 309
  • [7] Clustering of multivariate time-series data
    Singhal, A
    Seborg, DE
    [J]. PROCEEDINGS OF THE 2002 AMERICAN CONTROL CONFERENCE, VOLS 1-6, 2002, 1-6 : 3931 - 3936
  • [8] DIRECTIONAL CORRELATION IN TIME-SERIES DATA
    STRAHAN, RR
    [J]. PSYCHOPHYSIOLOGY, 1970, 6 (05) : 652 - &
  • [9] A time-series database for environmental data
    Halliburton, GA
    [J]. ENVIRONMENTAL SOFTWARE SYSTEMS, VOL 2, 1997, : 205 - 208
  • [10] Clustering multivariate time-series data
    Singhal, A
    Seborg, DE
    [J]. JOURNAL OF CHEMOMETRICS, 2005, 19 (08) : 427 - 438