Data encoding for healthcare data democratization and information leakage prevention

被引:0
|
作者
Thakur, Anshul [1 ]
Zhu, Tingting [1 ]
Abrol, Vinayak [2 ]
Armstrong, Jacob [1 ]
Wang, Yujiang [1 ,3 ]
Clifton, David A. [1 ,3 ]
机构
[1] Univ Oxford, Dept Engn Sci, Oxford OX3 7DQ, Oxon, England
[2] IIIT Delhi, Infosys Ctr AI, Delhi, India
[3] Oxford Suzhou Ctr Adv Res, Suzhou, Peoples R China
基金
英国惠康基金;
关键词
PREDICTION;
D O I
10.1038/s41467-024-45777-z
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
The lack of data democratization and information leakage from trained models hinder the development and acceptance of robust deep learning-based healthcare solutions. This paper argues that irreversible data encoding can provide an effective solution to achieve data democratization without violating the privacy constraints imposed on healthcare data and clinical models. An ideal encoding framework transforms the data into a new space where it is imperceptible to a manual or computational inspection. However, encoded data should preserve the semantics of the original data such that deep learning models can be trained effectively. This paper hypothesizes the characteristics of the desired encoding framework and then exploits random projections and random quantum encoding to realize this framework for dense and longitudinal or time-series data. Experimental evaluation highlights that models trained on encoded time-series data effectively uphold the information bottleneck principle and hence, exhibit lesser information leakage from trained models. Healthcare data democratization is often hampered by privacy constraints governing the sensitive healthcare data. Here, the authors show that encoding healthcare data could be a potential solution for achieving healthcare democratization within the context of deep learning.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Data encoding for healthcare data democratization and information leakage prevention
    Anshul Thakur
    Tingting Zhu
    Vinayak Abrol
    Jacob Armstrong
    Yujiang Wang
    David A. Clifton
    Nature Communications, 15
  • [2] Data and Information Leakage Prevention Within the Scope of Information Security
    Hauer, Barbara
    IEEE ACCESS, 2015, 3 : 2554 - 2565
  • [3] Maximal information leakage from quantum encoding of classical data
    Farokhi, Farhad
    PHYSICAL REVIEW A, 2024, 109 (02)
  • [4] Prevention of Data Content Leakage
    Jadhav, Meghana N.
    Sanghavi, Jignyasa
    HELIX, 2018, 8 (05): : 3989 - 3992
  • [5] Prevention of Data Leakage due to Implicit Information Flows in Android Applications
    Inayoshi, Hiroki
    Kakei, Shohei
    Takimoto, Eiji
    Mouri, Koichi
    Saito, Shoichi
    2019 14TH ASIA JOINT CONFERENCE ON INFORMATION SECURITY (ASIAJCIS 2019), 2019, : 103 - 110
  • [6] An approach for prevention of privacy breach and information leakage in sensitive data mining
    Prakash, M.
    Singaravel, G.
    COMPUTERS & ELECTRICAL ENGINEERING, 2015, 45 : 134 - 140
  • [7] Data Leakage Prevention for Secure Cross-Domain Information Exchange
    Kongsgard, Kyrre Wahl
    Nordbotten, Nils Agne
    Mancini, Federico
    Haakseth, Raymond
    Engelstad, Paal E.
    IEEE COMMUNICATIONS MAGAZINE, 2017, 55 (10) : 37 - 43
  • [8] Detecting data semantic: A data leakage prevention approach
    Alneyadi, Sultan
    Sithirasenan, Elankayer
    Muthukkumarasamy, Vallipuram
    2015 IEEE TRUSTCOM/BIGDATASE/ISPA, VOL 1, 2015, : 910 - 917
  • [9] A survey on data leakage prevention systems
    Alneyadi, Sultan
    Sithirasenan, Elankayer
    Muthukkumarasamy, Vallipuram
    JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2016, 62 : 137 - 152
  • [10] DLPFS: The Data Leakage Prevention FileSystem
    Braghin, Stefano
    Simioni, Marco
    Sinn, Mathieu
    APPLIED CRYPTOGRAPHY AND NETWORK SECURITY WORKSHOPS, ACNS 2022, 2022, 13285 : 380 - 397