NeuroCrypt: Machine Learning Over Encrypted Distributed Neuroimaging Data

被引:6
|
作者
Senanayake, Nipuna [1 ]
Podschwadt, Robert [1 ]
Takabi, Daniel [1 ]
Calhoun, Vince D. [1 ]
Plis, Sergey M. [1 ]
机构
[1] Georgia State Univ, Atlanta, GA 30303 USA
关键词
Neuroimaging; Machine learning; Privacy; Secure multiparty computation; Logistic regression; Convolutional neural networks; DISEASE; SCHIZOPHRENIA; BIOMARKERS;
D O I
10.1007/s12021-021-09525-8
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The field of neuroimaging can greatly benefit from building machine learning models to detect and predict diseases, and discover novel biomarkers, but much of the data collected at various organizations and research centers is unable to be shared due to privacy or regulatory concerns (especially for clinical data or rare disorders). In addition, aggregating data across multiple large studies results in a huge amount of duplicated technical debt and the resources required can be challenging or impossible for an individual site to build. Training on the data distributed across organizations can result in models that generalize much better than models trained on data from any of organizations alone. While there are approaches for decentralized sharing, these often do not provide the highest possible guarantees of sample privacy that only cryptography can provide. In addition, such approaches are often focused on probabilistic solutions. In this paper, we propose an approach that leverages the potential of datasets spread among a number of data collecting organizations by performing joint analyses in a secure and deterministic manner when only encrypted data is shared and manipulated. The approach is based on secure multiparty computation which refers to cryptographic protocols that enable distributed computation of a function over distributed inputs without revealing additional information about the inputs. It enables multiple organizations to train machine learning models on their joint data and apply the trained models to encrypted data without revealing their sensitive data to the other parties. In our proposed approach, organizations (or sites) securely collaborate to build a machine learning model as it would have been trained on the aggregated data of all the organizations combined. Importantly, the approach does not require a trusted party (i.e. aggregator), each contributing site plays an equal role in the process, and no site can learn individual data of any other site. We demonstrate effectiveness of the proposed approach, in a range of empirical evaluations using different machine learning algorithms including logistic regression and convolutional neural network models on human structural and functional magnetic resonance imaging datasets.
引用
收藏
页码:91 / 108
页数:18
相关论文
共 50 条
  • [1] NeuroCrypt: Machine Learning Over Encrypted Distributed Neuroimaging Data
    Nipuna Senanayake
    Robert Podschwadt
    Daniel Takabi
    Vince D. Calhoun
    Sergey M. Plis
    [J]. Neuroinformatics, 2022, 20 : 91 - 108
  • [2] Machine Learning Classification over Encrypted Data
    Bost, Raphael
    Popa, Raluca Ada
    Tu, Stephen
    Goldwasser, Shafi
    [J]. 22ND ANNUAL NETWORK AND DISTRIBUTED SYSTEM SECURITY SYMPOSIUM (NDSS 2015), 2015,
  • [3] Distributed computing over encrypted data
    Freris, Nikolaos M.
    Patrinos, Panagiotis
    [J]. 2016 54TH ANNUAL ALLERTON CONFERENCE ON COMMUNICATION, CONTROL, AND COMPUTING (ALLERTON), 2016, : 1116 - 1122
  • [4] Distributed Query Evaluation over Encrypted Data
    di Vimercati, Sabrina De Capitani
    Foresti, Sara
    Jajodia, Sushil
    Livraga, Giovanni
    Paraboschi, Stefano
    Samarati, Pierangela
    [J]. DATA AND APPLICATIONS SECURITY AND PRIVACY XXXV, 2021, 12840 : 96 - 114
  • [5] Efficient machine learning over encrypted data with non-interactive communication
    Park, Heejin
    Kim, Pyung
    Kim, Heeyoul
    Park, Ki-Woong
    Lee, Younho
    [J]. COMPUTER STANDARDS & INTERFACES, 2018, 58 : 87 - 108
  • [6] Private AI: Machine Learning on Encrypted Data
    Lauter, Kristin
    [J]. RECENT ADVANCES IN INDUSTRIAL AND APPLIED MATHEMATICS, 2022, : 97 - 113
  • [7] Machine Learning Training on Encrypted Data with TFHE
    Montero, Luis
    Frery, Jordan
    Kherfallah, Celia
    Bredehoft, Roman
    Stoian, Andrei
    [J]. PROCEEDINGS OF THE 10TH ACM INTERNATIONAL WORKSHOP ON SECURITY AND PRIVACY ANALYTICS, IWSPA 2024, 2024, : 71 - 76
  • [8] Machine Learning Approach for Analysing Encrypted Data
    Pradeepthi, K., V
    Tiwari, Vikas
    Saxena, Ashutosh
    [J]. 2018 10TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING (ICOAC), 2018, : 70 - 73
  • [9] Ranked Keyword Search Over Encrypted Cloud Data Through Machine Learning Method
    Miao, Yinbin
    Zheng, Wei
    Jia, Xiaohua
    Liu, Ximeng
    Choo, Kim-Kwang Raymond
    Deng, Robert H.
    [J]. IEEE TRANSACTIONS ON SERVICES COMPUTING, 2023, 16 (01) : 525 - 536
  • [10] Support vector machine classification over encrypted data
    Hai Huang
    Yongjian Wang
    Haoran Zong
    [J]. Applied Intelligence, 2022, 52 : 5938 - 5948