Distributed Analytics on Sensitive Medical Data: The Personal Health Train

被引:61
|
作者
Beyan, Oya [1 ,2 ]
Choudhury, Ananya [3 ]
van Soest, Johan [3 ,4 ]
Kohlbacher, Oliver [5 ,6 ,7 ,8 ]
Zimmermann, Lukas [7 ]
Stenzhorn, Holger [7 ]
Karim, Md Rezaul [1 ,2 ]
Dumontier, Michel [4 ]
Decker, Stefan [1 ,2 ]
Santos, Luiz Olavo Bonino da Silva [9 ]
Dekker, Andre [3 ]
机构
[1] Fraunhofer Inst Appl Informat Technol FIT, D-53754 St Augustin, Germany
[2] Rhein Westfal TH Aachen, D-52056 Aachen, Germany
[3] Maastricht Univ, GROW Sch Oncol & Dev Biol, Dept Radiat Oncol MAASTRO, Med Ctr, NL-6200 MD Maastricht, Netherlands
[4] Maastricht Univ, Inst Data Sci, Univ Singel 60, NL-6229 ER Maastricht, Netherlands
[5] Univ Tubingen, Dept Comp Sci, D-72076 Tubingen, Baden Wurttembe, Germany
[6] Univ Tubingen, Quantitat Biol Ctr, D-72076 Tubingen, Baden Wurttembe, Germany
[7] Univ Tubingen, Inst Translat Bioinformat, D-72076 Tubingen, Baden Wurttembe, Germany
[8] Univ Tubingen, Ctr Bioinformat, Tubingen, Germany
[9] Go FAIR Int Support & Coordinat Off GFISCO, Leiden, Netherlands
关键词
Distributed analytics; Data reuse; FAIR; Health data; Ethics and privacy;
D O I
10.1162/dint_a_00032
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In recent years, as newer technologies have evolved around the healthcare ecosystem, more and more data have been generated. Advanced analytics could power the data collected from numerous sources, both from healthcare institutions, or generated by individuals themselves via apps and devices, and lead to innovations in treatment and diagnosis of diseases; improve the care given to the patient; and empower citizens to participate in the decision-making process regarding their own health and well-being. However, the sensitive nature of the health data prohibits healthcare organizations from sharing the data. The Personal Health Train (PHT) is a novel approach, aiming to establish a distributed data analytics infrastructure enabling the (re)use of distributed healthcare data, while data owners stay in control of their own data. The main principle of the PHT is that data remain in their original location, and analytical tasks visit data sources and execute the tasks. The PHT provides a distributed, flexible approach to use data in a network of participants, incorporating the FAIR principles. It facilitates the responsible use of sensitive and/or personal data by adopting international principles and regulations. This paper presents the concepts and main components of the PHT and demonstrates how it complies with FAIR principles.
引用
收藏
页码:96 / 107
页数:12
相关论文
共 50 条
  • [21] A study on interoperability between two Personal Health Train infrastructures in leukodystrophy data analysis
    Welten, Sascha
    Herr, Marius de Arruda Botelho
    Hempel, Lars
    Hieber, David
    Placzek, Peter
    Graf, Michael
    Weber, Sven
    Neumann, Laurenz
    Jugl, Maximilian
    Tirpitz, Liam
    Kindermann, Karl
    Geisler, Sandra
    Santos, Luiz Olavo Bonino da Silva
    Decker, Stefan
    Pfeifer, Nico
    Kohlbacher, Oliver
    Kirsten, Toralf
    [J]. SCIENTIFIC DATA, 2024, 11 (01)
  • [22] Introduction to the Personal Data: Analytics and Management Minitrack
    Davies, Phil
    Oh, Joo Hee
    Parry, Glenn
    [J]. Proceedings of the Annual Hawaii International Conference on System Sciences, 2022, 2022-January : 1810 - 1811
  • [23] The Data Swarm: A Next Step for Distributed Data Analytics
    Smith, Jeffrey
    Rege, Manjeet
    [J]. INTERNATIONAL JOURNAL OF INFORMATION RETRIEVAL RESEARCH, 2016, 6 (01) : 52 - 64
  • [24] Medical Data Analytics Is Not a Simple Task
    Babic, Frantisek
    Vadovsky, Michal
    Paralic, Jan
    [J]. BUILDING CONTINENTS OF KNOWLEDGE IN OCEANS OF DATA: THE FUTURE OF CO-CREATED EHEALTH, 2018, 247 : 371 - 375
  • [25] Indoor Air Monitoring Platform and Personal Health Reporting System Big Data Analytics for Public Health Research
    Ho, Kin-Fai
    Hirai, Hoyee W.
    Kuo, Yong-Hong
    Meng, Helen M.
    Tsoi, Kelvin K. F.
    [J]. 2015 IEEE INTERNATIONAL CONGRESS ON BIG DATA - BIGDATA CONGRESS 2015, 2015, : 309 - 312
  • [26] Distributed Big Data Analytics in the Internet of Signals
    Anavangot, Vijay
    Menon, Varun G.
    Nayyar, Anand
    [J]. PROCEEDINGS OF THE 2018 INTERNATIONAL CONFERENCE ON SYSTEM MODELING & ADVANCEMENT IN RESEARCH TRENDS (SMART), 2018, : 73 - 77
  • [27] Implementation of Electronic Medical Record in Community Health Center Towards Medical Big Data Analytics Application
    Setiawan, Agung W.
    Utami, Nedya
    Mengko, Tati R.
    Indrayanto, Adi
    [J]. 2014 International Conference on Electrical Engineering and Computer Science (ICEECS), 2014, : 30 - 35
  • [28] Personal health indexing based on medical examinations: A data mining approach
    Chen, Ling
    Li, Xue
    Yang, Yi
    Kurniawati, Hanna
    Sheng, Quan Z.
    Hu, Hsiao-Yun
    Huang, Nicole
    [J]. DECISION SUPPORT SYSTEMS, 2016, 81 : 54 - 65
  • [29] Distributed algorithm for big data analytics in healthcare
    Forestiero, Agostino
    Papuzzo, Giuseppe
    [J]. 2018 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE (WI 2018), 2018, : 776 - 779
  • [30] Data Distribution and Scheduling for Distributed Analytics Tasks
    Pasteris, Stephen
    Wang, Shiqiang
    Makaya, Christian
    Chan, Kevin
    Herbster, Mark
    [J]. 2017 IEEE SMARTWORLD, UBIQUITOUS INTELLIGENCE & COMPUTING, ADVANCED & TRUSTED COMPUTED, SCALABLE COMPUTING & COMMUNICATIONS, CLOUD & BIG DATA COMPUTING, INTERNET OF PEOPLE AND SMART CITY INNOVATION (SMARTWORLD/SCALCOM/UIC/ATC/CBDCOM/IOP/SCI), 2017,