A Secure and Reusable Software Architecture for Supporting Online Data Harmonization

Authors
Feric, Zlatan [1]
Agostini, Nicolas Bohm [1]
Beene, Daniel [4]
Signes-Pastor, Antonio J. [6]
Halchenko, Yuliya [6]
Watkins, Deborah [5]
MacKenzie, Debra [4]
Karagas, Margaret [6]
Manjourides, Justin [2]
Alshawabkeh, Akram [3]
Kaeli, David [1]
Affiliations
[1] Northeastern Univ, Dept Elect & Comp Engn, Boston, MA 02115 USA
[2] Northeastern Univ, Bouve Coll Hlth Sci, Boston, MA 02115 USA
[3] Northeastern Univ, Dept Civil & Environm Engn, Boston, MA 02115 USA
[4] Univ New Mexico, Coll Pharm, Hlth Sci Ctr, Community Environm Hlth Program, Albuquerque, NM 87131 USA
[5] Univ Michigan, Sch Publ Hlth, Environm Hlth Sci, Ann Arbor, MI 48109 USA
[6] Geisel Sch Med Dartmouth, Dept Epidemiol, Hanover, NH USA
Keywords
CHILD HEALTH OUTCOMES; ENVIRONMENTAL-INFLUENCES; EXPOSURE; MINE
DOI
10.1109/BigData52589.2021.9671538
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Retrospective data harmonization across multiple research cohorts and studies is frequently done to increase statistical power, enable comparative analysis, and create a richer data source for data mining. However, when combining disparate data sources, harmonization projects face data management and analysis challenges. These include differences in the data dictionaries and variable definitions, privacy concerns surrounding health data representing sensitive populations, and a lack of properly defined data models. With the availability of mature open-source web-based database technologies, developing a complete software architecture to overcome the challenges associated with the harmonization process can alleviate many roadblocks. By leveraging state-of-the-art software engineering and database principles, we can ensure data quality and enable cross-center online access and collaboration. This paper outlines a complete software architecture developed and customized using the Django web framework, leveraged to harmonize sensitive data collected from three NIH-supported birth cohorts. We describe our framework and show how we successfully overcame challenges faced when harmonizing data from these cohorts. We discuss our efforts in data cleaning, data sharing, data transformation, data visualization, and analytics, while reflecting on what we have learned to date from these harmonized datasets.
Pages: 2801-2812
Page count: 12