A Secure and Reusable Software Architecture for Supporting Online Data Harmonization

被引:0
|
作者
Feric, Zlatan [1 ]
Agostini, Nicolas Bohm [1 ]
Beene, Daniel [4 ]
Signes-Pastor, Antonio J. [6 ]
Halchenko, Yuliya [6 ]
Watkins, Deborah [5 ]
MacKenzie, Debra [4 ]
Karagas, Margaret [6 ]
Manjourides, Justin [2 ]
Alshawabkeh, Akram [3 ]
Kaeli, David [1 ]
机构
[1] Northeastern Univ, Dept Elect & Comp Engn, Boston, MA 02115 USA
[2] Northeastern Univ, Bouve Coll Hlth Sci, Boston, MA 02115 USA
[3] Northeastern Univ, Dept Civil & Environm Engn, Boston, MA 02115 USA
[4] Univ New Mexico, Coll Pharm, Hlth Sci Ctr, Community Environm Hlth Program, Albuquerque, NM 87131 USA
[5] Univ Michigan, Sch Publ Hlth, Environm Hlth Sci, Ann Arbor, MI 48109 USA
[6] Geisel Sch Med Dartmouth, Dept Epidemiol, Hanover, NH USA
关键词
CHILD HEALTH OUTCOMES; ENVIRONMENTAL-INFLUENCES; EXPOSURE; MINE;
D O I
10.1109/BigData52589.2021.9671538
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Retrospective data harmonization across multiple research cohorts and studies is frequently done to increase statistical power, provide comparison analysis, and create a richer data source for data mining. However, when combining disparate data sources, harmonization projects face data management and analysis challenges. These include differences in the data dictionaries and variable definitions, privacy concerns surrounding health data representing sensitive populations, and lack of properly defined data models. With the availability of mature open-source web-based database technologies, developing a complete software architecture to overcome the challenges associated with the harmonization process can alleviate many roadblocks. By leveraging state-of-the-art software engineering and database principles, we can ensure data quality and enable cross-center online access and collaboration. This paper outlines a complete software architecture developed and customized using the Django web framework, leveraged to harmonize sensitive data collected from three NIH-support birth cohorts. We describe our framework and show how we successfully overcame challenges faced when harmonizing data from these cohorts. We discuss our efforts in data cleaning, data sharing, data transformation, data visualization, and analytics, while reflecting on what we have learned to date from these harmonized datasets.
引用
收藏
页码:2801 / 2812
页数:12
相关论文
共 50 条
  • [1] Reusable Secure Connectors for Secure Software Architecture
    Shin, Michael
    Gomaa, Hassan
    Pathirage, Don
    SOFTWARE REUSE: BRIDGING WITH SOCIAL-AWARENESS, 2016, 9679 : 181 - 196
  • [2] Supporting search for reusable software objects
    Isakowitz, T
    Kauffman, RJ
    IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 1996, 22 (06) : 407 - 423
  • [3] Supporting the Management of Reusable Automotive Software
    Larrucea, Xabier
    Walker, Alastair
    Colomo-Palacios, Ricardo
    IEEE SOFTWARE, 2017, 34 (03) : 40 - 47
  • [4] Reusable Design of Data Visualization Software Architecture for Unmanned Aerial Vehicles
    Jovanovic, Mladan
    Starcevic, Dusan
    Jovanovic, Zoran
    JOURNAL OF AEROSPACE INFORMATION SYSTEMS, 2014, 11 (06): : 359 - 371
  • [5] Hierarchical Organizations and a Supporting Software Architecture for Floating Car Data
    Haesevoets, Robrecht
    Weyns, Danny
    Holvoet, Tom
    Joosen, Wouter
    Valckenaers, Paul
    SASOW 2008: SECOND IEEE INTERNATIONAL CONFERENCE ON SELF-ADAPTIVE AND SELF-ORGANIZING SYSTEMS WORKSHOPS, PROCEEDINGS, 2008, : 31 - +
  • [6] The research of a reusable software architecture in a project
    Wang, Qiang
    ICIM 2006: Proceedings of the Eighth International Conference on Industrial Management, 2006, : 1065 - 1069
  • [7] Software evolution via reusable architecture
    Harn, M
    Berzins, V
    Luqi
    ECBS '99, IEEE CONFERENCE AND WORKSHOP ON ENGINEERING OF COMPUTER-BASED SYSTEMS, PROCEEDINGS, 1999, : 11 - 17
  • [8] CLARAty: An architecture for reusable robotic software
    Nesnas, I
    Wright, A
    Bajracharya, M
    Simmons, R
    Estlin, T
    Kim, WS
    UNMANNED GROUND VEHICLE TECHNOLOGY V, 2003, 5083 : 253 - 264
  • [9] Auxo: an architecture-centric framework supporting the online tuning of software adaptivity
    Wang HuaiMin
    Ding Bo
    Shi DianXi
    Cao JianNong
    Chan, Alvin T. S.
    SCIENCE CHINA-INFORMATION SCIENCES, 2015, 58 (09) : 1 - 15
  • [10] Auxo: an architecture-centric framework supporting the online tuning of software adaptivity
    WANG HuaiMin
    DING Bo
    SHI DianXi
    CAO JianNong
    Alvin T.S.Chan
    Science China(Information Sciences), 2015, 58 (09) : 31 - 45