Migrating a research data warehouse to a public cloud: challenges and opportunities

被引:10
|
作者
Kahn, Michael G. [1 ,2 ]
Mui, Joyce Y. [2 ]
Ames, Michael J. [3 ]
Yamsani, Anoop K. [2 ]
Pozdeyev, Nikita [2 ,4 ,5 ]
Rafaels, Nicholas [2 ,5 ]
Brooks, Ian M. [2 ,5 ]
机构
[1] Univ Colorado, Dept Pediat, Sch Med, Sect Informat & Data Sci, Aurora, CO 80045 USA
[2] Univ Colorado, Sch Med, Colorado Ctr Personalized Med, Aurora, CO 80045 USA
[3] SADA Inc, North Hollywood, CA USA
[4] Univ Colorado, Sch Med, Div Endocrinol Metab & Diabet, Aurora, CO 80045 USA
[5] Univ Colorado, Dept Med, Div Biomed Informat & Personalized Med, Aurora, CO USA
关键词
data warehousing; cloud computing; big data; research data governance;
D O I
10.1093/jamia/ocab278
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Objective Clinical research data warehouses (RDWs) linked to genomic pipelines and open data archives are being created to support innovative, complex data-driven discoveries. The computing and storage needs of these research environments may quickly exceed the capacity of on-premises systems. New RDWs are migrating to cloud platforms for the scalability and flexibility needed to meet these challenges. We describe our experience in migrating a multi-institutional RDW to a public cloud. Materials and Methods This study is descriptive. Primary materials included internal and public presentations before and after the transition, analysis documents, and actual billing records. Findings were aggregated into topical categories. Results Eight categories of migration issues were identified. Unanticipated challenges included legacy system limitations; network, computing, and storage architectures that realize performance and cost benefits in the face of hyper-innovation, complex security reviews and approvals, and limited cloud consulting expertise. Discussion Cloud architectures enable previously unavailable capabilities, but numerous pitfalls can impede realizing the full benefits of a cloud environment. Rapid changes in cloud capabilities can quickly obsolete existing architectures and associated institutional policies. Touchpoints with on-premise networks and systems can add unforeseen complexity. Governance, resource management, and cost oversight are critical to allow rapid innovation while minimizing wasted resources and unnecessary costs. Conclusions Migrating our RDW to the cloud has enabled capabilities and innovations that would not have been possible with an on-premises environment. Notwithstanding the challenges of managing cloud resources, the resulting RDW capabilities have been highly positive to our institution, research community, and partners.
引用
收藏
页码:592 / 600
页数:9
相关论文
共 50 条
  • [1] VHA Corporate Data Warehouse height and weight data: Opportunities and challenges for health services research
    Noel, Polly Hitchcock
    Copeland, Laurel A.
    Perrin, Ruth A.
    Lancaster, A. Elizabeth
    Pugh, Mary Jo
    Wang, Chen-Pin
    Bollinger, Mary J.
    Hazuda, Helen P.
    [J]. JOURNAL OF REHABILITATION RESEARCH AND DEVELOPMENT, 2010, 47 (08): : 739 - 750
  • [3] Biometrics in the Cloud: Challenges and Research Opportunities
    Castiglione, Aniello
    Choo, Kim-Kwang Raymond
    Nappi, Michele
    Narducci, Fabio
    [J]. IEEE CLOUD COMPUTING, 2017, 4 (04): : 12 - 17
  • [4] Data management in the cloud: Challenges and opportunities
    [J]. 1600, Morgan and Claypool Publishers (04):
  • [5] Opportunity and Challenges for Migrating Big Data Analytics in Cloud
    Manekar, Amitkumar S.
    Pradeepini, G.
    [J]. INTERNATIONAL CONFERENCE ON MATERIALS, ALLOYS AND EXPERIMENTAL MECHANICS (ICMAEM-2017), 2017, 225
  • [6] Data Warehouse for Business Process Evaluation Approach Opportunities and Challenges
    Mousa, Ayad Hameed
    Shiratuddin, Norshuhada
    Abu Bakar, Muhamad Shahbani
    [J]. PROCEEDING OF KNOWLEDGE MANAGEMENT INTERNATIONAL CONFERENCE (KMICE) 2014, VOLS 1 AND 2, 2014, : 465 - 471
  • [7] Research Opportunities and Challenges of Security Concerns associated with Big Data in Cloud Computing
    Anandaraj, S. P.
    Kemal, Mohammed
    [J]. 2017 INTERNATIONAL CONFERENCE ON I-SMAC (IOT IN SOCIAL, MOBILE, ANALYTICS AND CLOUD) (I-SMAC), 2017, : 746 - 751
  • [8] Federation in Cloud Data Management: Challenges and Opportunities
    Chen, Gang
    Jagadish, H. V.
    Jiang, Dawei
    Maier, David
    Ooi, Beng Chin
    Tan, Kian-Lee
    Tan, Wang-Chiew
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2014, 26 (07) : 1670 - 1678
  • [9] Cloud Virtualization with Data Security: Challenges and Opportunities
    Abraham, Joshua Johnson
    Sunny, Christy
    Assisi, Anlin
    Jayapandian, N.
    [J]. PROCEEDING OF THE INTERNATIONAL CONFERENCE ON COMPUTER NETWORKS, BIG DATA AND IOT (ICCBI-2018), 2020, 31 : 865 - 872
  • [10] Challenges and Opportunities in Big Data and Cloud Computing
    Sohail, Hassan
    Zameer, Zeenia
    Ahmed, Hafiz Farhan
    Iqbal, Usama
    Shah, Pir Amad Ali
    [J]. FUTURE INTELLIGENT VEHICULAR TECHNOLOGIES, FUTURE 5V 2016, 2017, 185 : 175 - 181