Collaborative Workflow for Analyzing Large-Scale Data for Antimicrobial Resistance: An Experience Report

Cited by: 0
Authors
Hou, Pei-Yu [1]
Ao, Jing [1]
Rindos, Andrew [2]
Keelara, Shivaramu [1]
Fedorka-Cray, Paula J. [1]
Chirkova, Rada [1]
Affiliations
[1] North Carolina State Univ, Raleigh, NC 27695 USA
[2] IBM Corp, Res Triangle Pk, NC 27709 USA
Keywords
data analytics; data integration; antimicrobial resistance; experts-in-the-loop; analysts-in-the-loop
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
In real-life analytics-oriented information-integration projects, the processes of information curation and integration cannot be completely automated. Rather, in each large-scale project the key objectives include maximizing scalability and throughput, while at the same time keeping the processes manageable and productive for the human experts in the loop. In this paper, we describe our experience with addressing these major objectives in the process of building a scalable end-to-end data-extraction, integration, and analytics workflow in the domain of antimicrobial resistance (AMR). The workflow is built using open-source tools, with the aims of enhancing the efficiency and accuracy of data collection and integration, while involving an acceptable level of effort from collaborative multidisciplinary teams of humans-in-the-loop. We present the components of the proposed workflow, outline the challenges encountered in its development and testing, and discuss the experiences and lessons learned in enabling AMR experts and data analysts to interact with the workflow, with some of the lessons potentially applicable to other application domains.
Pages: 4608-4617
Number of pages: 10