Estimating contact network properties by integrating multiple data sources associated with infectious diseases

被引:2
|
作者
Goyal, Ravi [1 ]
Carnegie, Nicole [2 ]
Slipher, Sally [3 ]
Turk, Philip [4 ]
Little, Susan J. [5 ]
De Gruttola, Victor [6 ]
机构
[1] Univ Calif San Diego, Div Infect Dis & Global Publ, San Diego, CA 92103 USA
[2] Publ Hlth Co, Palo Alto, CA USA
[3] Montana State Univ, Dept Math Sci, Bozeman, MT USA
[4] Univ Mississippi, Dept Data Sci, Med Ctr, Jackson, MS USA
[5] Univ Calif San Diego, Div Infect Dis & Global Publ, La Jolla, CA USA
[6] Harvard TH Chan Sch Publ Hlth, Dept Biostat, Boston, MA USA
基金
美国国家卫生研究院;
关键词
Bayesian inference; contact network; epidemic model; phylodynamics; TRANSMISSION; EPIDEMIC; IMPACT; INFERENCE; MODELS; PARTNERSHIPS; PREVALENCE; NUMBER; MEN; DNA;
D O I
10.1002/sim.9816
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
To effectively mitigate the spread of communicable diseases, it is necessary to understand the interactions that enable disease transmission among individuals in a population; we refer to the set of these interactions as a contact network. The structure of the contact network can have profound effects on both the spread of infectious diseases and the effectiveness of control programs. Therefore, understanding the contact network permits more efficient use of resources. Measuring the structure of the network, however, is a challenging problem. We present a Bayesian approach to integrate multiple data sources associated with the transmission of infectious diseases to more precisely and accurately estimate important properties of the contact network. An important aspect of the approach is the use of the congruence class models for networks. We conduct simulation studies modeling pathogens resembling SARS-CoV-2 and HIV to assess the method; subsequently, we apply our approach to HIV data from the University of California San Diego Primary Infection Resource Consortium. Based on simulation studies, we demonstrate that the integration of epidemiological and viral genetic data with risk behavior survey data can lead to large decreases in mean squared error (MSE) in contact network estimates compared to estimates based strictly on risk behavior information. This decrease in MSE is present even in settings where the risk behavior surveys contain measurement error. Through these simulations, we also highlight certain settings where the approach does not improve MSE.
引用
收藏
页码:3593 / 3615
页数:23
相关论文
共 50 条
  • [31] Curating and Integrating Data from Multiple Sources to Support Healthcare Analytics
    Ng, Kenney
    Kakkanatt, Chris
    Benigno, Michael
    Thompson, Clay
    Jackson, Margaret
    Cahan, Amos
    Zhu, Xinxin
    Zhang, Ping
    Huang, Paul
    MEDINFO 2015: EHEALTH-ENABLED HEALTH, 2015, 216 : 1056 - 1056
  • [32] Integrating multiple data sources in match-fixing warning systems
    Oetting, Marius
    Langrock, Roland
    Deutscher, Christian
    STATISTICAL MODELLING, 2018, 18 (5-6) : 483 - 504
  • [33] Integrating multiple data sources for learning analytics-review of literature
    Samuelsen, Jeanette
    Chen, Weiqin
    Wasson, Barbara
    RESEARCH AND PRACTICE IN TECHNOLOGY ENHANCED LEARNING, 2019, 14 (01)
  • [34] Integrating human single-cell data from multiple sources
    Chenwei Li
    Zedao Liu
    Zemin Zhang
    Quantitative Biology, 2022, 10 (03) : 299 - 300
  • [35] Integrating Multiple Data Sources for Combinatorial Marker Discovery: A Study in Tumorigenesis
    Bandyopadhyay, Sanghamitra
    Mallik, Saurav
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2018, 15 (02) : 673 - 687
  • [36] Integrating multiple sources of ecological data to unveil macroscale species abundance
    Keiichi Fukaya
    Buntarou Kusumoto
    Takayuki Shiono
    Junichi Fujinuma
    Yasuhiro Kubota
    Nature Communications, 11
  • [37] Big data and portfolio optimization: A novel approach integrating DEA with multiple data sources
    Zhou, Zhongbao
    Gao, Meng
    Xiao, Helu
    Wang, Rui
    Liu, Wenbin
    OMEGA-INTERNATIONAL JOURNAL OF MANAGEMENT SCIENCE, 2021, 104
  • [38] Analysis of data with multiple sources of correlation in the framework of generalized estimating equations
    Shults, J
    Whitt, MC
    Kumanyika, S
    STATISTICS IN MEDICINE, 2004, 23 (20) : 3209 - 3226
  • [39] Estimating the mortality rate of hepatitis C using multiple data sources
    Wu, C
    Chang, HG
    McNutt, LA
    Smith, PF
    EPIDEMIOLOGY AND INFECTION, 2005, 133 (01): : 121 - 125
  • [40] Estimating age specific transmission rates for infectious diseases: fitting a survey of contact patterns to seroprevalence data for VZV and parvovirus B19
    Melegraro, A.
    Jit, M.
    Gay, N.
    Edmunds, J.
    EUROPEAN JOURNAL OF PUBLIC HEALTH, 2007, 17 : 58 - 58