A machine learning-based approach for estimating and testing associations with multivariate outcomes

被引:2
|
作者
Benkeser, David [1 ]
Mertens, Andrew [2 ]
Colford, John M. [2 ]
Hubbard, Alan [3 ]
Arnold, Benjamin F. [4 ]
Stein, Aryeh [5 ]
van der Laan, Mark J. [3 ]
机构
[1] Emory Univ, Sch Publ Hlth, Dept Biostat & Bioinformat, Atlanta, GA 30322 USA
[2] Univ Calif Berkeley, Dept Epidemiol, Berkeley, CA USA
[3] Univ Calif Berkeley, Dept Biostat, Berkeley, CA 94720 USA
[4] Univ Calif San Francisco, Francis I Proctor Fdn, San Francisco, CA USA
[5] Emory Univ, Hubert Dept Global Hlth, Rollins Sch Publ Hlth, Atlanta, GA 30322 USA
来源
关键词
canonical correlation; epidemiology; machine learning; multivariate outcomes; variable importance; GROWTH;
D O I
10.1515/ijb-2019-0061
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
We propose a method for summarizing the strength of association between a set of variables and a multivariate outcome. Classical summary measures are appropriate when linear relationships exist between covariates and outcomes, while our approach provides an alternative that is useful in situations where complex relationships may be present. We utilize machine learning to detect nonlinear relationships and covariate interactions and propose a measure of association that captures these relationships. A hypothesis test about the proposed associative measure can be used to test the strong null hypothesis of no association between a set of variables and a multivariate outcome. Simulations demonstrate that this hypothesis test has greater power than existing methods against alternatives where covariates have nonlinear relationships with outcomes. Weadditionally propose measures of variable importance for groups of variables, which summarize each groups' association with the outcome. We demonstrate our methodology using data from a birth cohort study on childhood health and nutrition in the Philippines.
引用
收藏
页码:7 / 21
页数:15
相关论文
共 50 条
  • [1] A machine learning-based approach for estimating available bandwidth
    Chen, Ling-Jyh
    Chou, Cheng-Fu
    Wang, Bo-Chun
    [J]. TENCON 2007 - 2007 IEEE REGION 10 CONFERENCE, VOLS 1-3, 2007, : 164 - +
  • [2] Machine learning-based placental clusters and their associations with adverse pregnancy outcomes
    Petersen, Julie M.
    Parker, Samantha E.
    Dukes, Kimberly A.
    Hutcheon, Jennifer A.
    Ahrens, Katherine A.
    Werler, Martha M.
    [J]. PAEDIATRIC AND PERINATAL EPIDEMIOLOGY, 2023, 37 (04) : 350 - 361
  • [3] A Machine Learning-Based Data Fusion Approach for Improved Corrosion Testing
    Christoph Völker
    Sabine Kruschwitz
    Gino Ebell
    [J]. Surveys in Geophysics, 2020, 41 : 531 - 548
  • [4] A Machine Learning-Based Data Fusion Approach for Improved Corrosion Testing
    Voelker, Christoph
    Kruschwitz, Sabine
    Ebell, Gino
    [J]. SURVEYS IN GEOPHYSICS, 2020, 41 (03) : 531 - 548
  • [5] An Improved Air Quality Index Machine Learning-Based Forecasting with Multivariate Data Imputation Approach
    Alkabbani, Hanin
    Ramadan, Ashraf
    Zhu, Qinqin
    Elkamel, Ali
    [J]. ATMOSPHERE, 2022, 13 (07)
  • [6] Machine learning-based approach to GPS antijamming
    Wang, Cheng-Zhen
    Kong, Ling-Wei
    Jiang, Junjie
    Lai, Ying-Cheng
    [J]. GPS SOLUTIONS, 2021, 25 (03)
  • [7] A Machine Learning-based Approach for Groundwater Mapping
    Zzaman, Rashed Uz
    Nowreen, Sara
    Khan, Irtesam Mahmud
    Islam, Md Rajibul
    Ibtehaz, Nabil
    Rahman, M. Saifur
    Zahid, Anwar
    Farzana, Dilruba
    Sharmin, Afroza
    Rahman, M. Sohel
    [J]. NATURAL RESOURCES RESEARCH, 2022, 31 (01) : 281 - 299
  • [8] A Machine Learning-based Approach for Groundwater Mapping
    Rashed Uz Zzaman
    Sara Nowreen
    Irtesam Mahmud Khan
    Md. Rajibul Islam
    Nabil Ibtehaz
    M. Saifur Rahman
    Anwar Zahid
    Dilruba Farzana
    Afroza Sharmin
    M. Sohel Rahman
    [J]. Natural Resources Research, 2022, 31 : 281 - 299
  • [9] Machine learning-based approach to GPS antijamming
    Cheng-Zhen Wang
    Ling-Wei Kong
    Junjie Jiang
    Ying-Cheng Lai
    [J]. GPS Solutions, 2021, 25
  • [10] Machine Learning-Based Fuzz Testing Techniques: A Survey
    Zhang, Ao
    Zhang, Yiying
    Xu, Yao
    Wang, Cong
    Li, Siwei
    [J]. IEEE ACCESS, 2024, 12 : 14437 - 14454