Learning from data with structured missingness

被引:0
|
作者
Robin Mitra
Sarah F. McGough
Tapabrata Chakraborti
Chris Holmes
Ryan Copping
Niels Hagenbuch
Stefanie Biedermann
Jack Noonan
Brieuc Lehmann
Aditi Shenvi
Xuan Vinh Doan
David Leslie
Ginestra Bianconi
Ruben Sanchez-Garcia
Alisha Davies
Maxine Mackintosh
Eleni-Rosalina Andrinopoulou
Anahid Basiri
Chris Harbron
Ben D. MacArthur
机构
[1] The Alan Turing Institute,Statistical Science
[2] University College London,Department of Medical Physics & Biomedical Engineering and UCL Cancer Institute
[3] Genentech,Department of Statistics
[4] University College London,School of Mathematics and Statistics
[5] University of Oxford,School of Mathematics
[6] F. Hoffmann-La Roche AG,Department of Statistics
[7] The Open University,Warwick Business School
[8] Cardiff University,The Digital Environment Research Institute
[9] University of Warwick,School of Mathematical Sciences
[10] University of Warwick,Mathematical Sciences
[11] Queen Mary University of London,Faculty of Health and Life Sciences
[12] Queen Mary University of London,Department of Biostatistics and Department of Epidemiology
[13] University of Southampton,School of Geographical & Earth Sciences
[14] Swansea University,Faculty of Medicine
[15] Public Health Wales,undefined
[16] Genomics England,undefined
[17] Erasmus MC,undefined
[18] University of Glasgow,undefined
[19] Roche Pharmaceuticals,undefined
[20] University of Southampton,undefined
来源
关键词
D O I
暂无
中图分类号
学科分类号
摘要
Missing data are an unavoidable complication in many machine learning tasks. When data are ‘missing at random’ there exist a range of tools and techniques to deal with the issue. However, as machine learning studies become more ambitious, and seek to learn from ever-larger volumes of heterogeneous data, an increasingly encountered problem arises in which missing values exhibit an association or structure, either explicitly or implicitly. Such ‘structured missingness’ raises a range of challenges that have not yet been systematically addressed, and presents a fundamental hindrance to machine learning at scale. Here we outline the current literature and propose a set of grand challenges in learning from data with structured missingness.
引用
收藏
页码:13 / 23
页数:10
相关论文
共 50 条
  • [41] Integrative Clustering Analysis for Omics Data with Missingness
    Zhao, Yinqi
    Darst, Burcu
    Conti, David V.
    [J]. GENETIC EPIDEMIOLOGY, 2021, 45 (07) : 806 - 806
  • [42] Nonrandom missingness in categorical data: Strengths and limitations
    Molenberghs, G
    Goetghebeur, EJT
    Lipsitz, SR
    Kenward, MG
    [J]. AMERICAN STATISTICIAN, 1999, 53 (02): : 110 - 118
  • [43] Differential missingness of antibiotic indications in claims data
    Strassle, Paula D.
    Ross, Rachael K.
    Marx, Ashley H.
    Willis, Zachary I.
    Farel, Claire E.
    Kinlaw, Alan K.
    [J]. PHARMACOEPIDEMIOLOGY AND DRUG SAFETY, 2020, 29 : 210 - 210
  • [44] IMPUTATION OF MISSING DATA WITH DIFFERENT MISSINGNESS MECHANISM
    Kang, Ho Ming
    Yusof, Fadhilah
    Mohamad, Ismail
    [J]. JURNAL TEKNOLOGI, 2012, 57
  • [45] Analysis of Missingness Scenarios for Observational Health Data
    Zamanian, Alireza
    von Kleist, Henrik
    Ciora, Octavia-Andreea
    Piperno, Marta
    Lancho, Gino
    Ahmidi, Narges
    [J]. JOURNAL OF PERSONALIZED MEDICINE, 2024, 14 (05):
  • [46] Learning structured data from unspecific reinforcement (vol 33, pg 6843, 2000)
    Biehl, M
    Kühn, R
    Stamatescu, IO
    [J]. JOURNAL OF PHYSICS A-MATHEMATICAL AND GENERAL, 2001, 34 (19): : 4267 - 4267
  • [47] Learning Structured Knowledge from Social Tagging Data A critical review of methods and techniques
    Dong, Hang
    Wang, Wei
    Liang, Hai-Ning
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON SMART CITY/SOCIALCOM/SUSTAINCOM (SMARTCITY), 2015, : 307 - 314
  • [48] Detecting irrelevant subtrees to improve probabilistic learning from tree-structured data
    Habrard, A
    Bernard, M
    Sebban, M
    [J]. FUNDAMENTA INFORMATICAE, 2005, 66 (1-2) : 103 - 130
  • [49] Data Exclusion in Policy Survey and Questionnaire Data: Aberrant Responses and Missingness
    Hong, Maxwell
    Carter, Matthew
    Kim, Casey
    Cheng, Ying
    [J]. POLICY INSIGHTS FROM THE BEHAVIORAL AND BRAIN SCIENCES, 2023, 10 (01) : 11 - 17
  • [50] Learning structured representations from experience
    Doumas, Leonidas A. A.
    Martin, Andrea E.
    [J]. PSYCHOLOGY OF LEARNING AND MOTIVATION, VOL 69, 2018, 69 : 165 - 203