Learning from data with structured missingness

被引:0
|
作者
Robin Mitra
Sarah F. McGough
Tapabrata Chakraborti
Chris Holmes
Ryan Copping
Niels Hagenbuch
Stefanie Biedermann
Jack Noonan
Brieuc Lehmann
Aditi Shenvi
Xuan Vinh Doan
David Leslie
Ginestra Bianconi
Ruben Sanchez-Garcia
Alisha Davies
Maxine Mackintosh
Eleni-Rosalina Andrinopoulou
Anahid Basiri
Chris Harbron
Ben D. MacArthur
机构
[1] The Alan Turing Institute,Statistical Science
[2] University College London,Department of Medical Physics & Biomedical Engineering and UCL Cancer Institute
[3] Genentech,Department of Statistics
[4] University College London,School of Mathematics and Statistics
[5] University of Oxford,School of Mathematics
[6] F. Hoffmann-La Roche AG,Department of Statistics
[7] The Open University,Warwick Business School
[8] Cardiff University,The Digital Environment Research Institute
[9] University of Warwick,School of Mathematical Sciences
[10] University of Warwick,Mathematical Sciences
[11] Queen Mary University of London,Faculty of Health and Life Sciences
[12] Queen Mary University of London,Department of Biostatistics and Department of Epidemiology
[13] University of Southampton,School of Geographical & Earth Sciences
[14] Swansea University,Faculty of Medicine
[15] Public Health Wales,undefined
[16] Genomics England,undefined
[17] Erasmus MC,undefined
[18] University of Glasgow,undefined
[19] Roche Pharmaceuticals,undefined
[20] University of Southampton,undefined
来源
关键词
D O I
暂无
中图分类号
学科分类号
摘要
Missing data are an unavoidable complication in many machine learning tasks. When data are ‘missing at random’ there exist a range of tools and techniques to deal with the issue. However, as machine learning studies become more ambitious, and seek to learn from ever-larger volumes of heterogeneous data, an increasingly encountered problem arises in which missing values exhibit an association or structure, either explicitly or implicitly. Such ‘structured missingness’ raises a range of challenges that have not yet been systematically addressed, and presents a fundamental hindrance to machine learning at scale. Here we outline the current literature and propose a set of grand challenges in learning from data with structured missingness.
引用
收藏
页码:13 / 23
页数:10
相关论文
共 50 条
  • [1] Learning from data with structured missingness
    Mitra, Robin
    McGough, Sarah F.
    Chakraborti, Tapabrata
    Holmes, Chris
    Copping, Ryan
    Hagenbuch, Niels
    Biedermann, Stefanie
    Noonan, Jack
    Lehmann, Brieuc
    Shenvi, Aditi
    Doan, Xuan Vinh
    Leslie, David
    Bianconi, Ginestra
    Sanchez-Garcia, Ruben
    Davies, Alisha
    Mackintosh, Maxine
    Andrinopoulou, Eleni-Rosalina
    Basiri, Anahid
    Harbron, Chris
    MacArthur, Ben D.
    [J]. NATURE MACHINE INTELLIGENCE, 2023, 5 (01) : 13 - 23
  • [2] Embedding for Informative Missingness: Deep Learning With Incomplete Data
    Ghorbani, Amirata
    Zou, James Y.
    [J]. 2018 56TH ANNUAL ALLERTON CONFERENCE ON COMMUNICATION, CONTROL, AND COMPUTING (ALLERTON), 2018, : 437 - 445
  • [3] Missingness-Pattern-Adaptive Learning With Incomplete Data
    Gong, Yongshun
    Li, Zhibin
    Liu, Wei
    Lu, Xiankai
    Liu, Xinwang
    Tsang, Ivor W. W.
    Yin, Yilong
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (09) : 11053 - 11066
  • [4] Learning comprehensible theories from structured data
    Lloyd, JW
    [J]. ADVANCED LECTURES ON MACHINE LEARNING, 2002, 2600 : 203 - 225
  • [5] Learning from highly structured data by decomposition
    Mac Kinney-Romero, R
    Giraud-Carrier, C
    [J]. PRINCIPLES OF DATA MINING AND KNOWLEDGE DISCOVERY, 1999, 1704 : 436 - 441
  • [6] Learning structured data from unspecific reinforcement
    Biehl, M
    Kühn, R
    Stamatescu, IO
    [J]. JOURNAL OF PHYSICS A-MATHEMATICAL AND GENERAL, 2000, 33 (39): : 6843 - 6857
  • [7] Transfer Learning Approach for Learning of Unstructured Data from Structured Data in Medical Domain
    Wankhade, Nishigandha V.
    Potey, Madhuri A.
    [J]. 2013 2ND INTERNATIONAL CONFERENCE ON INFORMATION MANAGEMENT IN THE KNOWLEDGE ECONOMY (IMKE), 2013, : 86 - 91
  • [8] On the hardness of learning queries from tree structured data
    Liu, Xianmin
    Li, Jianzhong
    [J]. JOURNAL OF COMBINATORIAL OPTIMIZATION, 2015, 29 (03) : 670 - 684
  • [9] DIFFER: A Propositionalization approach for Learning from Structured Data
    Karunaratne, Thashmee
    Bostrom, Henrik
    [J]. PROCEEDINGS OF WORLD ACADEMY OF SCIENCE, ENGINEERING AND TECHNOLOGY, VOL 15, 2006, 15 : 49 - +
  • [10] On the computational hardness of learning from structured symbolic data
    Jappy, P
    Gascuel, O
    [J]. ORDINAL AND SYMBOLIC DATA ANALYSIS, 1996, : 189 - 200