Data Sculpting: Interpretable Algorithm for End-to-End Cohort Selection

被引:0
|
作者
Liu, Ruishan [1 ]
Zou, James [2 ]
机构
[1] Stanford Univ, Dept Elect Engn, Stanford, CA 94305 USA
[2] Stanford Univ, Dept Biomed Data Sci, Stanford, CA 94305 USA
关键词
cohort selection; odds ratio; logistic regression;
D O I
10.1109/IEEECONF56349.2022.10052001
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Many scientific and medical analysis involves fitting a parametric model over a heterogeneous data set. The model is often chosen to be low capacity (e.g. logistic regression) in order to make statistical inference about the association between each feature and the outcome (e.g. odds ratio). However the simple model often cannot capture the heterogeneity in the data. For example, a subset of the data might follow a clean logistic relation, but other data points could follow different relations so that the fitting a logistic regression over the entire set may not find any association. In this paper, we propose a novel algorithm, Data Sculpting, for simultaneously learning to select a subset of the data while fitting the desired parametric model on the selected cohort. Data Sculpting retains the statistical inference convenience of the original model, while leveraging end-to-end differentiable optimization (via the Concrete selector) to learn interpretable rules for selecting the cohort. Extensive experiments demonstrate that Data Sculpting is efficient, robust and substantially improves over the standard approaches.
引用
收藏
页码:263 / 270
页数:8
相关论文
共 50 条
  • [1] A robust and interpretable end-to-end deep learning model for cytometry data
    Hu, Zicheng
    Tang, Alice
    Singh, Jaiveer
    Bhattacharya, Sanchita
    Butte, Atul J.
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2020, 117 (35) : 21373 - 21380
  • [2] A robust and interpretable, end-to-end deep learning model for cytometry data
    Hu, Zicheng
    Tang, Alice
    Singh, Jaiveer
    Bhattacharya, Sanchita
    Butte, Atul
    [J]. JOURNAL OF IMMUNOLOGY, 2020, 204 (01):
  • [3] Interpretable End-to-End heart sound classification
    Li, Shuaizhong
    Sun, Jing
    Yang, Hongbo
    Pan, Jiahua
    Guo, Tao
    Wang, Weilian
    [J]. MEASUREMENT, 2024, 237
  • [4] End-to-end Interpretable Neural Motion Planner
    Zeng, Wenyuan
    Luo, Wenjie
    Suo, Simon
    Sadat, Abbas
    Yang, Bin
    Casas, Sergio
    Urtasun, Raquel
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 8652 - 8661
  • [5] Interpretable End-to-End Driving Model for Implicit Scene Understanding
    Sun, Yiyang
    Wang, Xiaonian
    Zhang, Yangyang
    Tang, Jiagui
    Tang, Xiaqiang
    Yao, Jing
    [J]. 2023 IEEE 26TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS, ITSC, 2023, : 2874 - 2880
  • [6] End-to-end interpretable disease-gene association prediction
    Li, Yang
    Guo, Zihou
    Wang, Keqi
    Gao, Xin
    Wang, Guohua
    [J]. BRIEFINGS IN BIOINFORMATICS, 2023, 24 (03)
  • [7] Hierarchical Interpretable Imitation Learning for End-to-End Autonomous Driving
    Teng, Siyu
    Chen, Long
    Ai, Yunfeng
    Zhou, Yuanye
    Xuanyuan, Zhe
    Hu, Xuemin
    [J]. IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2023, 8 (01): : 673 - 683
  • [8] COmic: convolutional kernel networks for interpretable end-to-end learning on (multi-)omics data
    Ditz, Jonas C.
    Reuter, Bernhard
    Pfeifer, Nico
    [J]. BIOINFORMATICS, 2023, 39 : I76 - I85
  • [9] COmic: convolutional kernel networks for interpretable end-to-end learning on (multi-)omics data
    Ditz, Jonas C.
    Reuter, Bernhard
    Pfeifer, Nico
    [J]. BIOINFORMATICS, 2023, 39 : i76 - i85
  • [10] A New Ant Algorithm for Optimal Service Selection with End-to-End QoS Constraints
    Dac-Nhuong Le
    [J]. JOURNAL OF INTERNET TECHNOLOGY, 2017, 18 (05): : 1017 - 1030