On learning disentangled representations for individual treatment effect estimation

被引:1
|
作者
Chu, Jiebin [1 ]
Sun, Zhoujian [1 ]
Dong, Wei [2 ]
Shi, Jinlong [3 ]
Huang, Zhengxing [1 ]
机构
[1] Zhejiang Univ, Hangzhou, Peoples R China
[2] Chinese Peoples Liberat Army Gen Hosp, Dept Cardiol, Beijing, Peoples R China
[3] Chinese Peoples Liberat Army Gen Hosp, Med Big Data Ctr, Dept Med Innovat Res, Beijing, Peoples R China
关键词
Individualized treatment effect; Causal inference; Deep learning; Disentangled representation; Auxiliary-task learning; Observational data; PROPENSITY SCORE; MODEL; BIAS;
D O I
10.1016/j.jbi.2021.103940
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Objective: Estimating the individualized treatment effect (ITE) from observational data is a challenging task due to selection bias, which results from the distributional discrepancy between different treatment groups caused by the dependence between features and assigned treatments. This dependence is induced by the factors related to the treatment assignment. We hypothesize that features consist of three types of latent factors: outcome-specific factors, treatment-specific factors and confounders. Then, we aim to reduce the influence of treatment-related factors, i.e., treatment-specific factors and confounders, on outcome prediction to mitigate the effects of selection bias. Method: We present a novel representation learning model in which both the main task of outcome prediction and the auxiliary task of classifying the treatment assignment are used to learn the outcome-oriented and treatment-oriented latent representations, respectively. However, since the confounders are related to both treatment assignment and outcome, it is still contained in the representations. To further reduce influence of the confounders contained in both representations, individualized orthogonal regularization is incorporated into the proposed model. The orthogonal regularization forces the outcome-oriented and treatment-oriented latent representations of an individual to be vertical in the inner product space, meaning they are orthogonal with each other, and the common information of confounder is reduced. Such that the ITE can be estimated more precisely without the effects of selection bias. Result: We evaluate our proposed model on a semi-simulated dataset and a real-world dataset. The experimental results demonstrate that the proposed model achieves competitive or better performance compared with the performances of the state-of-the-art models. Conclusion: The proposed method is well performed on ITE estimation with the ability to reduce selection bias thoroughly by incorporating an auxiliary task and adopting orthogonal regularization to disentangle the latent factors. Significance: This paper offers a novel method of reducing selection bias in estimating the ITE from observational data by disentangled representation learning.
引用
收藏
页数:10
相关论文
共 50 条
  • [41] Adversarial Learning of Disentangled and Generalizable Representations of Visual Attributes
    Oldfield, James
    Panagakis, Yannis
    Nicolaou, Mihalis A.
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (08) : 3498 - 3509
  • [42] Learning Disentangled Representations of Texts with Application to Biomedical Abstracts
    Jain, Sarthak
    Banner, Edward
    van de Meent, Jan-Willem
    Marshall, Iain J.
    Wallace, Byron C.
    2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 4683 - 4693
  • [43] Learning Disentangled Representations of Satellite Image Time Series
    Sanchez, Eduardo H.
    Serrurier, Mathieu
    Ortner, Mathias
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2019, PT III, 2020, 11908 : 306 - 321
  • [44] On the Fairness of Disentangled Representations
    Locatello, Francesco
    Abbati, Gabriele
    Rainforth, Tom
    Bauer, Stefan
    Scholkopf, Bernhard
    Bachem, Olivier
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [45] Learning Disentangled Attribute Representations for Robust Pedestrian Attribute Recognition
    Jia, Jian
    Gao, Naiyu
    He, Fei
    Chen, Xiaotang
    Huang, Kaiqi
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 1069 - 1077
  • [46] Geometric Inductive Biases for Identifiable Unsupervised Learning of Disentangled Representations
    Pan, Ziqi
    Niu, Li
    Zhang, Liqing
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 8, 2023, : 9372 - 9380
  • [47] Manipulating Voice Attributes by Adversarial Learning of Structured Disentangled Representations
    Benaroya, Laurent
    Obin, Nicolas
    Roebel, Axel
    ENTROPY, 2023, 25 (02)
  • [48] An Adversarial Neuro-Tensorial Approach for Learning Disentangled Representations
    Wang, Mengjiao
    Shu, Zhixin
    Cheng, Shiyang
    Panagakis, Yannis
    Samaras, Dimitris
    Zafeiriou, Stefanos
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2019, 127 (6-7) : 743 - 762
  • [49] An Adversarial Neuro-Tensorial Approach for Learning Disentangled Representations
    Mengjiao Wang
    Zhixin Shu
    Shiyang Cheng
    Yannis Panagakis
    Dimitris Samaras
    Stefanos Zafeiriou
    International Journal of Computer Vision, 2019, 127 : 743 - 762
  • [50] Disentangled behavioral representations
    Dezfouli, Amir
    Ashtiani, Hassan
    Ghattas, Omar
    Nock, Richard
    Dayan, Peter
    Ong, Cheng Soon
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32