CHEMIST: an R package for causal inference with high-dimensional error-prone covariates and misclassified treatments

Cited by: 1
Authors
Chen, Li-Pang [1]
Hsu, Wei-Hsin [1]
Affiliations
[1] Natl Chengchi Univ, Dept Stat, Taipei 116, Taiwan
Keywords
Feature screening; Inverse probability weight; Measurement error; Propensity score; R package; Variable selection; Adaptive lasso; Likelihood
DOI
10.1007/s42081-023-00217-y
Chinese Library Classification
O21 [Probability theory and mathematical statistics]; C8 [Statistics]
Discipline classification codes
020208; 070103; 0714
Abstract
This paper studies causal inference with complex and noisy data. We refer to this structure as CHEMIST, which stands for Causal inference with High-dimensional Error-prone covariates and MISclassified Treatments. To address these challenges when estimating the average treatment effect (ATE), we develop the FATE method, whose name reflects Feature screening, Adaptive lasso, Treatment adjustment, and Error elimination in covariates, to handle variable selection and measurement error correction. Once the informative covariates have been selected and the errors eliminated, the ATE can be estimated. To make the strategy publicly available, we develop a new R package, CHEMIST, which provides functions for estimating the ATE. Its flexible arguments let users examine different scenarios. We introduce the FATE method and its implementation in the R package CHEMIST, and demonstrate applications to two real data sets.
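The abstract's keywords mention the inverse probability weight and the propensity score, which underlie the final ATE estimation step. The following is a minimal sketch of a generic inverse-probability-weighted ATE estimator, not the CHEMIST package's API or the authors' FATE implementation. It assumes that propensity scores have already been estimated and that treatments are correctly recorded, so it leaves out feature screening, the adaptive lasso, and the corrections for measurement error and misclassification. The function name `ipw_ate` and the toy data are hypothetical.

```python
from typing import Sequence


def ipw_ate(y: Sequence[float], t: Sequence[int], e: Sequence[float]) -> float:
    """Generic inverse-probability-weighted ATE estimate (illustrative only).

    y: observed outcomes
    t: binary treatment indicators (1 = treated, 0 = control)
    e: propensity scores P(T = 1 | X), assumed already estimated
    """
    if not (len(y) == len(t) == len(e)):
        raise ValueError("y, t, and e must have the same length")
    n = len(y)
    if n == 0:
        raise ValueError("at least one observation is required")

    total = 0.0
    for yi, ti, ei in zip(y, t, e):
        if not 0.0 < ei < 1.0:
            raise ValueError("propensity scores must lie strictly in (0, 1)")
        # Treated units are weighted by 1/e, controls by 1/(1 - e).
        total += ti * yi / ei - (1 - ti) * yi / (1.0 - ei)
    return total / n


# Hypothetical toy data: with constant propensity 0.5 and balanced arms,
# the estimate reduces to the difference in group means.
y = [3.0, 1.0, 4.0, 2.0]
t = [1, 0, 1, 0]
e = [0.5, 0.5, 0.5, 0.5]
ate = ipw_ate(y, t, e)
print(ate)
```

In a setting like the one the abstract describes, the propensity scores passed as `e` would come from a model fitted on the selected, error-corrected covariates rather than being fixed constants.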
Pages: 17
Related papers
50 in total
  • [41] ordinalgmifs: An R Package for Ordinal Regression in High-dimensional Data Settings
    Archer, Kellie J.
    Hou, Jiayi
    Zhou, Qing
    Ferber, Kyle
    Layne, John G.
    Gentry, Amanda E.
    CANCER INFORMATICS, 2014, 13 : 187 - 195
  • [42] Random Subspace Method for high-dimensional regression with the R package regRSM
    Teisseyre, Pawel
    Klopotek, Robert A.
    Mielniczuk, Jan
    COMPUTATIONAL STATISTICS, 2016, 31 (03) : 943 - 972
  • [43] Variable Clustering in High-Dimensional Linear Regression: The R Package clere
    Yengo, Loic
    Jacques, Julien
    Biernacki, Christophe
    Canouil, Mickael
    R JOURNAL, 2016, 8 (01) : 92 - 106
  • [44] Random Subspace Method for high-dimensional regression with the R package regRSM
    Paweł Teisseyre
    Robert A. Kłopotek
    Jan Mielniczuk
    Computational Statistics, 2016, 31 : 943 - 972
  • [45] Uncertainty Assessment and False Discovery Rate Control in High-Dimensional Granger Causal Inference
    Chaudhry, Aditya
    Xu, Pan
    Gu, Quanquan
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 70, 2017, 70
  • [46] Interep: An R Package for High-Dimensional Interaction Analysis of the Repeated Measurement Data
    Zhou, Fei
    Ren, Jie
    Liu, Yuwen
    Li, Xiaoxi
    Wang, Weiqun
    Wu, Cen
    GENES, 2022, 13 (03)
  • [47] MCPtaggR: R package for accurate genotype calling in reduced representation sequencing data by eliminating error-prone markers based on genome comparison
    Furuta, Tomoyuki
    Yamamoto, Toshio
    DNA RESEARCH, 2024, 31 (01)
  • [48] bbl: Boltzmann Bayes Learner for High-Dimensional Inference with Discrete Predictors in R
    Woo, Jun
    Wang, Jinhua
    JOURNAL OF STATISTICAL SOFTWARE, 2022, 101 (05) : 1 - 32
  • [49] HRM: An R Package for Analysing High-dimensional Multi-factor Repeated Measures
    Happ, Martin
    Harrar, Solomon W.
    Bathke, Arne C.
    R JOURNAL, 2018, 10 (01) : 534 - 548
  • [50] LongDat: an R package for covariate-sensitive longitudinal analysis of high-dimensional data
    Chen, Chia-Yu
    Loeber, Ulrike
    Forslund, Sofia K.
    BIOINFORMATICS ADVANCES, 2023, 3 (01)