A KNN-Based Non-Parametric Conditional Independence Test for Mixed Data and Application in Causal Discovery

被引:0
|
作者
Huegle, Johannes [1 ]
Hagedorn, Christopher [1 ]
Schlosser, Rainer [1 ]
机构
[1] Univ Potsdam, Hasso Plattner Inst, Potsdam, Germany
关键词
Non-Parametric CI Testing; Causal Discovery; Mixed Data; BAYESIAN NETWORKS; VARIABLES; DISCRETE;
D O I
10.1007/978-3-031-43412-9_32
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Testing for Conditional Independence (CI) is a fundamental task for causal discovery but is particularly challenging in mixed discrete-continuous data. In this context, inadequate assumptions or discretization of continuous variables reduce the CI test's statistical power, which yields incorrect learned causal structures. In this work, we present a non-parametric CI test leveraging k-nearest neighbor (kNN) methods that are adaptive to mixed discrete-continuous data. In particular, a kNN-based conditional mutual information estimator serves as the test statistic, and the p-value is calculated using a kNN-based local permutation scheme. We prove the CI test's statistical validity and power in mixed discrete-continuous data, which yields consistency when used in constraint-based causal discovery. An extensive evaluation of synthetic and real-world data shows that the proposed CI test outperforms state-of-the-art approaches in the accuracy of CI testing and causal discovery, particularly in settings with low sample sizes.
引用
收藏
页码:541 / 558
页数:18
相关论文
共 50 条
  • [1] Approximate Kernel-Based Conditional Independence Tests for Fast Non-Parametric Causal Discovery
    Strobl, Eric, V
    Zhang, Kun
    Visweswaran, Shyam
    [J]. JOURNAL OF CAUSAL INFERENCE, 2019, 7 (01)
  • [2] Residual Similarity Based Conditional Independence Test and Its Application in Causal Discovery
    Zhang, Hao
    Zhou, Shuigeng
    Zhang, Kun
    Guan, Jihong
    [J]. THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 5942 - 5949
  • [3] Non-Stationary Causal Discovery Method Based on Conditional Independence Test
    Zhifeng, Hao
    Weijie, Zhang
    Ruichu, Cai
    Wei, Chen
    [J]. Computer Engineering and Applications, 60 (10): : 113 - 120
  • [4] A NON-PARAMETRIC TEST OF INDEPENDENCE
    HOEFFDING, W
    [J]. ANNALS OF MATHEMATICAL STATISTICS, 1948, 19 (04): : 546 - 557
  • [5] A non-parametric test for independence based on symbolic dynamics
    Matilla-Garcia, Mariano
    [J]. JOURNAL OF ECONOMIC DYNAMICS & CONTROL, 2007, 31 (12): : 3889 - 3903
  • [6] POWER OF A NON-PARAMETRIC TEST OF INDEPENDENCE
    ELANDT, RC
    [J]. ANNALS OF MATHEMATICAL STATISTICS, 1961, 32 (02): : 625 - 625
  • [7] A Distribution Free Conditional Independence Test with Applications to Causal Discovery
    Cai, Zhanrui
    Li, Runze
    Zhang, Yaowu
    [J]. Journal of Machine Learning Research, 2022, 23
  • [8] A Distribution Free Conditional Independence Test with Applications to Causal Discovery
    Cai, Zhanrui
    Li, Runze
    Zhang, Yaowu
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2022, 23
  • [9] On the Intersection Property of Conditional Independence and its Application to Causal Discovery
    Peters, Jonas
    [J]. JOURNAL OF CAUSAL INFERENCE, 2015, 3 (01) : 97 - 108
  • [10] A non-parametric independence test using permutation entropy
    Matilla-Garcia, Mariano
    Marin, Manuel Ruiz
    [J]. JOURNAL OF ECONOMETRICS, 2008, 144 (01) : 139 - 155