Causal Discovery on Discrete Data via Weighted Normalized Wasserstein Distance

被引:0
|
作者
Wei, Yi [1 ]
Li, Xiaofei [1 ]
Lin, Lihui [1 ]
Zhu, Dengming [2 ]
Li, Qingyong [3 ]
机构
[1] Wuyi Univ, Sch Math & Comp Sci, Fujian Key Lab Big Data Applicat & Intellectualiz, Nanping 354300, Fujian, Peoples R China
[2] Chinese Acad Sci, Inst Comp Technol, Beijing 100190, Peoples R China
[3] Beijing Jiaotong Univ, Frontiers Sci Ctr Smart High Speed Railway Syst, Beijing 100044, Peoples R China
基金
中国国家自然科学基金;
关键词
Asymmetry; causal discovery; discrete additive noise model (ANM); weighted normalized Wasserstein distance; INFERENCE;
D O I
10.1109/TNNLS.2022.3213641
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The task of causal discovery from observational data (X,Y) is defined as the task of deciding whether X causes Y , or Y causes X or if there is no causal relationship between X and Y . Causal discovery from observational data is an important problem in many areas of science. In this study, we propose a method to address this problem when the cause-and-effect relationship is represented by a discrete additive noise model (ANM). First, assuming that X causes Y , we estimate the conditional distributions of the noise given X using regression. Similarly, assuming that Y causes X , we also estimate the conditional distributions of noise given Y . Based on the structural characteristics of the discrete ANM, we find that the dissimilarity of the conditional distributions of noise in the causal direction is smaller than that in the anticausal direction. Then, we propose a weighted normalized Wasserstein distance to measure the dissimilarity of the conditional distributions of noise. Finally, we propose a decision rule for casual discovery by comparing two computed weighted normalized Wasserstein distances. An empirical investigation demonstrates that our method performs well on synthetic data and outperforms state-of-the-art methods on real data.
引用
收藏
页码:4911 / 4923
页数:13
相关论文
共 50 条
  • [1] Causal Inference on Discrete Data via Estimating Distance Correlations
    Liu, Furui
    Chan, Laiwan
    [J]. NEURAL COMPUTATION, 2016, 28 (05) : 801 - 814
  • [2] Causal Discovery on Discrete Data with Extensions to Mixture Model
    Liu, Furui
    Chan, Laiwan
    [J]. ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2016, 7 (02)
  • [3] A SUBSAMPLING-BASED METHOD FOR CAUSAL DISCOVERY ON DISCRETE DATA
    Goddard, Austin
    Xiang, Yu
    [J]. 2021 IEEE STATISTICAL SIGNAL PROCESSING WORKSHOP (SSP), 2021, : 341 - 345
  • [4] On the Robustness of Causal Discovery with Additive Noise Models on Discrete Data
    Du, Kang
    Goddard, Austin
    Xiang, Yu
    [J]. 2020 DATA COMPRESSION CONFERENCE (DCC 2020), 2020, : 365 - 365
  • [5] An infrared small target detection model via Gather-Excite attention and normalized Wasserstein distance
    Sun, Kangjian
    Huo, Ju
    Liu, Qi
    Yang, Shunyuan
    [J]. MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2023, 20 (11) : 19040 - 19064
  • [6] A Weighted Approximation Approach to the Study of the Empirical Wasserstein Distance
    Mason, David M.
    [J]. HIGH DIMENSIONAL PROBABILITY VII: THE CARGESE VOLUME, 2016, 71 : 137 - 154
  • [7] Shape recognition via Wasserstein distance
    Gangbo, W
    McCann, RJ
    [J]. QUARTERLY OF APPLIED MATHEMATICS, 2000, 58 (04) : 705 - 737
  • [8] Discrete Wasserstein barycenters: optimal transport for discrete data
    Ethan Anderes
    Steffen Borgwardt
    Jacob Miller
    [J]. Mathematical Methods of Operations Research, 2016, 84 : 389 - 409
  • [9] Discrete Wasserstein barycenters: optimal transport for discrete data
    Anderes, Ethan
    Borgwardt, Steffen
    Miller, Jacob
    [J]. MATHEMATICAL METHODS OF OPERATIONS RESEARCH, 2016, 84 (02) : 389 - 409
  • [10] Causal Discovery from Discrete Data using Hidden Compact Representation
    Cai, Ruichu
    Qiao, Jie
    Zhang, Kun
    Zhang, Zhenjie
    Hao, Zhifeng
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31