Graphical Granger Causality by Information-Theoretic Criteria

Cited by: 2
Authors
Hlavackova-Schindler, Katerina [1]
Plant, Claudia [1, 2]
Affiliations
[1] Univ Vienna, Fac Comp Sci, Vienna, Austria
[2] Univ Vienna, Ds UniVie, Vienna, Austria
Keywords
EXPRESSION; SELECTION; MODELS
DOI
10.3233/FAIA200252
Chinese Library Classification number
TP18 [Artificial Intelligence Theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Causal inference by a graphical Granger model (GGM) among p variables is typically solved as p penalized linear regression problems on time series with a given lag. In practice, however, the estimates of a penalized linear regression after a finite number of steps can still be far from the optimum. Furthermore, selecting the regularization parameter, which influences the precision of the model, is not trivial, especially when the corresponding design matrix is super-collinear. In this paper, for the first time, we formulate a graphical Granger model as an instance of combinatorial optimization. Computing maximum likelihood (ML) estimates of the regression coefficients and of the variance for each of the p variables, we propose an information-theoretic graphical Granger model (ITGGM). In the sense of information theory, the criterion to be minimized is the complexity of the class of the selected models together with the complexity of the data set. Following this idea, we propose four information-theoretic (IT) objective functions, based on stochastic complexity, on minimum message length, on the Akaike information criterion, and on the Bayesian information criterion. To find their minima, we propose a genetic algorithm operating on populations of subsets of regressor variables. Feature selection by the ITGGM with any of these functions is parameter-free in the sense that, besides the ML estimates, which are constant for each model and within it, no adjustable parameter is added to the objective functions. We further provide a theoretical analysis of the convergence properties of the GGM with the proposed IT functions. We test the performance of the functions in terms of the F1 measure against two common penalized GGMs on synthetic and real data. The experiments demonstrate the significant superiority of the IT criteria over the two penalized GGM alternatives for Granger causal inference in terms of the F1 measure.
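The idea of searching over subsets of lagged regressors with a genetic algorithm and an information criterion can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not give the exact form of the four objective functions, so the standard Gaussian BIC is used here, and the function names (`lagged_design`, `bic`, `genetic_select`), the genetic operators (truncation selection, one-point crossover, bit-flip mutation, elitism) and all numeric settings are assumptions made for illustration.

```python
import numpy as np

def lagged_design(series, lag):
    """Build lagged regressors from an array of shape (T, p).
    Column k * p + j of X holds variable j at lag k + 1 (hypothetical layout)."""
    T, _ = series.shape
    X = np.hstack([series[lag - k - 1 : T - k - 1] for k in range(lag)])
    Y = series[lag:]
    return X, Y

def bic(X, y, mask):
    """Gaussian BIC of the least-squares fit on the columns selected by mask."""
    n = len(y)
    k = int(mask.sum())
    if k == 0:
        resid = y
    else:
        beta, *_ = np.linalg.lstsq(X[:, mask], y, rcond=None)
        resid = y - X[:, mask] @ beta
    sigma2 = max(resid @ resid / n, 1e-12)  # ML estimate of the noise variance
    return n * np.log(sigma2) + k * np.log(n)

def genetic_select(X, y, pop_size=40, generations=60, mutation_rate=0.05, seed=0):
    """Search boolean subset masks for a low BIC (assumed, simple GA operators)."""
    rng = np.random.default_rng(seed)
    d = X.shape[1]
    cols = np.arange(d)
    pop = rng.random((pop_size, d)) < 0.5
    for _ in range(generations):
        scores = np.array([bic(X, y, m) for m in pop])
        order = np.argsort(scores)
        parents = pop[order[: pop_size // 2]]  # truncation selection
        a = rng.integers(0, len(parents), pop_size - 1)
        b = rng.integers(0, len(parents), pop_size - 1)
        cut = rng.integers(1, d, pop_size - 1)  # one-point crossover positions
        children = np.where(cols[None, :] < cut[:, None], parents[a], parents[b])
        children ^= rng.random(children.shape) < mutation_rate  # bit-flip mutation
        pop = np.vstack([pop[order[:1]], children])  # keep the best mask (elitism)
    scores = np.array([bic(X, y, m) for m in pop])
    return pop[np.argmin(scores)]

# Synthetic example: variable 0 is driven by variable 1 at lag 1.
rng = np.random.default_rng(1)
T, p, lag = 300, 3, 2
data = rng.normal(size=(T, p))
data[1:, 0] = 0.8 * data[:-1, 1] + 0.1 * rng.normal(size=T - 1)

X, Y = lagged_design(data, lag)
y = Y[:, 0]
mask = genetic_select(X, y)
print("selected columns for target 0:", np.flatnonzero(mask))
```

In a graphical Granger setting this search would be repeated for each of the p target variables, and an edge from variable j to the target would be drawn when any lag of j is selected. Swapping `bic` for another criterion only changes the scoring function, not the search.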
Pages: 1459-1466
Number of pages: 8