Graphical Granger Causality by Information-Theoretic Criteria

被引：2

作者：

Hlavackova-Schindler, Katerina ^{[1
]}

Plant, Claudia ^{[1
,2
]}

机构：

[1] Univ Vienna, Fac Comp Sci, Vienna, Austria

[2] Univ Vienna, Ds UniVie, Vienna, Austria

来源：

ECAI 2020: 24TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE | 2020年 / 325卷

关键词：

EXPRESSION; SELECTION; MODELS;

D O I：

10.3233/FAIA200252

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Causal inference by a graphical Granger model (GGM) among p variables is typically solved by p penalized linear regression problems in time series with a given lag. In practice however, the estimates of a penalized linear regression after a finite number of steps can be still far from the optimum. Furthermore, the selection of the regularization parameter, influencing the precision of the model is not trivial, especially when the corresponding design matrix is super-collinear. In this paper, for the first time we concept a graphical Granger model as an instance of combinatorial optimization. Computing maximum likelihood (ML) estimates of the regression coefficients and of the variance for each of p variables we propose an information-theoretic graphical Granger model (ITGGM). In the sense of information theory, the criterion to be minimized is the complexity of the class of the selected models together with the complexity of the data set. Following this idea, we propose four various information-theoretic (IT) objective functions based on stochastic complexity, on minimum message length, on Akaike and on Bayesian information criterion. To find their minima we propose a genetic algorithm operating with populations of subsets of regressor variables. The feature selection by the ITGGM with any of the functions is parameter-free in the sense that beside the ML estimates which are for each and within the model constant, no adjustable parameter is added into these objective functions. We further provide a theoretical analysis of the convergence properties of the GGM with the proposed IT functions. We test the performance of the functions in terms of F-1 measure with respect to two common penalized GGMs on synthetic and real data. The experiments demonstrate the significant superiority of the IT criteria in terms of F-1 measure over the two alternatives of the penalized GGM for Granger causal inference.

引用

页码：1459 / 1466

页数：8

共 50 条

[41] INFORMATION-THEORETIC VIEW OF CONTROL
Roy, Prateep
Cela, Arben
Hamam, Yskandar
ICINCO 2009: PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON INFORMATICS IN CONTROL, AUTOMATION AND ROBOTICS, VOL 3, 2009, : 5 - +
[42] Distributed Information-Theoretic Biclustering
Pichler, Georg
Piantanida, Pablo
Matz, Gerald
2016 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY, 2016, : 1083 - 1087
[43] CONTENT - SEMANTIC AND INFORMATION-THEORETIC
CHURCHLAND, PM
CHURCHLAND, PS
BEHAVIORAL AND BRAIN SCIENCES, 1983, 6 (01) : 67 - 68
[44] An Information-Theoretic Protocol Compiler
Sahai, Amit
2012 IEEE INFORMATION THEORY WORKSHOP (ITW), 2012, : 1 - 4
[45] Distributed information-theoretic clustering
Pichler, Georg
Piantanida, Pablo
Matz, Gerald
INFORMATION AND INFERENCE-A JOURNAL OF THE IMA, 2022, 11 (01) : 137 - 166
[46] Information-Theoretic System Identification
Chernyshov, K. R.
2017 4TH INTERNATIONAL CONFERENCE ON CONTROL, DECISION AND INFORMATION TECHNOLOGIES (CODIT), 2017, : 1117 - 1122
[47] An information-theoretic perspective on teleconnections
Greene, Arthur M.
GEOPHYSICAL RESEARCH LETTERS, 2013, 40 (19) : 5258 - 5262
[48] An information-theoretic model for steganography
Cachin, C
INFORMATION AND COMPUTATION, 2004, 192 (01) : 41 - 56
[49] Information-Theoretic Security with Asymmetries
Beyne, Tim
Chen, Yu Long
ADVANCES IN CRYPTOLOGY - CRYPTO 2024, PT IV, 2024, 14923 : 463 - 494
[50] An information-theoretic model for steganography
Cachin, C
INFORMATION HIDING, 1998, 1525 : 306 - 318

← 1 2 3 4 5 →