Graphical Granger Causality by Information-Theoretic Criteria

Cited by: 2
Authors
Hlavackova-Schindler, Katerina [1]
Plant, Claudia [1,2]
Affiliations
[1] Univ Vienna, Fac Comp Sci, Vienna, Austria
[2] Univ Vienna, Ds UniVie, Vienna, Austria
Keywords
EXPRESSION; SELECTION; MODELS;
DOI
10.3233/FAIA200252
Chinese Library Classification
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Causal inference by a graphical Granger model (GGM) among p variables is typically solved as p penalized linear regression problems on time series with a given lag. In practice, however, the estimates of a penalized linear regression after a finite number of steps can still be far from the optimum. Furthermore, selecting the regularization parameter, which influences the precision of the model, is not trivial, especially when the corresponding design matrix is super-collinear. In this paper, for the first time, we formulate a graphical Granger model as an instance of combinatorial optimization. Computing maximum likelihood (ML) estimates of the regression coefficients and of the variance for each of the p variables, we propose an information-theoretic graphical Granger model (ITGGM). In the sense of information theory, the criterion to be minimized is the complexity of the class of the selected models together with the complexity of the data set. Following this idea, we propose four information-theoretic (IT) objective functions, based on stochastic complexity, on minimum message length, on the Akaike information criterion, and on the Bayesian information criterion. To find their minima, we propose a genetic algorithm operating on populations of subsets of regressor variables. Feature selection by the ITGGM with any of these functions is parameter-free in the sense that, besides the ML estimates, which are constant for and within each model, no adjustable parameter is added to these objective functions. We further provide a theoretical analysis of the convergence properties of the GGM with the proposed IT functions. We test the performance of the functions in terms of the F1 measure against two common penalized GGMs on synthetic and real data. The experiments demonstrate the significant superiority of the IT criteria in terms of the F1 measure over the two penalized GGM alternatives for Granger causal inference.
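The subset-selection idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes Gaussian noise, covers only the Akaike and Bayesian criteria (not stochastic complexity or minimum message length), and replaces the genetic algorithm with an exhaustive search over subsets, which is feasible only for small p. The function names `lagged_design`, `ic_score`, and `select_parents` are hypothetical.

```python
import itertools
import numpy as np


def lagged_design(series, lag):
    """Build the lagged regression problem from a (T, p) array.

    Returns X of shape (T - lag, p * lag) and Y of shape (T - lag, p).
    Columns j*lag .. j*lag + lag - 1 of X hold lags 1..lag of variable j.
    """
    T, p = series.shape
    X = np.column_stack(
        [series[lag - k: T - k, j] for j in range(p) for k in range(1, lag + 1)]
    )
    return X, series[lag:]


def ic_score(X, y, subset, lag, criterion="bic"):
    """AIC- or BIC-style score of regressing y on all lags of the variables in subset.

    Uses ML (least-squares) coefficients and the ML variance estimate;
    additive constants that do not depend on the subset are dropped.
    """
    n = len(y)
    cols = [j * lag + k for j in subset for k in range(lag)]
    if cols:
        beta, *_ = np.linalg.lstsq(X[:, cols], y, rcond=None)
        resid = y - X[:, cols] @ beta
    else:
        resid = y
    sigma2 = max(float(np.mean(resid ** 2)), 1e-12)  # guard against log(0)
    penalty = np.log(n) if criterion == "bic" else 2.0
    return n * np.log(sigma2) + penalty * len(cols)


def select_parents(X, y, p, lag, criterion="bic"):
    """Exhaustive search (assumption: stands in for the genetic algorithm)."""
    best_score, best_subset = None, ()
    for r in range(p + 1):
        for subset in itertools.combinations(range(p), r):
            score = ic_score(X, y, subset, lag, criterion)
            if best_score is None or score < best_score:
                best_score, best_subset = score, subset
    return best_subset
```

Calling `select_parents(X, Y[:, i], p, lag)` for each target variable i yields one candidate set of Granger-causal parents per variable. No regularization parameter is tuned; the only choices are the lag and the criterion.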
Pages: 1459-1466
Number of pages: 8