New partition based measures for data compatibility and information gain

被引:4
|
作者
Shi, Daoyuan [1 ]
Chen, Ming-Hui [1 ]
Kuo, Lynn [1 ]
O. Lewis, Paul [2 ]
机构
[1] Univ Connecticut, Dept Stat, Storrs, CT 06269 USA
[2] Univ Connecticut, Dept Ecol & Evolutionary Biol, Storrs, CT USA
基金
美国国家科学基金会;
关键词
entropy; highest posterior density (HPD) region; information; Kullback‐ Leibler (KL) divergence; posterior distribution; power prior;
D O I
10.1002/sim.8982
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
It is of great practical importance to compare and combine data from different studies in order to carry out appropriate and more powerful statistical inference. We propose a partition based measure to quantify the compatibility of two datasets using their respective posterior distributions. We further propose an information gain measure to quantify the information increase (or decrease) in combining two datasets. These measures are well calibrated and efficient computational algorithms are provided for their calculations. We use examples in a benchmark dose toxicology study, a six cities pollution data and a melanoma clinical trial to illustrate how these two measures are useful in combining current data with historical data and missing data.
引用
收藏
页码:3560 / 3581
页数:22
相关论文
共 50 条
  • [31] Developing and testing new smoking measures for the health plan employer data and information set
    Pbert, L
    Vuckovic, N
    Ockene, JK
    Hollis, JF
    Riedlinger, K
    MEDICAL CARE, 2003, 41 (04) : 550 - 559
  • [32] MIGR: A Categorical Data Clustering Algorithm Based on Information Gain in Rough Set Theory
    Raheem, Saddam
    Al Shehabi, Shadi
    Nassief, Amaal Mohi
    INTERNATIONAL JOURNAL OF UNCERTAINTY FUZZINESS AND KNOWLEDGE-BASED SYSTEMS, 2022, 30 (05) : 757 - 771
  • [33] Group Feature Screening Based on Information Gain Ratio for Ultrahigh-Dimensional Data
    Wang, Zhongzheng
    Deng, Guangming
    Yu, Jianqi
    JOURNAL OF MATHEMATICS, 2022, 2022
  • [34] Hybrid Information Gain Based Fuzzy Roughset Feature Selection in Cancer Microarray Data
    Chinnaswamy, Arunkumar
    Srinivasan, Ramakrishnan
    2017 INNOVATIONS IN POWER AND ADVANCED COMPUTING TECHNOLOGIES (I-PACT), 2017,
  • [35] Information gain-based semi-supervised feature selection for hybrid data
    Shu, Wenhao
    Yan, Zhenchao
    Yu, Jianhui
    Qian, Wenbin
    APPLIED INTELLIGENCE, 2023, 53 (06) : 7310 - 7325
  • [36] Information gain-based semi-supervised feature selection for hybrid data
    Wenhao Shu
    Zhenchao Yan
    Jianhui Yu
    Wenbin Qian
    Applied Intelligence, 2023, 53 : 7310 - 7325
  • [37] Inversion of Bayes formula and measures of Bayesian information gain and pairwise dependence
    Ng, Kai Wang
    Tong, Howell
    STATISTICS AND ITS INTERFACE, 2011, 4 (01) : 95 - 103
  • [38] COMPATIBILITY AND ATTAINABILITY OF MATRICES OF CORRELATION-BASED MEASURES OF CONCORDANCE
    Hofert, Marius
    Koike, Takaaki
    ASTIN BULLETIN, 2019, 49 (03): : 885 - 918
  • [39] MEASURES OF DEPENDENCE IN NORMAL-MODELS AND EXPONENTIAL MODELS BY INFORMATION GAIN
    INABA, T
    SHIRAHATA, S
    BIOMETRIKA, 1986, 73 (02) : 345 - 352
  • [40] Information Geometry, Complexity Measures and Data Analysis
    Amigo, Jose M.
    Tempesta, Piergiulio
    ENTROPY, 2022, 24 (12)