Exact Statistical Tests for Heterogeneity of Frequencies Based on Extreme Values

被引:4
|
作者
Wu, Chih-Chieh [1 ]
Grimson, Roger C. [2 ]
Shete, Sanjay
机构
[1] Univ Texas MD Anderson Canc Ctr, Dept Epidemiol, Unit 1340, Houston, TX 77030 USA
[2] SUNY Stony Brook, Dept Prevent Med, Stony Brook, NY 11794 USA
关键词
Classical occupancy model; Extreme value; Maximum; Minimum; Pearson's 2 test; Temporal anomalies; DISEASE CLUSTERS; TIME;
D O I
10.1080/03610910903528335
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Sophisticated statistical analyses of incidence frequencies are often required for various epidemiologic and biomedical applications. Among the most commonly applied methods is the Pearson's 2 test, which is structured to detect non specific anomalous patterns of frequencies and is useful for testing the significance for incidence heterogeneity. However, the Pearson's 2 test is not efficient for assessing the significance of frequency in a particular cell (or class) to be attributed to chance alone. We recently developed statistical tests for detecting temporal anomalies of disease cases based on maximum and minimum frequencies; these tests are actually designed to test of significance for a particular high or low frequency. The purpose of this article is to demonstrate merits of these tests in epidemiologic and biomedical studies. We show that our proposed methods are more sensitive and powerful for testing extreme cell counts than is the Pearson's 2 test. This feature could provide important and valuable information in epidemiologic or biomeidcal studies. We elucidated and illustrated the differences in sensitivity among our tests and the Pearson's 2 test by analyzing a data set of Langerhans cell histiocytosis cases and its hypothetical sets. We also computed and compared the statistical power of these methods using various sets of cell numbers and alternative frequencies. The investigation of statistical sensitivity and power presented in this work will provide investigators with useful guidelines for selecting the appropriate tests for their studies.
引用
收藏
页码:612 / 623
页数:12
相关论文
共 50 条
  • [31] Rule induction based on frequencies of attribute values
    Borowik, Grzegorz
    Kowalski, Karol
    PHOTONICS APPLICATIONS IN ASTRONOMY, COMMUNICATIONS, INDUSTRY, AND HIGH-ENERGY PHYSICS EXPERIMENTS 2015, 2015, 9662
  • [32] Statistical tests under Dallal's model: Asymptotic and exact methods
    Li, Zhiming
    Ma, Changxing
    Ai, Mingyao
    PLOS ONE, 2020, 15 (11):
  • [33] Dipole tilt effects in plasma sheet By: statistical model and extreme values
    Petrukovich, A. A.
    ANNALES GEOPHYSICAE, 2009, 27 (03) : 1343 - 1352
  • [34] ON HETEROGENEITY TESTS BASED ON EFFICIENT SCORES
    TARONE, RE
    BIOMETRIKA, 1985, 72 (01) : 91 - 95
  • [35] USE OF THE DISTRIBUTION THEORY OF EXTREME VALUES OF A SAMPLE IN THE EVALUATION OF STRENGTH TESTS
    LINDTROP, NG
    INDUSTRIAL LABORATORY, 1963, 29 (07): : 902 - 905
  • [36] On Statistical Inference Based on Record Values
    Andrey Feuerverger
    Peter Hall
    Extremes, 1998, 1 (2) : 169 - 190
  • [37] EXACT P-VALUES FOR F-TEST AND T-TESTS
    LACKRITZ, JR
    AMERICAN STATISTICIAN, 1984, 38 (04): : 312 - 314
  • [38] Inconsistency of Tests Based on Extremal Values
    Fetisova, T. A.
    LOBACHEVSKII JOURNAL OF MATHEMATICS, 2016, 37 (04) : 418 - 421
  • [39] A combined statistical and machine learning approach for spatial prediction of extreme wildfire frequencies and sizes
    Daniela Cisneros
    Yan Gong
    Rishikesh Yadav
    Arnab Hazra
    Raphaël Huser
    Extremes, 2023, 26 : 301 - 330
  • [40] A combined statistical and machine learning approach for spatial prediction of extreme wildfire frequencies and sizes
    Cisneros, Daniela
    Gong, Yan
    Yadav, Rishikesh
    Hazra, Arnab
    Huser, Raphael
    EXTREMES, 2023, 26 (02) : 301 - 330