Testing goodness-of-fit with interval data

被引:0
|
作者
Vozhov, Stanislav S. [1 ]
Chimitova, Ekaterina V. [1 ]
机构
[1] Novosibirsk State Tech Univ, Novosibirsk, Russia
关键词
interval data; nonparametric estimation of distribution function; Turnbull algorithm; ICM-algorithm;
D O I
10.17223/19988605/34/4
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The main terms of interval data analysis was initially founded the measurement theory in metrology where an interval uncertainty is naturally introduced. It is expected that every observation is measured by an instrument with absolute error Delta. Thus, if the precise value of an observed response is <(x)over dot>, measurement error is e is an element of[-Delta,Delta], then the measurement is equal to x = <(x)over dot>+e. In this case, we deal with a usual complete sample X-n = {X-1,..., X-n}. Nevertheless, the measurement can be represented as an interval (x-Delta, x+Delta) = (L, R). In this case, for the sample of n observations we obtain an interval sample of the form I-n = {(L-1, R-1),..., (L-n, R-n)}. The main idea of nonparametric estimation of the distribution function with interval data is based on maximization of the loglikelihood function In L(I-n) = Sigma(n)(i=1) ln (F(R-i)-(F(L-i)) at the boundary points of observations L-i, R-i, i = (1,n) over bar under condition of monotonicity of the distribution function. The Turnbull and ICM algorithms are used for calculation of the nonparametric estimate of the distribution function with interval data. The accuracy of the estimates calculation is the same for both algorithm, but the computing time is less for the ICM algorithm. Unknown distribution parameters can be estimated by the maximum likelihood method, which is based on the maximization of likelihood function by parameter theta: L(I-n vertical bar theta) = Pi(n)(i=1) (F(R-i vertical bar theta) - F(L-i vertical bar theta)). Thus, the maximum likelihood estimates can be written as (theta) over cap = argmax ln L(I-n|theta)(theta)is an element of Theta(theta=Theta.) In this paper, the modifications of the classical goodness-of-fit tests for composite hypothesis H-0 : F(t) is an element of {F-0(t;theta),theta is an element of Theta} have been proposed. The main idea of this modification is based on the usage of nonparametric estimate of the distribution function, obtained by the ICM algorithm, instead of the empirical distribution function. In this case, we have the test statistic of the Kolmogorov type as D-n = sup(0 < t <tau m)vertical bar(F) over cap (t) - F-0(t,(theta) over cap vertical bar the statistic of Cramer-von Mises-Smirnov type test as S-omega 2 = integral(tau m)(0)((F) over cap (n)(t) - F-0(t,(theta) over cap))(2) dF(0)(t,(theta) over cap), and the statistic of Anderson-Darling type test as S-W = integral(tau m)(0)((F) over cap (n)(t) - F-0(t,(theta) over cap))(2) dF(0)(t,(theta) over cap)/F-0(t,(theta) over cap)(1-F-0(t,(theta) over cap)), where (F) over capn(t) is the nonparametric estimate of the distribution function by the interval data, 0 < tau(0) < tau(1) <... <tau(m) are ordered different values L-i and R-i, i=(1,n) over bar. We have formulated the sequence of steps for estimation of the p-value for the proposed tests. The hypothesis is not rejected if the obtained p-value is larger than the significance level alpha. As an example, we have tested the normality hypothesis by the interval sample of consumer demand prices for bioenergy drink SPC "SAVA". It has been shown that there is no reason for rejecting the hypothesis of normality of consumer demand prices.
引用
收藏
页码:35 / 42
页数:8
相关论文
共 50 条
  • [1] Goodness-of-fit testing in interval censoring case 1
    Koul, HL
    Yi, TT
    [J]. STATISTICS & PROBABILITY LETTERS, 2006, 76 (07) : 709 - 718
  • [2] The Kolmogorov goodness-of-fit test for interval-valued data
    Grzegorzewski, Przemyslaw
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE), 2017,
  • [3] A flexible semi-Markov model for interval-censored data and goodness-of-fit testing
    Foucher, Y.
    Giral, M.
    Soulillou, J. P.
    Daures, J. P.
    [J]. STATISTICAL METHODS IN MEDICAL RESEARCH, 2010, 19 (02) : 127 - 145
  • [4] Order statistics in goodness-of-fit testing
    Glen, AG
    Leemis, LM
    Barr, DR
    [J]. IEEE TRANSACTIONS ON RELIABILITY, 2001, 50 (02) : 209 - 213
  • [5] Power maps in goodness-of-fit testing
    Fischer, T.
    Kamps, U.
    [J]. COMPUTATIONAL STATISTICS, 2013, 28 (03) : 1365 - 1382
  • [6] A Bayesian formulation of exploratory data analysis and goodness-of-fit testing
    Gelman, A
    [J]. INTERNATIONAL STATISTICAL REVIEW, 2003, 71 (02) : 369 - 382
  • [7] A unified approach to goodness-of-fit testing for spherical and hyperspherical data
    Ebner, Bruno
    Henze, Norbert
    Meintanis, Simos
    [J]. STATISTICAL PAPERS, 2024, 65 (06) : 3447 - 3475
  • [8] Minimax nonparametric goodness-of-fit testing
    Ingster, YI
    Suslina, IA
    [J]. FOUNDATIONS OF STATISTICAL INFERENCE, 2003, : 141 - 152
  • [9] Goodness-of-fit testing: the thresholding approach
    Kim, Min Hee
    Akritas, Michael G.
    [J]. JOURNAL OF NONPARAMETRIC STATISTICS, 2012, 24 (01) : 119 - 138
  • [10] Power maps in goodness-of-fit testing
    T. Fischer
    U. Kamps
    [J]. Computational Statistics, 2013, 28 : 1365 - 1382