Correcting for regression dilution bias: comparison of methods for a single predictor variable

被引:230
|
作者
Frost, C
Thompson, SG
机构
[1] Univ London London Sch Hyg & Trop Med, Med Stat Unit, London WC1E 7HT, England
[2] Univ London Imperial Coll Sci Technol & Med, Sch Med, London, England
关键词
correction methods; epidemiology; method comparison; regression dilution bias; variance formulae;
D O I
10.1111/1467-985X.00164
中图分类号
O1 [数学]; C [社会科学总论];
学科分类号
03 ; 0303 ; 0701 ; 070101 ;
摘要
In an epidemiological study the regression slope between a response and predictor variable is underestimated when the predictor variable is measured imprecisely. Repeat measurements of the predictor in individuals in a subset of the study or in a separate study can be used to estimate a multiplicative factor to correct for this 'regression dilution bias'. In applied statistics publications various methods have been used to estimate this correction factor. Here we compare six different estimation methods and explain how they fall into two categories, namely regression and correlation-based methods. We provide new asymptotic variance formulas for the optimal correction factors in each category, when these are estimated from the repeat measurements subset alone, and show analytically and by simulation that the correlation method of choice gives uniformly lower variance. The simulations also show that, when the correction factor is not much greater than 1, this correlation method gives a correction factor which is closer to the true value than that from the best regression method on up to 80% of occasions. We also provide a variance formula for a modified correlation method which uses the standard deviation of the predictor variable in the main study; this shows further improved performance provided that the correction factor is not too extreme. A confidence interval for a corrected regression slope in an epidemiological study should reflect the imprecision of both the uncorrected slope and the estimated correction factor. We provide formulae for this and show that, particularly when the correction factor is large and the size of the subset of repeat measures is small, the effect of allowing for imprecision in the estimated correction factor can be substantial.
引用
收藏
页码:173 / 189
页数:17
相关论文
共 50 条
  • [21] Comparison of Bias and Resolvability in Single-Cell and Single-Transcript Methods
    Rammohan, Jayan
    Lund, Steven P.
    Alperovich, Nina
    Paralanov, Vanya
    Strychalski, Elizabeth A.
    Ross, David
    BIOPHYSICAL JOURNAL, 2021, 120 (03) : 136A - 136A
  • [22] Comparison of bias and resolvability in single-cell and single-transcript methods
    Jayan Rammohan
    Steven P. Lund
    Nina Alperovich
    Vanya Paralanov
    Elizabeth A. Strychalski
    David Ross
    Communications Biology, 4
  • [23] Comparison of bias and resolvability in single-cell and single-transcript methods
    Rammohan, Jayan
    Lund, Steven P.
    Alperovich, Nina
    Paralanov, Vanya
    Strychalski, Elizabeth A.
    Ross, David
    COMMUNICATIONS BIOLOGY, 2021, 4 (01)
  • [25] Correcting the bias in least squares regression with volume-rescaled sampling
    Derezinski, Michal
    Warmuth, Manfred K.
    Hsu, Daniel
    22ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 89, 2019, 89 : 944 - 953
  • [26] Correcting for Selection Bias and Missing Response in Regression using Privileged Information
    Boeken, Philip
    de Kroon, Noud
    de Jong, Mathijs
    Mooij, Joris M.
    Zoeter, Onno
    UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, 2023, 216 : 195 - 205
  • [27] Comparison of conventional and machine learning methods for bias correcting CMIP6 rainfall and temperature in Nigeria
    Tanimu, Bashir
    Bello, Al-Amin Danladi
    Abdullahi, Sule Argungu
    Ajibike, Morufu A.
    Yaseen, Zaher Mundher
    Kamruzzaman, Mohammad
    Muhammad, Mohd Khairul Idlan bin
    Shahid, Shamsuddin
    THEORETICAL AND APPLIED CLIMATOLOGY, 2024, 155 (6) : 4423 - 4452
  • [28] A comparison of random forest variable selection methods for regression modeling of continuous outcomes
    O'Connell, Nathaniel S.
    Jaeger, Byron C.
    Bullock, Garrett S.
    Speiser, Jaime Lynn
    BRIEFINGS IN BIOINFORMATICS, 2025, 26 (02)
  • [29] DERIVATIVE ESTIMATION IN NONPARAMETRIC REGRESSION WITH RANDOM PREDICTOR VARIABLE
    MACK, YP
    MULLER, HG
    SANKHYA-THE INDIAN JOURNAL OF STATISTICS SERIES A, 1989, 51 : 59 - 72
  • [30] Intersalt data - Correction for regression dilution bias in Intersalt study was misleading
    Smith, GD
    Phillips, AN
    BRITISH MEDICAL JOURNAL, 1997, 315 (7106): : 485 - 486