Correcting for regression dilution bias: comparison of methods for a single predictor variable

被引：230

作者：

Frost, C

Thompson, SG

机构：

[1] Univ London London Sch Hyg & Trop Med, Med Stat Unit, London WC1E 7HT, England

[2] Univ London Imperial Coll Sci Technol & Med, Sch Med, London, England

来源：

JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES A-STATISTICS IN SOCIETY | 2000年 / 163卷

关键词：

correction methods; epidemiology; method comparison; regression dilution bias; variance formulae;

D O I：

10.1111/1467-985X.00164

中图分类号：

O1 [数学]; C [社会科学总论];

学科分类号：

03 ; 0303 ; 0701 ; 070101 ;

摘要：

In an epidemiological study the regression slope between a response and predictor variable is underestimated when the predictor variable is measured imprecisely. Repeat measurements of the predictor in individuals in a subset of the study or in a separate study can be used to estimate a multiplicative factor to correct for this 'regression dilution bias'. In applied statistics publications various methods have been used to estimate this correction factor. Here we compare six different estimation methods and explain how they fall into two categories, namely regression and correlation-based methods. We provide new asymptotic variance formulas for the optimal correction factors in each category, when these are estimated from the repeat measurements subset alone, and show analytically and by simulation that the correlation method of choice gives uniformly lower variance. The simulations also show that, when the correction factor is not much greater than 1, this correlation method gives a correction factor which is closer to the true value than that from the best regression method on up to 80% of occasions. We also provide a variance formula for a modified correlation method which uses the standard deviation of the predictor variable in the main study; this shows further improved performance provided that the correction factor is not too extreme. A confidence interval for a corrected regression slope in an epidemiological study should reflect the imprecision of both the uncorrected slope and the estimated correction factor. We provide formulae for this and show that, particularly when the correction factor is large and the size of the subset of repeat measures is small, the effect of allowing for imprecision in the estimated correction factor can be substantial.

引用

页码：173 / 189

页数：17

共 50 条

[21] Comparison of Bias and Resolvability in Single-Cell and Single-Transcript Methods
Rammohan, Jayan
Lund, Steven P.
Alperovich, Nina
Paralanov, Vanya
Strychalski, Elizabeth A.
Ross, David
BIOPHYSICAL JOURNAL, 2021, 120 (03) : 136A - 136A
[22] Comparison of bias and resolvability in single-cell and single-transcript methods
Jayan Rammohan
Steven P. Lund
Nina Alperovich
Vanya Paralanov
Elizabeth A. Strychalski
David Ross
Communications Biology, 4
[23] Comparison of bias and resolvability in single-cell and single-transcript methods
Rammohan, Jayan
Lund, Steven P.
Alperovich, Nina
Paralanov, Vanya
Strychalski, Elizabeth A.
Ross, David
COMMUNICATIONS BIOLOGY, 2021, 4 (01)
[24] SINGLE-VARIABLE POISSON REGRESSION - GOODNESS-OF-FIT TEST AND THE COMPARISON OF REGRESSION COEFFICIENTS
PYNE, DA
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1979, 74 (366) : 489 - 493
[25] Correcting the bias in least squares regression with volume-rescaled sampling
Derezinski, Michal
Warmuth, Manfred K.
Hsu, Daniel
22ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 89, 2019, 89 : 944 - 953
[26] Correcting for Selection Bias and Missing Response in Regression using Privileged Information
Boeken, Philip
de Kroon, Noud
de Jong, Mathijs
Mooij, Joris M.
Zoeter, Onno
UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, 2023, 216 : 195 - 205
[27] Comparison of conventional and machine learning methods for bias correcting CMIP6 rainfall and temperature in Nigeria
Tanimu, Bashir
Bello, Al-Amin Danladi
Abdullahi, Sule Argungu
Ajibike, Morufu A.
Yaseen, Zaher Mundher
Kamruzzaman, Mohammad
Muhammad, Mohd Khairul Idlan bin
Shahid, Shamsuddin
THEORETICAL AND APPLIED CLIMATOLOGY, 2024, 155 (6) : 4423 - 4452
[28] A comparison of random forest variable selection methods for regression modeling of continuous outcomes
O'Connell, Nathaniel S.
Jaeger, Byron C.
Bullock, Garrett S.
Speiser, Jaime Lynn
BRIEFINGS IN BIOINFORMATICS, 2025, 26 (02)
[29] DERIVATIVE ESTIMATION IN NONPARAMETRIC REGRESSION WITH RANDOM PREDICTOR VARIABLE
MACK, YP
MULLER, HG
SANKHYA-THE INDIAN JOURNAL OF STATISTICS SERIES A, 1989, 51 : 59 - 72
[30] Intersalt data - Correction for regression dilution bias in Intersalt study was misleading
Smith, GD
Phillips, AN
BRITISH MEDICAL JOURNAL, 1997, 315 (7106): : 485 - 486

← 1 2 3 4 5 →