On Making Valid Inferences by Integrating Data from Surveys and Other Sources

被引:39
|
作者
Rao, J. N. K. [1 ]
机构
[1] Carleton Univ, Ottawa, ON, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Big data; Dual frames; Probability sampling; Non-probability sampling; Sample selection bias; Small area estimation; SMALL-AREA ESTIMATION; AUXILIARY INFORMATION; CALIBRATION APPROACH; COMBINING DATA; MODEL; ESTIMATORS; FUTURE; NONRESPONSE; PREDICTION; IMPUTATION;
D O I
10.1007/s13571-020-00227-w
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Survey samplers have long been using probability samples from one or more sources in conjunction with census and administrative data to make valid and efficient inferences on finite population parameters. This topic has received a lot of attention more recently in the context of data from non-probability samples such as transaction data, web surveys and social media data. In this paper, I will provide a brief overview of probability sampling methods first and then discuss some recent methods, based on models for the non-probability samples, which could lead to useful inferences from a non-probability sample by itself or when combined with a probability sample. I will also explain how big data may be used as predictors in small area estimation, a topic of current interest because of the growing demand for reliable local area statistics.
引用
收藏
页码:242 / 272
页数:31
相关论文
共 50 条
  • [31] THE OTHER HALF OF THE CASSINESI CHIOSE FIRST SURVEYS ON THE SOURCES
    Alvino, GIuseppe
    [J]. TICONTRE-TEORIA TESTO TRADUZIONE, 2023, (20):
  • [32] MAKING CAUSAL INFERENCES WITH ORDINAL DATA - REYNOLDS,NT
    BASU, AK
    [J]. SOCIOLOGY AND SOCIAL RESEARCH, 1972, 57 (01): : 107 - 109
  • [33] CAUSAL ANALYSIS OF DATA FROM PANEL STUDIES AND OTHER KINDS OF SURVEYS
    GOODMAN, LA
    [J]. AMERICAN JOURNAL OF SOCIOLOGY, 1973, 78 (05) : 1135 - 1191
  • [34] Integrating XML sources into a data warehouse
    Vrdoljak, Boris
    Banek, Marko
    Skocir, Zoran
    [J]. DATA ENGINEERING ISSUES IN E-COMMERCE AND SERVICES, PROCEEDINGS, 2006, 4055 : 133 - 142
  • [35] On Inferences from Completed Data
    Haddock, Jamie
    Molitor, Denali
    Needell, Deanna
    Sambandam, Sneha
    Song, Joy
    Sun, Simon
    [J]. 2019 13TH INTERNATIONAL CONFERENCE ON SAMPLING THEORY AND APPLICATIONS (SAMPTA), 2019,
  • [36] Inferences from aggregated data
    Jarjoura, D
    [J]. ACADEMIC EMERGENCY MEDICINE, 2003, 10 (08) : 881 - 882
  • [37] Curating and Integrating Data from Multiple Sources to Support Healthcare Analytics
    Ng, Kenney
    Kakkanatt, Chris
    Benigno, Michael
    Thompson, Clay
    Jackson, Margaret
    Cahan, Amos
    Zhu, Xinxin
    Zhang, Ping
    Huang, Paul
    [J]. MEDINFO 2015: EHEALTH-ENABLED HEALTH, 2015, 216 : 1056 - 1056
  • [38] Integrating human single-cell data from multiple sources
    Li, Chenwei
    Liu, Zedao
    Zhang, Zemin
    [J]. QUANTITATIVE BIOLOGY, 2022, 10 (03) : 299 - 300
  • [39] Integrating human single-cell data from multiple sources
    Chenwei Li
    Zedao Liu
    Zemin Zhang
    [J]. Quantitative Biology, 2022, 10 (03) : 299 - 300
  • [40] PALEOCEANOGRAPHY OF THE CRETACEOUS/TERTIARY BOUNDARY EVENT: INFERENCES FROM STABLE ISOTOPIC AND OTHER DATA
    Zachos, J. C.
    Arthur, M. A.
    [J]. PALEOCEANOGRAPHY, 1986, 1 (01): : 5 - 26