Estimation of spatial econometric linear models with large datasets: How big can spatial Big Data be?

被引:11
|
作者
Arbia, G. [1 ,2 ]
Ghiringhelli, C. [1 ]
Mira, A. [3 ,4 ]
机构
[1] Univ Svizzera Italians, Lugano, Switzerland
[2] Univ Cattolica Sacro Cuore, Milan, Italy
[3] Univ Svizzera Italiana, Inst Computat Sci, Data Sci Lab, Lugano, Switzerland
[4] Univ Insubria, Varese, Italy
基金
瑞士国家科学基金会;
关键词
Big spatial data; Computational issues; Spatial econometric models; Maximum Likelihood; Bayesian estimator; Spatial two stages least squares; Dense matrix; AUTOREGRESSIVE MODELS; MATRICES; ISOTROPY;
D O I
10.1016/j.regsciurbeco.2019.01.006
中图分类号
F [经济];
学科分类号
02 ;
摘要
Spatial econometrics is currently experiencing the Big Data revolution both in terms of the volume of data and the velocity with which they are accumulated. Regional data, employed traditionally in spatial econometric modeling, can be very large, with information that are increasingly available at a very fine resolution level such as census tracts, local markets, town blocks, regular grids or other small partitions of the territory. When dealing with spatial microeconometric models referred to the granular observations of the single economic agent, the number of observations available can be a lot higher. This paper reports the results of a systematic simulation study on the limits of the current methodologies when estimating spatial models with large datasets. In our study we simulate a Spatial Lag Model (SLM), we estimate it using Maximum Likelihood (ML), Two Stages Least Squares (2SLS) and Bayesian estimator (B), and we test their performances for different sample sizes and different levels of sparsity of the weight matrices. We considered three performance indicators, namely: computing time, storage required and accuracy of the estimators. The results show that using standard computer capabilities the analysis becomes prohibitive and unreliable when the sample size is greater than 70,000 even for low levels of sparsity. This result suggests that new approaches should be introduced to analyze the big datasets that are quickly becoming the new standard in spatial econometrics.
引用
收藏
页码:67 / 73
页数:7
相关论文
共 50 条
  • [1] Storing and Clustering Large Spatial Datasets Using Big Data Technologies
    Cortinas, Alejandro
    Luaces, Miguel R.
    Rodeiro, Tirso V.
    WEB AND WIRELESS GEOGRAPHICAL INFORMATION SYSTEMS, W2GIS 2018, 2018, 10819 : 15 - 24
  • [2] An Overview on Econometric Models for Linear Spatial Panel Data
    Sutradhar, Brajendra C.
    SANKHYA-SERIES A-MATHEMATICAL STATISTICS AND PROBABILITY, 2021, 83 (01): : 206 - 244
  • [3] An Overview on Econometric Models for Linear Spatial Panel Data
    Brajendra C. Sutradhar
    Sankhya A, 2021, 83 : 206 - 244
  • [4] Small Stories in Big Data: Gaining Insights From Large Spatial Point Pattern Datasets
    Poorthuis, Ate
    Zook, Matthew
    CITYSCAPE, 2015, 17 (01) : 151 - 160
  • [5] LARGE SAMPLE PROPERTIES OF BAYESIAN ESTIMATION OF SPATIAL ECONOMETRIC MODELS
    Han, Xiaoyi
    Lee, Lung-Fei
    Xu, Xingbai
    ECONOMETRIC THEORY, 2021, 37 (04) : 708 - 746
  • [6] Big Spatial Data Mining
    Wang Shuliang
    Ding Gangyi
    Zhong Ming
    2013 IEEE INTERNATIONAL CONFERENCE ON BIG DATA, 2013,
  • [7] The Era of Big Spatial Data
    Eldawy, Ahmed
    Mokbel, Mohamed F.
    2015 13TH IEEE INTERNATIONAL CONFERENCE ON DATA ENGINEERING WORKSHOPS (ICDEW), 2015, : 42 - 49
  • [8] The Era of Big Spatial Data
    Eldawy, Ahmed
    Mokbel, Mohamed F.
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2017, 10 (12): : 1992 - 1995
  • [9] The Era of Big Spatial Data
    Eldawy, Ahmed
    Mokbel, Mohamed F.
    2016 32ND IEEE INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2016, : 1424 - 1427
  • [10] Benchmarking Spatial Big Data
    Shekhar, Shashi
    Evans, Michael R.
    Gunturi, Viswanath
    Yang, KwangSoo
    Cugler, Daniel Cintra
    SPECIFYING BIG DATA BENCHMARKS, 2014, 8163 : 81 - 93