Estimation of spatial econometric linear models with large datasets: How big can spatial Big Data be?

被引:11
|
作者
Arbia, G. [1 ,2 ]
Ghiringhelli, C. [1 ]
Mira, A. [3 ,4 ]
机构
[1] Univ Svizzera Italians, Lugano, Switzerland
[2] Univ Cattolica Sacro Cuore, Milan, Italy
[3] Univ Svizzera Italiana, Inst Computat Sci, Data Sci Lab, Lugano, Switzerland
[4] Univ Insubria, Varese, Italy
基金
瑞士国家科学基金会;
关键词
Big spatial data; Computational issues; Spatial econometric models; Maximum Likelihood; Bayesian estimator; Spatial two stages least squares; Dense matrix; AUTOREGRESSIVE MODELS; MATRICES; ISOTROPY;
D O I
10.1016/j.regsciurbeco.2019.01.006
中图分类号
F [经济];
学科分类号
02 ;
摘要
Spatial econometrics is currently experiencing the Big Data revolution both in terms of the volume of data and the velocity with which they are accumulated. Regional data, employed traditionally in spatial econometric modeling, can be very large, with information that are increasingly available at a very fine resolution level such as census tracts, local markets, town blocks, regular grids or other small partitions of the territory. When dealing with spatial microeconometric models referred to the granular observations of the single economic agent, the number of observations available can be a lot higher. This paper reports the results of a systematic simulation study on the limits of the current methodologies when estimating spatial models with large datasets. In our study we simulate a Spatial Lag Model (SLM), we estimate it using Maximum Likelihood (ML), Two Stages Least Squares (2SLS) and Bayesian estimator (B), and we test their performances for different sample sizes and different levels of sparsity of the weight matrices. We considered three performance indicators, namely: computing time, storage required and accuracy of the estimators. The results show that using standard computer capabilities the analysis becomes prohibitive and unreliable when the sample size is greater than 70,000 even for low levels of sparsity. This result suggests that new approaches should be introduced to analyze the big datasets that are quickly becoming the new standard in spatial econometrics.
引用
收藏
页码:67 / 73
页数:7
相关论文
共 50 条
  • [21] Clustering Algorithms for Spatial Big Data
    Schoier, Gabriella
    Gregorio, Caterina
    COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2017, PT IV, 2017, 10407 : 571 - 583
  • [22] Spatial big data and anxieties of control
    Leszczynski, Agnieszka
    ENVIRONMENT AND PLANNING D-SOCIETY & SPACE, 2015, 33 (06): : 965 - 984
  • [23] On the Problem of Clustering Spatial Big Data
    Schoier, Gabriella
    Borruso, Giuseppe
    COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2015, PT III, 2015, 9157 : 688 - 697
  • [24] Interactive Visualization for Big Spatial Data
    Ghosh, Saheli
    SIGMOD '19: PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2019, : 1826 - 1828
  • [25] BIG DATA AND MULTIDIMENSIONAL ANALYSES WITH SPATIAL DATA
    Otrzasek, Jakub
    Mueller, Wojciech
    Koszela, Krzysztof
    Weres, Jerzy
    Przybyl, Jacek
    GEOCONFERENCE ON INFORMATICS, GEOINFORMATICS AND REMOTE SENSING, VOL I, 2014, : 65 - 71
  • [26] Managing Spatial Big Data on the Data LakeHouse
    Errami, Soukaina Ait
    Hajji, Hicham
    El Kadi, Kenza Ait
    Badir, Hassan
    EMERGING TRENDS IN INTELLIGENT SYSTEMS & NETWORK SECURITY, 2023, 147 : 323 - 331
  • [27] Spatial Data Mining: A Perspective of Big Data
    Wang, Shuliang
    Yuan, Hanning
    INTERNATIONAL JOURNAL OF DATA WAREHOUSING AND MINING, 2014, 10 (04) : 50 - 70
  • [28] Spatial Data Mining in the Context of Big Data
    Wang, Shuliang
    Yuan, Hanning
    2013 19TH IEEE INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS (ICPADS 2013), 2013, : 486 - 491
  • [29] Large Scale Analytics of Vector plus Raster Big Spatial Data
    Eldawy, Ahmed
    Niu, Lyuye
    Haynes, David
    Su, Zhiba
    25TH ACM SIGSPATIAL INTERNATIONAL CONFERENCE ON ADVANCES IN GEOGRAPHIC INFORMATION SYSTEMS (ACM SIGSPATIAL GIS 2017), 2017,
  • [30] Towards Parallel Spatial Query Processing for Big Spatial Data
    Zhong, Yunqin
    Han, Jizhong
    Zhang, Tieying
    Li, Zhenhua
    Fang, Jinyun
    Chen, Guihai
    2012 IEEE 26TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS & PHD FORUM (IPDPSW), 2012, : 2085 - 2094