Estimation of spatial econometric linear models with large datasets: How big can spatial Big Data be?

被引:11
|
作者
Arbia, G. [1 ,2 ]
Ghiringhelli, C. [1 ]
Mira, A. [3 ,4 ]
机构
[1] Univ Svizzera Italians, Lugano, Switzerland
[2] Univ Cattolica Sacro Cuore, Milan, Italy
[3] Univ Svizzera Italiana, Inst Computat Sci, Data Sci Lab, Lugano, Switzerland
[4] Univ Insubria, Varese, Italy
基金
瑞士国家科学基金会;
关键词
Big spatial data; Computational issues; Spatial econometric models; Maximum Likelihood; Bayesian estimator; Spatial two stages least squares; Dense matrix; AUTOREGRESSIVE MODELS; MATRICES; ISOTROPY;
D O I
10.1016/j.regsciurbeco.2019.01.006
中图分类号
F [经济];
学科分类号
02 ;
摘要
Spatial econometrics is currently experiencing the Big Data revolution both in terms of the volume of data and the velocity with which they are accumulated. Regional data, employed traditionally in spatial econometric modeling, can be very large, with information that are increasingly available at a very fine resolution level such as census tracts, local markets, town blocks, regular grids or other small partitions of the territory. When dealing with spatial microeconometric models referred to the granular observations of the single economic agent, the number of observations available can be a lot higher. This paper reports the results of a systematic simulation study on the limits of the current methodologies when estimating spatial models with large datasets. In our study we simulate a Spatial Lag Model (SLM), we estimate it using Maximum Likelihood (ML), Two Stages Least Squares (2SLS) and Bayesian estimator (B), and we test their performances for different sample sizes and different levels of sparsity of the weight matrices. We considered three performance indicators, namely: computing time, storage required and accuracy of the estimators. The results show that using standard computer capabilities the analysis becomes prohibitive and unreliable when the sample size is greater than 70,000 even for low levels of sparsity. This result suggests that new approaches should be introduced to analyze the big datasets that are quickly becoming the new standard in spatial econometrics.
引用
收藏
页码:67 / 73
页数:7
相关论文
共 50 条
  • [41] Machine Learning Meets Big Spatial Data
    Sabek, Ibrahim
    Mokbel, Mohamed F.
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2019, 12 (12): : 1982 - 1985
  • [42] Fields as a Generic Data Type for Big Spatial Data
    Camara, Gilberto
    Egenhofer, Max J.
    Ferreira, Karine
    Andrade, Pedro
    Queiroz, Gilberto
    Sanchez, Alber
    Jones, Jim
    Vinhas, Lubia
    GEOGRAPHIC INFORMATION SCIENCE (GISCIENCE 2014), 2014, 8728 : 159 - 172
  • [43] ORANGE : Spatial big data analysis platform
    Cho, Sunghwan
    Hong, Sunghal
    Lee, Changsoo
    2016 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2016, : 3963 - 3965
  • [44] Scalable Spatial Queries in Big Data Systems
    Abdelhafeez, Laila
    2022 23RD IEEE INTERNATIONAL CONFERENCE ON MOBILE DATA MANAGEMENT (MDM 2022), 2022, : 328 - 330
  • [45] Big spatial vector data management: a review
    Yao, Xiaochuang
    Li, Guoqing
    BIG EARTH DATA, 2018, 2 (01) : 108 - 129
  • [46] Detecting Skewness of Big Spatial Data in SpatialHadoop
    Belussi, Alberto
    Migliorini, Sara
    Eldawy, Ahmed
    26TH ACM SIGSPATIAL INTERNATIONAL CONFERENCE ON ADVANCES IN GEOGRAPHIC INFORMATION SYSTEMS (ACM SIGSPATIAL GIS 2018), 2018, : 432 - 435
  • [47] A Performance Study of Big Spatial Data Systems
    Alam, Md Mahbub
    Ray, Suprio
    Bhavsar, Virendra C.
    BIGSPATIAL 2018: PROCEEDINGS OF THE 7TH ACM SIGSPATIAL INTERNATIONAL WORKSHOP ON ANALYTICS FOR BIG GEOSPATIAL DATA (BIGSPATIAL-2018), 2018, : 1 - 9
  • [48] High Performance Analysis of Big Spatial Data
    Haynes, David
    Ray, Suprio
    Manson, Steven M.
    Soni, Ankit
    PROCEEDINGS 2015 IEEE INTERNATIONAL CONFERENCE ON BIG DATA, 2015, : 1953 - 1957
  • [49] Big Spatial Data Processing With Apache Spark
    Boyi Shangguan
    Peng Yue
    Wu, Zhaoyan
    Jiang, Liangcun
    2017 6TH INTERNATIONAL CONFERENCE ON AGRO-GEOINFORMATICS, 2017, : 239 - 242
  • [50] Tools for the Storage and Analysis of Spatial Big Data
    Lisowski, Przemyslaw
    Piorkowski, Adam
    Lesniak, Andrzej
    10TH INTERNATIONAL CONFERENCE ENVIRONMENTAL ENGINEERING (10TH ICEE), 2017,