Optimizing distance-based methods for large data sets

Cited by: 6
Authors
Scholl, Tobias [1 ]
Brenner, Thomas [1 ]
Affiliations
[1] Philipps Univ Marburg, Lehrstuhl Wirtschaftsgeog & Standortforsch, D-35032 Marburg, Germany
Keywords
Spatial concentration; Duranton-Overman index; Big data analysis; MAUP; Distance-based measures; GEOGRAPHIC CONCENTRATION; SERVICE INDUSTRIES;
DOI
10.1007/s10109-015-0219-1
Chinese Library Classification
P9 [Physical Geography]; K9 [Geography];
Discipline classification codes
0705 ; 070501 ;
Abstract
Distance-based methods for measuring the spatial concentration of industries have become increasingly popular in the spatial econometrics community. However, a limiting factor for using these methods is their computational complexity, since both their memory requirements and running times are in O(n^2). In this paper, we present an algorithm with constant memory requirements and shorter running time, enabling distance-based methods to deal with large data sets. We discuss three recent distance-based methods in spatial econometrics: the D&O-Index by Duranton and Overman (Rev Econ Stud 72(4):1077-1106, 2005), the M-function by Marcon and Puech (J Econ Geogr 10(5):745-762, 2010) and the Cluster-Index by Scholl and Brenner (Reg Stud (ahead-of-print):1-15, 2014). Finally, we present an alternative calculation for the latter index that allows the use of data sets with millions of firms.
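To illustrate why memory need not grow with the number of pairwise distances, the following is a minimal sketch and not the authors' algorithm. It assumes planar coordinates and Euclidean distance, and it streams every pair into a fixed number of histogram bins instead of storing an n-by-n distance matrix. The function name `distance_histogram` and the parameters `bin_width` and `max_distance` are hypothetical, chosen only for this example. The running time of this sketch is still quadratic in the number of points; only the extra memory is bounded by the number of bins.

```python
import math


def distance_histogram(points, bin_width, max_distance):
    """Count pairwise distances per bin without storing the distances.

    points: list of (x, y) tuples (assumed planar coordinates).
    bin_width, max_distance: hypothetical parameters for this sketch.
    Extra memory depends only on the number of bins, not on len(points).
    """
    n_bins = int(math.ceil(max_distance / bin_width))
    counts = [0] * n_bins
    for i in range(len(points)):
        xi, yi = points[i]
        # Visit each unordered pair exactly once.
        for j in range(i + 1, len(points)):
            xj, yj = points[j]
            d = math.hypot(xi - xj, yi - yj)
            if d < max_distance:
                # Clamp guards against floating-point edge cases.
                k = min(int(d // bin_width), n_bins - 1)
                counts[k] += 1
    return counts


# Three points give three pairs, with distances 5, 1 and about 4.24.
example = distance_histogram([(0, 0), (3, 4), (0, 1)], 1.0, 10.0)
```

A binned count of this kind can then feed a density estimate over distances, at the cost of losing precision below the chosen bin width.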
Pages: 333-351
Number of pages: 19
Related papers
  • [31] Mixtures of distance-based models for ranking data
    Murphy, TB
    Martin, D
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2003, 41 (3-4) : 645 - 655
  • [32] Distance-Based Opportunistic Mobile Data Offloading
    Lu, Xiaofeng
    Lio, Pietro
    Hui, Pan
    SENSORS, 2016, 16 (06)
  • [33] Distance-based consensus methods: a goal programming approach
    González-Pachón, J
    Romero, C
    OMEGA-INTERNATIONAL JOURNAL OF MANAGEMENT SCIENCE, 1999, 27 (03): : 341 - 347
  • [34] Distance-Based Fuzzy-Rough Sets and Their Application to the Classification Problem
    Kumar, Amrit
    Chatterjee, Niladri
    ROUGH SETS, PT I, IJCRS 2024, 2024, 14839 : 134 - 156
  • [35] Normalized Distance-Based Entropy Measures for Multiaspect Fuzzy Soft Sets
    Sulaiman, Nor Hashimah
    Mohamad, Daud
    4TH INTERNATIONAL CONFERENCE ON MATHEMATICAL SCIENCES (ICMS4): MATHEMATICAL SCIENCES: CHAMPIONING THE WAY IN A PROBLEM BASED AND DATA DRIVEN SOCIETY, 2017, 1830
  • [36] Data Preprocessing for Distance-based Unsupervised Intrusion Detection
    Said, Dina
    Stirling, Lisa
    Federolf, Peter
    Barker, Ken
    2011 NINTH ANNUAL INTERNATIONAL CONFERENCE ON PRIVACY, SECURITY AND TRUST, 2011, : 181 - 188
  • [37] Mixtures of Weighted Distance-Based Models for Ranking Data
    Lee, Paul H.
    Yu, Philip L. H.
    COMPSTAT'2010: 19TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL STATISTICS, 2010, : 517 - 524
  • [38] Distance-based kernels for real-valued data
    Belanche, Lluis
    Vazquez, Jean Luis
    Vazquez, Miguel
    DATA ANALYSIS, MACHINE LEARNING AND APPLICATIONS, 2008, : 3 - +
  • [40] Optimizing the distribution of large data sets in theory and practice
    Rauch, F
    Kurmann, C
    Stricker, TM
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2002, 14 (03): : 165 - 181