Optimizing distance-based methods for large data sets

被引:6
|
作者
Scholl, Tobias [1 ]
Brenner, Thomas [1 ]
机构
[1] Philipps Univ Marburg, Lehrstuhl Wirtschaftsgeog & Standortforsch, D-35032 Marburg, Germany
关键词
Spatial concentration; Duranton-Overman index; Big data analysis; MAUP; Distance-based measures; GEOGRAPHIC CONCENTRATION; SERVICE INDUSTRIES;
D O I
10.1007/s10109-015-0219-1
中图分类号
P9 [自然地理学]; K9 [地理];
学科分类号
0705 ; 070501 ;
摘要
Distance-based methods for measuring spatial concentration of industries have received an increasing popularity in the spatial econometrics community. However, a limiting factor for using these methods is their computational complexity since both their memory requirements and running times are in . In this paper, we present an algorithm with constant memory requirements and shorter running time, enabling distance-based methods to deal with large data sets. We discuss three recent distance-based methods in spatial econometrics: the D&O-Index by Duranton and Overman (Rev Econ Stud 72(4):1077-1106, 2005), the M-function by Marcon and Puech (J Econ Geogr 10(5):745-762, 2010) and the Cluster-Index by Scholl and Brenner (Reg Stud (ahead-of-print):1-15, 2014). Finally, we present an alternative calculation for the latter index that allows the use of data sets with millions of firms.
引用
收藏
页码:333 / 351
页数:19
相关论文
共 50 条
  • [1] Optimizing distance-based methods for large data sets
    Tobias Scholl
    Thomas Brenner
    [J]. Journal of Geographical Systems, 2015, 17 : 333 - 351
  • [2] Distance-based classification methods
    Ekin, O
    Hammer, PL
    Kogan, A
    Winter, P
    [J]. INFOR, 1999, 37 (03) : 337 - 352
  • [3] Gene expression data classification: some distance-based methods
    Makinde, Olusola Samuel
    [J]. KUWAIT JOURNAL OF SCIENCE, 2019, 46 (03) : 31 - 39
  • [4] A NOTE ON THE SCOPE OF DISTANCE-BASED METHODS
    ELLIS, SP
    MORGENTHALER, S
    [J]. SANKHYA-THE INDIAN JOURNAL OF STATISTICS SERIES A, 1992, 54 : 464 - 471
  • [5] Methods for distance-based judgment aggregation
    Miller, Michael K.
    Osherson, Daniel
    [J]. SOCIAL CHOICE AND WELFARE, 2009, 32 (04) : 575 - 601
  • [6] Methods for distance-based judgment aggregation
    Michael K. Miller
    Daniel Osherson
    [J]. Social Choice and Welfare, 2009, 32
  • [7] Benchmarking distance-based partitioning methods for mixed-type data
    Efthymios Costa
    Ioanna Papatsouma
    Angelos Markos
    [J]. Advances in Data Analysis and Classification, 2023, 17 : 701 - 724
  • [8] Benchmarking distance-based partitioning methods for mixed-type data
    Costa, Efthymios
    Papatsouma, Ioanna
    Markos, Angelos
    [J]. ADVANCES IN DATA ANALYSIS AND CLASSIFICATION, 2023, 17 (03) : 701 - 724
  • [9] On optimizing distance-based similarity search for biological databases
    Mao, R
    Xu, WJ
    Ramakrishnan, S
    Nuckolls, G
    Miranker, DP
    [J]. 2005 IEEE COMPUTATIONAL SYSTEMS BIOINFORMATICS CONFERENCE, PROCEEDINGS, 2005, : 351 - 361
  • [10] Distance-based clustering of mixed data
    van de Velden, Michel
    D'Enza, Alfonso Iodice
    Markos, Angelos
    [J]. WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS, 2019, 11 (03)