Parallel co-location mining with MapReduce and NoSQL systems

被引:15
|
作者
Yoo, Jin Soung [1 ]
Boulware, Douglas [2 ]
Kimmey, David [1 ]
机构
[1] Purdue Univ Ft Wayne, Dept Comp Sci, Ft Wayne, IN 46805 USA
[2] Air Force Res Lab, Rome Res Site, New York, NY USA
关键词
Spatial data mining; Parallel co-location mining; Cloud computing; MapReduce; NoSQL; COLOCATION PATTERNS; DATA SETS; FRAMEWORK;
D O I
10.1007/s10115-019-01381-y
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the rapid growth of georeferenced data, large-scale data processing and analysis methods are needed for spatial big data. Spatial co-location pattern mining is an interesting and important issue in spatial data mining area which discovers the subsets of features whose objects are frequently located together in geographic proximity. There are several works for efficiently processing co-location pattern discovery; however, they may be insufficient for large dense spatial data because the mining task takes up a lot of processing time and memory. In this work, we leveraged the power of a modern distributed computing platform, Hadoop, and developed an algorithm (called ParColoc) for parallel co-location mining on the MapReduce framework. This study explored challenge issues in designing the parallel co-location mining algorithm and solved them with adopting a spatial declusteirng technique and a NoSQL system. We conducted an experimental evaluation with real-world data and synthetic data to examine the effectiveness of proposed methods. The experiment result shows that ParColoc is a promising method for parallel co-location mining in cloud computing environment.
引用
收藏
页码:1433 / 1463
页数:31
相关论文
共 50 条
  • [21] Mining Co-Location Patterns of Hotels with the Q Statistic
    Yan, Zhiwei
    Tian, Jing
    Ren, Chang
    Xiong, Fuquan
    APPLIED SPATIAL ANALYSIS AND POLICY, 2018, 11 (03) : 623 - 639
  • [22] Efficiently Mining Co-Location Rules on Interval Data
    Wang, Lizhen
    Chen, Hongmei
    Zhao, Lihong
    Zhou, Lihua
    ADVANCED DATA MINING AND APPLICATIONS, ADMA 2010, PT I, 2010, 6440 : 477 - 488
  • [23] Mining Regional High Utility Co-location Pattern
    Xiong, Meiyu
    Chen, Hongmei
    Wang, Lizhen
    Xiao, Qing
    SPATIAL DATA AND INTELLIGENCE, SPATIALDI 2024, 2024, 14619 : 97 - 107
  • [24] Mining Spatial Co-location Patterns by the Fuzzy Technology
    Lei, Le
    Wang, Lizhen
    Wang, Xiaoxuan
    2019 10TH IEEE INTERNATIONAL CONFERENCE ON BIG KNOWLEDGE (ICBK 2019), 2019, : 119 - 126
  • [25] A FAST APPROACH FOR SPATIAL CO-LOCATION PATTERN MINING
    He, Fei
    Deng, Xuemin
    Fang, Jinyun
    2013 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2013, : 3654 - 3657
  • [26] Spatial Interestingness Measures for Co-location Pattern Mining
    Sengstock, Christian
    Gertz, Michael
    Van Canh, Tran
    12TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW 2012), 2012, : 821 - 826
  • [27] Mining Co-location Patterns in Incremental Spatial Databases
    Chang, Ye-In
    Wu, Chen-Chang
    Yen, Ching-Yi
    2022 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (IEEE BIGCOMP 2022), 2022, : 141 - 148
  • [28] Mining Statistically Significant Co-location and Segregation Patterns
    Barua, Sajib
    Sander, Joerg
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2014, 26 (05) : 1185 - 1199
  • [29] Mining Trajectory Hotspots Based on Co-location Patterns
    Yan R.
    Yin D.
    Gu Y.
    Data Analysis and Knowledge Discovery, 2023, 7 (07) : 58 - 73
  • [30] Parallel NoSQL Entity Resolution Approach with MapReduce
    Ma, Kun
    Yang, Bo
    2015 INTERNATIONAL CONFERENCE ON INTELLIGENT NETWORKING AND COLLABORATIVE SYSTEMS IEEE INCOS 2015, 2015, : 384 - 389