Parallel co-location mining with MapReduce and NoSQL systems

被引:15
|
作者
Yoo, Jin Soung [1 ]
Boulware, Douglas [2 ]
Kimmey, David [1 ]
机构
[1] Purdue Univ Ft Wayne, Dept Comp Sci, Ft Wayne, IN 46805 USA
[2] Air Force Res Lab, Rome Res Site, New York, NY USA
关键词
Spatial data mining; Parallel co-location mining; Cloud computing; MapReduce; NoSQL; COLOCATION PATTERNS; DATA SETS; FRAMEWORK;
D O I
10.1007/s10115-019-01381-y
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the rapid growth of georeferenced data, large-scale data processing and analysis methods are needed for spatial big data. Spatial co-location pattern mining is an interesting and important issue in spatial data mining area which discovers the subsets of features whose objects are frequently located together in geographic proximity. There are several works for efficiently processing co-location pattern discovery; however, they may be insufficient for large dense spatial data because the mining task takes up a lot of processing time and memory. In this work, we leveraged the power of a modern distributed computing platform, Hadoop, and developed an algorithm (called ParColoc) for parallel co-location mining on the MapReduce framework. This study explored challenge issues in designing the parallel co-location mining algorithm and solved them with adopting a spatial declusteirng technique and a NoSQL system. We conducted an experimental evaluation with real-world data and synthetic data to examine the effectiveness of proposed methods. The experiment result shows that ParColoc is a promising method for parallel co-location mining in cloud computing environment.
引用
收藏
页码:1433 / 1463
页数:31
相关论文
共 50 条
  • [1] Parallel co-location mining with MapReduce and NoSQL systems
    Jin Soung Yoo
    Douglas Boulware
    David Kimmey
    Knowledge and Information Systems, 2020, 62 : 1433 - 1463
  • [2] A Parallel Spatial Co-location Mining Algorithm Based on MapReduce
    Yoo, Jin Soung
    Boulware, Douglas
    Kimmey, David
    2014 IEEE INTERNATIONAL CONGRESS ON BIG DATA (BIGDATA CONGRESS), 2014, : 25 - 31
  • [3] A framework of Spatial Co-location Mining on MapReduce
    Yoo, Jin Soung
    Boulware, Douglas
    2013 IEEE INTERNATIONAL CONFERENCE ON BIG DATA, 2013,
  • [4] Parallel approach to incremental co-location pattern mining
    Andrzejewski, Witold
    Boinski, Pawel
    INFORMATION SCIENCES, 2019, 496 : 485 - 505
  • [5] Co-location, co-location, co-location
    Watkins, Peter
    Structural Engineer, 2020, 98 (11): : 59 - 61
  • [6] A MapReduce approach for spatial co-location pattern mining via ordered-clique-growth
    Yang, Peizhong
    Wang, Lizhen
    Wang, Xiaoxuan
    DISTRIBUTED AND PARALLEL DATABASES, 2020, 38 (02) : 531 - 560
  • [7] A MapReduce approach for spatial co-location pattern mining via ordered-clique-growth
    Peizhong Yang
    Lizhen Wang
    Xiaoxuan Wang
    Distributed and Parallel Databases, 2020, 38 : 531 - 560
  • [8] A Combined Co-location Pattern Mining Approach for Post-Analyzing Co-location Patterns
    Fang, Yuan
    Wang, Lizhen
    Lu, Junli
    Zhou, Lihua
    PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE: TECHNOLOGIES AND APPLICATIONS, 2016, 127
  • [9] Mining regional co-location patterns with kNNG
    Qian, Feng
    Chiew, Kevin
    He, Qinming
    Huang, Hao
    JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2014, 42 (03) : 485 - 505
  • [10] Mining spatial dynamic co-location patterns
    Duan, Jiangli
    Wang, Lizhen
    Hu, Xin
    Chen, Hongmei
    FILOMAT, 2018, 32 (05) : 1491 - 1497