Strategies for the Storage of Large LiDAR Datasets-A Performance Comparison

被引:3
|
作者
Bejar-Martos, Juan A. [1 ]
Rueda-Ruiz, Antonio J. [1 ]
Ogayar-Anguita, Carlos J. [1 ]
Segura-Sanchez, Rafael J. [1 ]
Lopez-Ruiz, Alfonso [1 ]
机构
[1] Univ Jaen, Ctr Estudios Avanzados TIC, Jaen 23071, Spain
关键词
LiDAR; point clouds; databases; NoSQL; GEOSPATIAL BIG DATA; POINT CLOUDS; OCTREE; ARCHITECTURE;
D O I
10.3390/rs14112623
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
The widespread use of LiDAR technologies has led to an ever-increasing volume of captured data that pose a continuous challenge for its storage and organization, so that it can be efficiently processed and analyzed. Although the use of system files in formats such as LAS/LAZ is the most common solution for LiDAR data storage, databases are gaining in popularity due to their evident advantages: centralized and uniform access to a collection of datasets; better support for concurrent retrieval; distributed storage in database engines that allows sharding; and support for metadata or spatial queries by adequately indexing or organizing the data. The present work evaluates the performance of four popular NoSQL and relational database management systems with large LiDAR datasets: Cassandra, MongoDB, MySQL and PostgreSQL. To perform a realistic assessment, we integrate these database engines in a repository implementation with an elaborate data model that enables metadata and spatial queries and progressive/partial data retrieval. Our experimentation concludes that, as expected, NoSQL databases show a modest but significant performance difference in favor of NoSQL databases, and that Cassandra provides the best overall database solution for LiDAR data.
引用
收藏
页数:15
相关论文
共 50 条
  • [21] Comprehensive comparison of large-scale tissue expression datasets
    Santos, Alberto
    Tsafou, Kalliopi
    Stolte, Christian
    Pletscher-Frankild, Sune
    O'Donoghue, Sean I.
    Jensen, Lars Juhl
    PEERJ, 2015, 3
  • [22] A comparison of spatial predictors when datasets could be very large
    Bradley, Jonathan R.
    Cressie, Noel
    Shi, Tao
    STATISTICS SURVEYS, 2016, 10 : 100 - 131
  • [23] Prediction of Canopy Heights over a Large Region Using Heterogeneous Lidar Datasets: Efficacy and Challenges
    Gopalakrishnan, Ranjith
    Thomas, Valerie A.
    Coulston, John W.
    Wynne, Randolph H.
    REMOTE SENSING, 2015, 7 (09) : 11036 - 11060
  • [24] Research on Database Storage of Large-scale Terrestrial LIDAR Data
    Guo Ming
    Wang Yanmin
    Zhao Youshan
    Zhou Junzhao
    2009 INTERNATIONAL FORUM ON COMPUTER SCIENCE-TECHNOLOGY AND APPLICATIONS, VOL 2, PROCEEDINGS, 2009, : 19 - +
  • [25] A Comparison of Two Strategies for Scaling Up Instance Selection in Huge Datasets
    de Haro-Garcia, Aida
    Perez-Rodriguez, Javier
    Garcia-Pedrajas, Nicolas
    ADVANCES IN ARTIFICIAL INTELLIGENCE, 2011, 7023 : 64 - 73
  • [26] A comparison of chain-of-thought reasoning strategies across datasets and models
    Hebenstreit, Konstantin
    Praas, Robert
    Kiesewetter, Louis P.
    Samwald, Matthias
    PEERJ COMPUTER SCIENCE, 2024, 10
  • [27] A COST COMPARISON OF ALTERNATIVE BOOK STORAGE STRATEGIES
    COOPER, MD
    LIBRARY QUARTERLY, 1989, 59 (03): : 239 - 260
  • [28] A comparison of replication strategies for reliable decentralised storage
    Oxford University Computing Laboratory, United Kingdom
    不详
    J. Netw., 2006, 6 (36-44):
  • [29] Using OVA modeling to improve classification performance for large datasets
    Lutu, Patricia E. N.
    Engelbrecht, Andries P.
    EXPERT SYSTEMS WITH APPLICATIONS, 2012, 39 (04) : 4358 - 4376
  • [30] Optimized Storage and Fast Retrieval of Large Monitoring Datasets without Compromising Granularity
    Cabaniols, Sebastien
    Viollet, Nathalie
    Poulain, Clement
    2015 IEEE INTERNATIONAL CONFERENCE ON AUTONOMIC COMPUTING, 2015, : 135 - 136