Strategies for the Storage of Large LiDAR Datasets-A Performance Comparison

被引：3

作者：

Bejar-Martos, Juan A. ^{[1
]}

Rueda-Ruiz, Antonio J. ^{[1
]}

Ogayar-Anguita, Carlos J. ^{[1
]}

Segura-Sanchez, Rafael J. ^{[1
]}

Lopez-Ruiz, Alfonso ^{[1
]}

机构：

[1] Univ Jaen, Ctr Estudios Avanzados TIC, Jaen 23071, Spain

来源：

REMOTE SENSING | 2022年 / 14卷 / 11期

关键词：

LiDAR; point clouds; databases; NoSQL; GEOSPATIAL BIG DATA; POINT CLOUDS; OCTREE; ARCHITECTURE;

D O I：

10.3390/rs14112623

中图分类号：

X [环境科学、安全科学];

学科分类号：

08 ; 0830 ;

摘要：

The widespread use of LiDAR technologies has led to an ever-increasing volume of captured data that pose a continuous challenge for its storage and organization, so that it can be efficiently processed and analyzed. Although the use of system files in formats such as LAS/LAZ is the most common solution for LiDAR data storage, databases are gaining in popularity due to their evident advantages: centralized and uniform access to a collection of datasets; better support for concurrent retrieval; distributed storage in database engines that allows sharding; and support for metadata or spatial queries by adequately indexing or organizing the data. The present work evaluates the performance of four popular NoSQL and relational database management systems with large LiDAR datasets: Cassandra, MongoDB, MySQL and PostgreSQL. To perform a realistic assessment, we integrate these database engines in a repository implementation with an elaborate data model that enables metadata and spatial queries and progressive/partial data retrieval. Our experimentation concludes that, as expected, NoSQL databases show a modest but significant performance difference in favor of NoSQL databases, and that Cassandra provides the best overall database solution for LiDAR data.

引用

页数：15

共 50 条

[21] Comprehensive comparison of large-scale tissue expression datasets
Santos, Alberto
Tsafou, Kalliopi
Stolte, Christian
Pletscher-Frankild, Sune
O'Donoghue, Sean I.
Jensen, Lars Juhl
PEERJ, 2015, 3
[22] A comparison of spatial predictors when datasets could be very large
Bradley, Jonathan R.
Cressie, Noel
Shi, Tao
STATISTICS SURVEYS, 2016, 10 : 100 - 131
[23] Prediction of Canopy Heights over a Large Region Using Heterogeneous Lidar Datasets: Efficacy and Challenges
Gopalakrishnan, Ranjith
Thomas, Valerie A.
Coulston, John W.
Wynne, Randolph H.
REMOTE SENSING, 2015, 7 (09) : 11036 - 11060
[24] Research on Database Storage of Large-scale Terrestrial LIDAR Data
Guo Ming
Wang Yanmin
Zhao Youshan
Zhou Junzhao
2009 INTERNATIONAL FORUM ON COMPUTER SCIENCE-TECHNOLOGY AND APPLICATIONS, VOL 2, PROCEEDINGS, 2009, : 19 - +
[25] A Comparison of Two Strategies for Scaling Up Instance Selection in Huge Datasets
de Haro-Garcia, Aida
Perez-Rodriguez, Javier
Garcia-Pedrajas, Nicolas
ADVANCES IN ARTIFICIAL INTELLIGENCE, 2011, 7023 : 64 - 73
[26] A comparison of chain-of-thought reasoning strategies across datasets and models
Hebenstreit, Konstantin
Praas, Robert
Kiesewetter, Louis P.
Samwald, Matthias
PEERJ COMPUTER SCIENCE, 2024, 10
[27] A COST COMPARISON OF ALTERNATIVE BOOK STORAGE STRATEGIES
COOPER, MD
LIBRARY QUARTERLY, 1989, 59 (03): : 239 - 260
[28] A comparison of replication strategies for reliable decentralised storage
Oxford University Computing Laboratory, United Kingdom
不详
J. Netw., 2006, 6 (36-44):
[29] Using OVA modeling to improve classification performance for large datasets
Lutu, Patricia E. N.
Engelbrecht, Andries P.
EXPERT SYSTEMS WITH APPLICATIONS, 2012, 39 (04) : 4358 - 4376
[30] Optimized Storage and Fast Retrieval of Large Monitoring Datasets without Compromising Granularity
Cabaniols, Sebastien
Viollet, Nathalie
Poulain, Clement
2015 IEEE INTERNATIONAL CONFERENCE ON AUTONOMIC COMPUTING, 2015, : 135 - 136

← 1 2 3 4 5 →