A guide to creating an effective big data management framework

被引:1
|
作者
Arundel, S. T. [1 ]
Mckeehan, K. G. [1 ]
Campbell, B. B. [2 ]
Bulen, A. N. [2 ]
Thiem, P. T. [1 ]
机构
[1] US Geol Survey, Ctr Excellence Geospatial Informat Sci, 1400 Independence Rd, Rolla, MO 65401 USA
[2] US Geol Survey, Natl Geospatial Tech Operat Ctr, 1400 Independence Rd, Rolla, MO 65401 USA
关键词
ADOM; Data movement; Ingress; Egress; Rclone; INFORMATION-SYSTEMS; MAPREDUCE;
D O I
10.1186/s40537-023-00801-9
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Many agencies and organizations, such as the U.S. Geological Survey, handle massive geospatial datasets and their auxiliary data and are thus faced with challenges in storing data and ingesting it, transferring it between internal programs, and egressing it to external entities. As a result, these agencies and organizations may inadvertently devote unnecessary time and money to convey data without existing or outdated standards. This research aims to evaluate the components of data conveyance systems, such as transfer methods, tracking, and automation, to guide their improved performance. Specifically, organizations face the challenges of slow dispatch time and manual intervention when conveying data into, within, and from their systems. Conveyance often requires skilled workers when the system depends on physical media such as hard drives, particularly when terabyte transfers are required. In addition, incomplete or inconsistent metadata may necessitate manual intervention, process changes, or both. A proposed solution is organization-wide guidance for efficient data conveyance. That guidance involves systems analysis to outline a data management framework, which may include understanding the minimum requirements of data manifests, specification of transport mechanisms, and improving automation capabilities.
引用
收藏
页数:22
相关论文
共 50 条
  • [31] A practical guide to big data
    Smirnova, Ekaterina
    Ivanescu, Andrada
    Bai, Jiawei
    Crainiceanu, Ciprian M.
    STATISTICS & PROBABILITY LETTERS, 2018, 136 : 25 - 29
  • [32] A guide to the day of big data
    Michael Nielsen
    Nature, 2009, 462 (7274) : 722 - 723
  • [33] BIG DATA TECHNOLOGY FRAMEWORK AND DATA UTILIZATION FOR URBAN ENVIRONMENTAL POLLUTION MANAGEMENT
    Li, Nan
    Ma, Zheng
    3C TECNOLOGIA, 2023, 12 (01): : 204 - 218
  • [34] A Big Data based Decision Framework for Public Management and Service in Tourism
    Zhang, Chi
    Qiao, Xiangjie
    Chen, Xianfeng
    COMPANION OF THE 2020 IEEE 20TH INTERNATIONAL CONFERENCE ON SOFTWARE QUALITY, RELIABILITY, AND SECURITY (QRS-C 2020), 2020, : 550 - 555
  • [35] A novel framework for remote management of social media big data analytics
    Ahmad M. Al-Shomar
    Muhammad Al-Qurish
    Wajdi Aljedaani
    Social Network Analysis and Mining, 2022, 12
  • [36] Big Data Applications in Food Supply Chain Management: A Conceptual Framework
    Margaritis, Ioannis
    Madas, Michael
    Vlachopoulou, Maro
    SUSTAINABILITY, 2022, 14 (07)
  • [37] A Secure Big Data Stream Analytics Framework for Disaster Management on the Cloud
    Puthal, Deepak
    Nepal, Surya
    Ranjan, Rajiv
    Chen, Jinjun
    PROCEEDINGS OF 2016 IEEE 18TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS; IEEE 14TH INTERNATIONAL CONFERENCE ON SMART CITY; IEEE 2ND INTERNATIONAL CONFERENCE ON DATA SCIENCE AND SYSTEMS (HPCC/SMARTCITY/DSS), 2016, : 1218 - 1225
  • [38] IntegrityMR: Integrity Assurance Framework for Big Data Analytics and Management Applications
    Wang, Yongzhi
    Wei, Jinpeng
    Srivatsa, Mudhakar
    Duan, Yucong
    Du, Wencai
    2013 IEEE INTERNATIONAL CONFERENCE ON BIG DATA, 2013,
  • [39] A Cognitive Oriented Framework for IoT Big-data Management Prospective
    Mishra, Nilamadhab
    Lin, Chung-Chih
    Chang, Hsien-Tsung
    2014 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATION PROBLEM-SOLVING (ICCP), 2014, : 124 - 127
  • [40] A novel framework for remote management of social media big data analytics
    Al-Shomar, Ahmad M.
    Al-Qurish, Muhammad
    Aljedaani, Wajdi
    SOCIAL NETWORK ANALYSIS AND MINING, 2022, 12 (01)