Towards building a data-intensive index for big data computing - A case study of Remote Sensing data processing

被引:50
|
作者
Ma, Yan [1 ]
Wang, Lizhe [1 ]
Liu, Peng [1 ]
Ranjan, Rajiv
机构
[1] Chinese Acad Sci, Inst Remote Sensing & Digital Earth, Beijing 100864, Peoples R China
基金
国家高技术研究发展计划(863计划); 中国国家自然科学基金;
关键词
Big data; Parallel computing; Data-intensive computing; Remote Sensing data processing; SYSTEM;
D O I
10.1016/j.ins.2014.10.006
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the recent advances in Remote Sensing (RS) techniques, continuous Earth Observation is generating tremendous volume of RS data. The proliferation of RS data is revolutionizing the way in which RS data are processed and understood. Data with higher dimensionality, as well as the increasing requirement for real-time processing capabilities, have also given rise to the challenging issue of "Data-Intensive (DI) Computing". However, how to properly identify and qualify the DI issue remains a significant problem that is worth exploring. DI computing is a complex issue. While the huge data volume may be one of the reasons for this, some other factors could also be important. In this paper, we propose an empirical model (DIRS) of DI index to estimate RS applications. DIRS here is a novel empirical model (DIRS) that could quantify the DI issues in RS data processing with a normalized DI index. Through experimental analysis of the typical algorithms across the whole RS data processing flow, we identify the key factors that affect the DI issues mostly. Finally, combined with the empirical knowledge of domain experts, we formulate DIRS model to describe the correlations between the key factors and DI index. By virtue of experimental validation on more selected RS applications, we have found that DIRS model is an easy but promising approach. (C) 2014 Elsevier Inc. All rights reserved.
引用
收藏
页码:171 / 188
页数:18
相关论文
共 50 条
  • [21] Improvement Of Data Throughput In Data-Intensive Cloud Computing Applications
    Ibrahim, Ibrahim Adel
    Bassiouni, Mostafa
    2019 IEEE FIFTH INTERNATIONAL CONFERENCE ON BIG DATA COMPUTING SERVICE AND APPLICATIONS (IEEE BIGDATASERVICE 2019), 2019, : 49 - 54
  • [22] In-Memory Data Rearrangement for Irregular, Data-Intensive Computing
    Lloyd, Scott
    Gokhale, Maya
    COMPUTER, 2015, 48 (08) : 18 - 25
  • [23] Data Allocation with Neural Similarity Estimation for Data-Intensive Computing
    Vamosi, Ralf
    Schikuta, Erich
    COMPUTATIONAL SCIENCE - ICCS 2022, PT III, 2022, 13352 : 534 - 546
  • [24] Towards Building a Distributed Data Management Architecture to Integrate Multi-sources Remote Sensing Big Data
    Huang, Xiaohui
    Wang, Lizhe
    Yan, Jining
    Deng, Ze
    Wang, Shaoyuan
    Ma, Yan
    IEEE 20TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS / IEEE 16TH INTERNATIONAL CONFERENCE ON SMART CITY / IEEE 4TH INTERNATIONAL CONFERENCE ON DATA SCIENCE AND SYSTEMS (HPCC/SMARTCITY/DSS), 2018, : 83 - 90
  • [25] Data-Intensive Text Processing with MapReduce
    Xu, Peng
    COMPUTATIONAL LINGUISTICS, 2011, 37 (03) : 635 - 637
  • [26] PROCESSING BIG REMOTE SENSING DATA FOR FAST FLOOD DETECTION IN A DISTRIBUTED COMPUTING ENVIRONMENT
    Olasz, A.
    Kristof, D.
    Nguyen Thai, B.
    Belenyesi, M.
    Giachetta, R.
    FOSS4G-EUROPE 2017 - ACADEMIC TRACK, 2017, 42-4 (W2):
  • [27] pipsCloud: High performance cloud computing for remote sensing big data management and processing
    Wang, Lizhe
    Ma, Yan
    Yan, Jining
    Chang, Victor
    Zomaya, Albert Y.
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2018, 78 : 353 - 368
  • [28] INVITED: Enabling Practical Processing in and near Memory for Data-Intensive Computing
    Mutlu, Onur
    Ghose, Saugata
    Gomez-Luna, Juan
    Ausavarungnirun, Rachata
    PROCEEDINGS OF THE 2019 56TH ACM/EDAC/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2019,
  • [29] A brief survey on big data: technologies, terminologies and data-intensive applications
    Abdalla, Hemn Barzan
    JOURNAL OF BIG DATA, 2022, 9 (01)
  • [30] An Analysis of Software Parallelism in Big Data Technologies for Data-Intensive Architectures
    Cerezo, Felipe
    Cuesta, Carlos E.
    Vela, Belen
    SOFTWARE ARCHITECTURE, ECSA 2021, 2021, 12857 : 181 - 188