Towards building a data-intensive index for big data computing - A case study of Remote Sensing data processing

被引:50
|
作者
Ma, Yan [1 ]
Wang, Lizhe [1 ]
Liu, Peng [1 ]
Ranjan, Rajiv
机构
[1] Chinese Acad Sci, Inst Remote Sensing & Digital Earth, Beijing 100864, Peoples R China
基金
国家高技术研究发展计划(863计划); 中国国家自然科学基金;
关键词
Big data; Parallel computing; Data-intensive computing; Remote Sensing data processing; SYSTEM;
D O I
10.1016/j.ins.2014.10.006
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the recent advances in Remote Sensing (RS) techniques, continuous Earth Observation is generating tremendous volume of RS data. The proliferation of RS data is revolutionizing the way in which RS data are processed and understood. Data with higher dimensionality, as well as the increasing requirement for real-time processing capabilities, have also given rise to the challenging issue of "Data-Intensive (DI) Computing". However, how to properly identify and qualify the DI issue remains a significant problem that is worth exploring. DI computing is a complex issue. While the huge data volume may be one of the reasons for this, some other factors could also be important. In this paper, we propose an empirical model (DIRS) of DI index to estimate RS applications. DIRS here is a novel empirical model (DIRS) that could quantify the DI issues in RS data processing with a normalized DI index. Through experimental analysis of the typical algorithms across the whole RS data processing flow, we identify the key factors that affect the DI issues mostly. Finally, combined with the empirical knowledge of domain experts, we formulate DIRS model to describe the correlations between the key factors and DI index. By virtue of experimental validation on more selected RS applications, we have found that DIRS model is an easy but promising approach. (C) 2014 Elsevier Inc. All rights reserved.
引用
收藏
页码:171 / 188
页数:18
相关论文
共 50 条
  • [41] Parallel Framework for Data-Intensive Computing with XSEDE
    Subramanian, Ranjini
    Zhang, Hui
    PEARC '19: PROCEEDINGS OF THE PRACTICE AND EXPERIENCE IN ADVANCED RESEARCH COMPUTING ON RISE OF THE MACHINES (LEARNING), 2019,
  • [42] Coordinating Green Clouds as Data-Intensive Computing
    Biran, Yahav
    Collins, George
    Liberatore, Joseph
    PROCEEDINGS 2016 EIGHTH ANNUAL IEEE GREEN TECHNOLOGIES CONFERENCE (GREENTECH 2016), 2016, : 130 - 135
  • [43] Real-Time Data-Intensive Computing
    Parkinson, Dilworth Y.
    Beattie, Keith
    Chen, Xian
    Correa, Joaquin
    Dart, Eli
    Daurer, Benedikt J.
    Deslippe, Jack R.
    Hexemer, Alexander
    Krishnan, Harinarayan
    MacDowell, Alastair A.
    Maia, Filipe R. N. C.
    Marchesini, Stefano
    Padmore, Howard A.
    Patton, Simon J.
    Perciano, Talita
    Sethian, James A.
    Shapiro, David
    Stromsness, Rune
    Tamura, Nobumichi
    Tierney, Brian L.
    Tull, Craig E.
    Ushizima, Daniela
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON SYNCHROTRON RADIATION INSTRUMENTATION (SRI2015), 2016, 1741
  • [44] A Resistive TCAM Accelerator for Data-Intensive Computing
    Guo, Qing
    Guo, Xiaochen
    Bai, Yuxin
    Ipek, Engin
    PROCEEDINGS OF THE 2011 44TH ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE (MICRO 44), 2011, : 339 - 350
  • [45] Distributed Data Access/Find System with Metadata for Data-Intensive Computing
    Ikebe, Minoru
    Inomata, Atsuo
    Fujikawa, Kazutoshi
    Sunahara, Hideki
    2008 9TH IEEE/ACM INTERNATIONAL CONFERENCE ON GRID COMPUTING, 2008, : 361 - 366
  • [46] Load-balanced data layout approach in data-intensive computing
    Song, J. (songjie@mail.neu.edu.cn), 1600, Beijing University of Posts and Telecommunications (36):
  • [47] PARROT: AN APPLICATION ENVIRONMENT FOR DATA-INTENSIVE COMPUTING
    Thain, Douglas
    Livny, Miron
    SCALABLE COMPUTING-PRACTICE AND EXPERIENCE, 2005, 6 (03): : 9 - 18
  • [48] Distributed Data Provenance for Large-Scale Data-Intensive Computing
    Zhao, Dongfang
    Shou, Chen
    Malik, Tanu
    Raicu, Ioan
    2013 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER), 2013,
  • [49] A Spark-Based Big Data Platform for Massive Remote Sensing Data Processing
    Sun, Zhongyi
    Chen, Fengke
    Chi, Mingmin
    Zhu, Yangyong
    DATA SCIENCE, 2015, 9208 : 120 - 126
  • [50] Distributed Deep Learning for Big Remote Sensing Data Processing on Apache Spark: Geological Remote Sensing Interpretation as a Case Study
    Long, Ao
    Han, Wei
    Huang, Xiaohui
    Li, Jiabao
    Wang, Yuewei
    Chen, Jia
    WEB AND BIG DATA, PT I, APWEB-WAIM 2023, 2024, 14331 : 96 - 110