Towards building a data-intensive index for big data computing - A case study of Remote Sensing data processing

被引:50
|
作者
Ma, Yan [1 ]
Wang, Lizhe [1 ]
Liu, Peng [1 ]
Ranjan, Rajiv
机构
[1] Chinese Acad Sci, Inst Remote Sensing & Digital Earth, Beijing 100864, Peoples R China
基金
国家高技术研究发展计划(863计划); 中国国家自然科学基金;
关键词
Big data; Parallel computing; Data-intensive computing; Remote Sensing data processing; SYSTEM;
D O I
10.1016/j.ins.2014.10.006
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the recent advances in Remote Sensing (RS) techniques, continuous Earth Observation is generating tremendous volume of RS data. The proliferation of RS data is revolutionizing the way in which RS data are processed and understood. Data with higher dimensionality, as well as the increasing requirement for real-time processing capabilities, have also given rise to the challenging issue of "Data-Intensive (DI) Computing". However, how to properly identify and qualify the DI issue remains a significant problem that is worth exploring. DI computing is a complex issue. While the huge data volume may be one of the reasons for this, some other factors could also be important. In this paper, we propose an empirical model (DIRS) of DI index to estimate RS applications. DIRS here is a novel empirical model (DIRS) that could quantify the DI issues in RS data processing with a normalized DI index. Through experimental analysis of the typical algorithms across the whole RS data processing flow, we identify the key factors that affect the DI issues mostly. Finally, combined with the empirical knowledge of domain experts, we formulate DIRS model to describe the correlations between the key factors and DI index. By virtue of experimental validation on more selected RS applications, we have found that DIRS model is an easy but promising approach. (C) 2014 Elsevier Inc. All rights reserved.
引用
收藏
页码:171 / 188
页数:18
相关论文
共 50 条
  • [31] Data-intensive applications, challenges, techniques and technologies: A survey on Big Data
    Chen, C. L. Philip
    Zhang, Chun-Yang
    INFORMATION SCIENCES, 2014, 275 : 314 - 347
  • [32] A brief survey on big data: technologies, terminologies and data-intensive applications
    Hemn Barzan Abdalla
    Journal of Big Data, 9
  • [33] Data-Intensive Scalable Computing for Scientific Applications
    Bryant, Randal E.
    COMPUTING IN SCIENCE & ENGINEERING, 2011, 13 (06) : 25 - 33
  • [34] Research on the architecture of data-intensive computing platform
    Hou, Ke
    Zhang, Jing
    Fang, Xing
    Journal of Software Engineering, 2015, 9 (03): : 686 - 701
  • [35] The Benefits of Service Choreography for Data-intensive Computing
    Barker, Adam
    Besana, Paolo
    Robertson, David
    Weissman, Jon B.
    CLADE09: 7TH INTERNATIONAL WORKSHOP ON CHALLENGES OF LARGE APPLICATIONS IN DISTRIBUTED ENVIRONMENTS, 2009, : 1 - 10
  • [36] Data Placement Strategies for Data-Intensive Computing over Edge Clouds
    Wei, Xinliang
    Rahman, A. B. M. Mohaimenur
    Wang, Yu
    2021 IEEE INTERNATIONAL PERFORMANCE, COMPUTING, AND COMMUNICATIONS CONFERENCE (IPCCC), 2021,
  • [37] A Framework for Data-Intensive Computing with Cloud Bursting
    Bicer, Tekin
    Chiu, David
    Agrawal, Gagan
    2011 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER), 2011, : 169 - 177
  • [38] Challenges and Opportunities for Data-Intensive Computing in the Cloud
    Jung, Eun-Sung
    Kettimuthu, Rajkumar
    COMPUTER, 2014, 47 (12) : 82 - 85
  • [39] Integrating Data-Intensive Computing Systems with Biological Data Analysis Frameworks
    Pedersen, Edvard
    Raknes, Inge Alexander
    Ernstsen, Martin
    Bongo, Lars Ailo
    23RD EUROMICRO INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED, AND NETWORK-BASED PROCESSING (PDP 2015), 2015, : 733 - 740
  • [40] Automated Debugging in Data-Intensive Scalable Computing
    Gulzar, Muhammad Ali
    Interlandi, Matteo
    Han, Xueyuan
    Li, Mingda
    Condie, Tyson
    Kim, Miryung
    PROCEEDINGS OF THE 2017 SYMPOSIUM ON CLOUD COMPUTING (SOCC '17), 2017, : 520 - 534