Introducing distributed dynamic data-intensive (D3) science: Understanding applications and infrastructure

被引:5
|
作者
Jha, Shantenu [1 ]
Katz, Daniel S. [2 ]
Luckow, Andre [1 ]
Hong, Neil Chue [3 ]
Rana, Omer [4 ]
Simmhan, Yogesh [5 ]
机构
[1] Rutgers State Univ, New Brunswick, NJ 08901 USA
[2] Univ Illinois, Champaign, IL USA
[3] Univ Edinburgh, Edinburgh, Midlothian, Scotland
[4] Cardiff Univ, Cardiff, S Glam, Wales
[5] Indian Inst Sci, Bengaluru, Karnataka, India
来源
基金
美国国家科学基金会;
关键词
dynamic; distributed; data intensive; scientific applications; PROJECT;
D O I
10.1002/cpe.4032
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
A common feature across many science and engineering applications is the amount and diversity of data and computation that must be integrated to yield insights. Datasets are growing larger and becoming distributed; their location, availability, and properties are often time-dependent. Collectively, these characteristics give rise to dynamic distributed data-intensive applications. While "static" data applications have received significant attention, the characteristics, requirements, and software systems for the analysis of large volumes of dynamic, distributed data, and data-intensive applications have received relatively less attention. This paper surveys several representative dynamic distributed data-intensive application scenarios, provides a common conceptual framework to understand them, and examines the infrastructure used in support of applications.
引用
收藏
页数:25
相关论文
共 50 条
  • [11] Open active services for data-intensive distributed applications
    Collet, C
    Vargas-Solar, G
    Grazziotin-Ribeiro, H
    2000 INTERNATIONAL DATABASE ENGINEERING AND APPLICATIONS SYMPOSIUM - PROCEEDINGS, 2000, : 349 - 359
  • [12] Supporting Load Balancing For Distributed Data-Intensive Applications
    Glimcher, Leonid
    Ravi, Vignesh T.
    Agrawal, Gagan
    16TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING (HIPC), PROCEEDINGS, 2009, : 235 - 244
  • [13] Distributed Scientific Workflow Management for Data-Intensive Applications
    Shumilov, S.
    Leng, Y.
    El-Gayyar, M.
    Cremers, A. B.
    12TH IEEE INTERNATIONAL WORKSHOP ON FUTURE TRENDS OF DISTRIBUTED COMPUTING SYSTEMS, PROCEEDINGS, 2008, : 65 - 73
  • [14] Open active services for data-intensive distributed applications
    Collet, Christine
    Vargas-Solar, Genoveva
    Grazziotin-Ribeiro, Helena
    Proceedings of the International Database Engineering and Applications Symposium, IDEAS, 2000, : 349 - 359
  • [15] A distributed shared buffer space for data-intensive applications
    Lachaize, R
    Hansen, JS
    2005 IEEE International Symposium on Cluster Computing and the Grid, Vols 1 and 2, 2005, : 913 - 920
  • [16] Distributed data structure templates for data-intensive remote sensing applications
    Ma, Yan
    Wang, Lizhe
    Liu, Dingsheng
    Yuan, Tao
    Liu, Peng
    Zhang, Wanfeng
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2013, 25 (12): : 1784 - 1797
  • [17] Modeling distributed events in data-intensive Rich Internet applications
    Carughi, Giovanni Toffetti
    Comai, Sara
    Bozzon, Alessandro
    Fraternali, Piero
    WEB INFORMATION SYSTEMS ENGINEERING - WISE 2007, PROCEEDINGS, 2007, 4831 : 593 - 602
  • [18] APPLICATIONS OF COMPUTATIONAL SCIENCE: DATA-INTENSIVE COMPUTING FOR STUDENT PROJECTS
    Howard, Jessica
    Padron, Omar
    Morreale, Patricia
    Joiner, David
    COMPUTING IN SCIENCE & ENGINEERING, 2012, 14 (02) : 84 - 89
  • [19] Scalable Programming and Algorithms for Data-Intensive Life Science Applications
    Qiu, Judy
    OMICS-A JOURNAL OF INTEGRATIVE BIOLOGY, 2011, 15 (04) : 235 - 237
  • [20] Matsearch: A Search Engine in Materials Science Distributed Data-Intensive Environment
    Li, Yang
    Hu, Chang-Jun
    Zhang, Ji-Lin
    JOURNAL OF INTERNET TECHNOLOGY, 2013, 14 (05): : 799 - 806