Programming languages for data-Intensive HPC applications: A systematic mapping study

被引:19
|
作者
Amaral, Vasco [1 ]
Norberto, Beatriz [1 ]
Goulao, Miguel [1 ]
Aldinucci, Marco [2 ]
Benkner, Siegfried [3 ]
Bracciali, Andrea [4 ]
Carreira, Paulo [5 ]
Celms, Edgars [6 ]
Correia, Luis [7 ]
Grelck, Clemens [8 ]
Karatza, Helen [9 ]
Kessler, Christoph [10 ]
Kilpatrick, Peter [11 ]
Martiniano, Hugo [7 ]
Mavridis, Ilias [9 ]
Pllana, Sabri [12 ]
Respicio, Ana [13 ]
Simao, Jose [14 ]
Veiga, Luis [5 ]
Visa, Ari [15 ]
机构
[1] Univ Nova Lisboa, Fac Ciencias & Tecnol, DI, NOVA LINCS, Lisbon, Portugal
[2] Univ Torino, Turin, Italy
[3] Univ Vienna, Vienna, Austria
[4] Univ Stirling, Stirling, Scotland
[5] Univ Lisbon, Inst Super Tecn, DEI, INESC ID, Lisbon, Portugal
[6] Univ Latvia, Inst Math & Comp Sci, Riga, Latvia
[7] Univ Lisbon, Fac Ciencias, BioISI, Lisbon, Portugal
[8] Univ Amsterdam, Amsterdam, Netherlands
[9] Aristotle Univ Thessaloniki, Thessaloniki, Greece
[10] Linkoping Univ, Linkoping, Sweden
[11] Queens Univ Belfast, Belfast, Antrim, North Ireland
[12] Linnaeus Univ, Vaxjo, Sweden
[13] Univ Lisbon, Fac Ciencias, LASIGE, Lisbon, Portugal
[14] Inst Politecn Lisboa, Inst Super Engn Lisboa, Lisbon, Portugal
[15] Tampere Univ, Tampere, Finland
关键词
High performance computing (HPC); Big data; Data-intensive applications; Programming languages; Domain-Specific language (DSL); General-Purpose language (GPL); Systematic mapping study (SMS); DOMAIN-SPECIFIC LANGUAGES; PARALLEL; ANALYTICS; MULTI; EFFICIENT; MODEL;
D O I
10.1016/j.parco.2019.102584
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
A major challenge in modelling and simulation is the need to combine expertise in both software technologies and a given scientific domain. When High-Performance Computing (HPC) is required to solve a scientific problem, software development becomes a problematic issue. Considering the complexity of the software for HPC, it is useful to identify programming languages that can be used to alleviate this issue. Because the existing literature on the topic of HPC is very dispersed, we performed a Systematic Mapping Study (SMS) in the context of the European COST Action cHiPSet. This literature study maps characteristics of various programming languages for data-intensive HPC applications, including category, typical user profiles, effectiveness, and type of articles. We organised the SMS in two phases. In the first phase, relevant articles are identified employing an automated keyword-based search in eight digital libraries. This lead to an initial sample of 420 papers, which was then narrowed down in a second phase by human inspection of article abstracts, titles and keywords to 152 relevant articles published in the period 2006-2018. The analysis of these articles enabled us to identify 26 programming languages referred to in 33 of relevant articles. We compared the outcome of the mapping study with results of our questionnaire-based survey that involved 57 HPC experts. The mapping study and the survey revealed that the desired features of programming languages for data-intensive HPC applications are portability, performance and usability. Furthermore, we observed that the majority of the programming languages used in the context of data-intensive HPC applications are text-based general-purpose programming languages. Typically these have a steep learning curve, which makes them difficult to adopt. We believe that the outcome of this study will inspire future research and development in programming languages for data-intensive HPC applications. (C) 2019 Elsevier B.V. All rights reserved.
引用
收藏
页数:17
相关论文
共 50 条
  • [1] A Model for Programming Data-Intensive Applications on FPGAs: A Genomics Case Study
    Brossard, Elliott
    Richmond, Dustin
    Green, Joshua
    Ebeling, Carl
    Ruzzo, Larry
    Olson, Corey
    Hauck, Scott
    2012 SYMPOSIUM ON APPLICATION ACCELERATORS IN HIGH PERFORMANCE COMPUTING (SAAHPC), 2012, : 84 - 93
  • [2] Scalable Programming and Algorithms for Data-Intensive Life Science Applications
    Qiu, Judy
    OMICS-A JOURNAL OF INTEGRATIVE BIOLOGY, 2011, 15 (04) : 235 - 237
  • [3] Storage QoS provisioning for execution programming of data-intensive applications
    Slota, Renata
    SCIENTIFIC PROGRAMMING, 2012, 20 (01) : 69 - 80
  • [4] DESIGNING AND PROTOTYPING DATA-INTENSIVE APPLICATIONS IN THE LOGRES AND ALGRES PROGRAMMING ENVIRONMENT
    CACACE, F
    CERI, S
    TANCA, L
    CRESPIREGHIZZI, S
    IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 1992, 18 (06) : 534 - 546
  • [5] Platform-independent programming of data-intensive applications using UML
    Falda, Grzegorz
    Habela, Piotr
    Kaczmarski, Krzysztof
    Stencel, Krzysztof
    Subieta, Kazimierz
    BALANCING AGILITY AND FORMALISM IN SOFTWARE ENGINEERING, 2008, 5082 : 103 - +
  • [6] Data-Intensive HPC Tasks Scheduling with SDN to Enable HPC-as-a-Service
    Jamalian, Saba
    Rajaei, Hassan
    2015 IEEE 8TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING, 2015, : 596 - 603
  • [7] Applications in Data-Intensive Computing
    Shah, Anuj R.
    Adkins, Joshua N.
    Baxter, Douglas J.
    Cannon, William R.
    Chavarria-Miranda, Daniel G.
    Choudhury, Sutanay
    Gorton, Ian
    Gracio, Deborah K.
    Halter, Todd D.
    Jaitly, Navdeep D.
    Johnson, John R.
    Kouzes, Richard T.
    Macduff, Matthew C.
    Marquez, Andres
    Monroe, Matthew E.
    Oehmen, Christopher S.
    Pike, William A.
    Scherrer, Chad
    Villa, Oreste
    Webb-Robertson, Bobbie-Jo
    Whitney, Paul D.
    Zuljevic, Nino
    ADVANCES IN COMPUTERS, VOL 79, 2010, 79 : 1 - 70
  • [8] Metacomputing and data-intensive applications
    Messina, P
    WORLDWIDE COMPUTING AND ITS APPLICATIONS, 1997, 1274 : 226 - 236
  • [9] Next Generation HPC Clouds: A View for Large-Scale Scientific and Data-Intensive Applications
    Petcu, Dana
    Gonzalez-Velez, Horacio
    Nicolae, Bogdan
    Garcia-Gomez, Juan Miguel
    Fuster-Garcia, Elies
    Sheridan, Craig
    EURO-PAR 2014: PARALLEL PROCESSING WORKSHOPS, PT II, 2014, 8806 : 26 - 37
  • [10] SunwayMR: A distributed parallel computing framework with convenient data-intensive applications programming
    Wu, Renke
    Huang, Linpeng
    Yu, Peng
    Zhou, Haojie
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2017, 71 : 43 - 56