Repository logo
Research Data
Publications
Projects
Persons
Organizations
English
Français
Log In(current)
  1. Home
  2. Publications
  3. Article de recherche (journal article)
  4. Applying big data paradigms to a large scale scientific workflow: Lessons learned and future directions

Applying big data paradigms to a large scale scientific workflow: Lessons learned and future directions

Author(s)
Kropf, Peter  
Institut d'informatique  
Lapin, Andrei  
Faculté des sciences  
Carretero, Jesus
Caíno-Lores, Silvina
Date issued
June 1, 2020
In
Future Gener. Comput. Syst.
Vol
110
From page
440
To page
452
Reviewed by peer
1
Subjects
Scientific workflows Big data Cloud computing Apache spark Hydrology
Abstract
The increasing amounts of data related to the execution of scientific workflows has raised awareness
of their shift towards parallel data-intensive problems. In this paper, we deliver our experience combining
the traditional high-performance computing and grid-based approaches with Big Data analytics
paradigms, in the context of scientific ensemble workflows. Our goal was to assess and discuss the
suitability of such data-oriented mechanisms for production-ready workflows, especially in terms of
scalability. We focused on two key elements in the Big Data ecosystem: the data-centric programming
model, and the underlying infrastructure that integrates storage and computation in each node. We
experimented with a representative MPI-based iterative workflow from the hydrology domain, EnKFHGS,
which we re-implemented using the Spark data analysis framework. We conducted experiments on
a local cluster, a private cloud running OpenNebula, and the Amazon Elastic Compute Cloud (AmazonEC2).
The results we obtained were analysed to synthesize the lessons we learned from this experience, while
discussing promising directions for further research.
Publication type
journal article
Identifiers
https://libra.unine.ch/handle/20.500.14713/63436
DOI
10.1016/j.future.2018.04.014
File(s)
Loading...
Thumbnail Image
Download
Name

2020-06-15_258_2814.pdf

Type

Main Article

Size

23.56 KB

Format

Adobe PDF

Université de Neuchâtel logo

Service information scientifique & bibliothèques

Rue Emile-Argand 11

2000 Neuchâtel

contact.libra@unine.ch

Service informatique et télématique

Rue Emile-Argand 11

Bâtiment B, rez-de-chaussée

Powered by DSpace-CRIS

libra v2.1.0

© 2026 Université de Neuchâtel

Portal overviewUser guideOpen Access strategyOpen Access directive Research at UniNE Open Access ORCIDWhat's new