Preprint
Managing Workflows on top of a Cloud Computing Orchestrator for using heterogeneous environments on e-Science
Registro en:
CARRIÓN, Abel et al. Managing Workflows on top of a Cloud Computing Orchestrator for using heterogeneous environments on e-Science. International Journal of Web and Grid Services, v. 13, n. 4, p. 1-38, Mar. 2018.
Autor
Carrión, Abel
Caballer, Miguel
Blanquer, Ignacio
Kotowski, Nelson
Jardim, Rodrigo
Dàvila, Alberto Martin Rivera
Resumen
Versão preprint do autor: O autor pode arquivar a versão preprint (i.e. antes do peer-review) Scientific Workflows (SWFs) are widely used to model processes in e-Science. SWFs are executed by means of Workflow Management Systems (WMSs), which orchestrate the workload on top of computing infrastructures. The advent of cloud computing infrastructures has opened the door of using ondemand infrastructures to complement or even replace local infrastructures. However, new issues have arisen, such as the integration of hybrid resources or the compromise between infrastructure reutilization and elasticity. In this article we present an ad-hoc solution for managing workflows exploiting the capabilities of cloud orchestrators to deploy resources on demand according to the workload and to combine heterogeneous cloud providers (such as on-premise clouds and public clouds) and traditional infrastructures (clusters) to minimize costs and response time. The work does not propose yet another WMS, but demonstrates the benefits of the integration of cloud orchestration when running complex workflows. The article shows several configuration experiments from a realistic comparative genomics workflow called Orthosearch, to migrate memory-intensive workload to public infrastructures while keeping other blocks of the experiment running locally. The article computes running time and cost suggesting best practices.