Scientific Workflow Execution in the Cloud using a Dynamic Runtime Model

Johannes Erbel, Jens Grabowski

Abstract

To explain specific phenomena, scientists perform a sequence of tasks, e.g., to gather, analyze, and interpret data, forming a scientific workflow. Depending on the complexity of the workflow, scientists require access to various kinds of tools, applications and infrastructures for individual tasks. Current approaches are often limited to managing these resources at design time, requiring the scientist to preemptively set up applications essential for their workflow. Therefore, a dynamic provisioning and configuration of computing resources is required that fulfills these needs at runtime. In this paper, we present a dynamic runtime model that couples workflow tasks with their individual applications and infrastructure requirements. This runtime model is used as a knowledge base by a model-driven workflow execution engine orchestrating the sequence of tasks and their infrastructure. We exhibit that the simplicity of the runtime model supports the creation of highly tailored infrastructures, the integration of self developed applications, as well as a human-in-the-loop allowing scientists to monitor and interact with the workflow at runtime. To tackle the heterogeneity of cloud provider interfaces, we implement the workflow runtime model by extending the Open Cloud Computing Interface (OCCI) cloud standard which provides an extensible data model as well as a uniform interface to manage cloud resources. We demonstrate the applicability of our approach using three case studies and discuss the benefits of the runtime model from a user and system perspective.
Keywords: 
Runtime Model, Workflow, Cloud, OCCI
Document Type: 
Journal Articles
Language: 
English
Publisher: 
Springer
Journal: 
Software and Systems Modeling
Month: 
6
Year: 
2023
DOI: 
10.1007/s10270-023-01112-6
2024 © Software Engineering For Distributed Systems Group

Main menu 2