Research Area:  Cloud Computing
Computing clouds have become the platform of choice for the deployment and execution of scientific workflows. Due to the uncertainty and unpredictability of scientific exploration, the execution plan for a scientific workflow may vary from the definition. It is therefore of great significance to be able to discover actual workflows from execution histories (event logs) to reproduce experimental results and to establish provenance. However, most existing process mining techniques focus on discovering control flow-oriented business processes in a centralized environment, and thus, they are mostly inapplicable to the discovery of data flow-oriented, unstructured scientific workflows in distributed cloud environments. In this paper, we present Scientific Workflow Mining as a Service (SWMaaS) to support both intra-cloud and inter-cloud scientific workflow mining. The approach is implemented as a ProM plug-in and is evaluated on event logs derived from real-world scientific workflows. Through experimental results, we demonstrate the effectiveness and efficiency of our approach.
Author(s) Name:  Wei Song,Fangfei Chen,Hans-Arno Jacobsen,Xiaoxu Xia,Chunyang Ye and Xiaoxing Ma
Journal name:  IEEE Transactions on Parallel and Distributed Systems
Publisher name:  IEEE
Volume Information:  Oct. 2017, pp. 2979-2992, vol. 28
Paper Link:   https://www.computer.org/csdl/journal/td/2017/10/07907335/13rRUwdrdSk