Modular Resource Centric Learning for Workflow Performance Prediction

نویسندگان

  • Alok Singh
  • Mai H. Nguyen
  • Shweta Purawat
  • Daniel Crawl
  • Ilkay Altintas
چکیده

Workflows provide an expressive programming model for fine-grained control of large-scale applications in distributed computing environments. Accurate estimates of complex workflow execution metrics on large-scale machines have several key advantages. The performance of scheduling algorithms that rely on estimates of execution metrics degrades when the accuracy of predicted execution metrics decreases. This in-progress paper presents a technique being developed to improve the accuracy of predicted performance metrics of large-scale workflows on distributed platforms. The central idea of this work is to train resource-centric machine learning agents to capture complex relationships between a set of program instructions and their performance metrics when executed on a specific resource. This resource-centric view of a workflow exploits the fact that predicting execution times of sub-modules of a workflow requires monitoring and modeling of a few dynamic and static features. We transform the input workflow that is essentially a directed acyclic graph of actions into a Physical Resource Execution Plan (PREP). This transformation enables us to model an arbitrarily complex workflow as a set of simpler programs running on physical nodes. We delegate a machine learning model to capture performance metrics for each resource type when it executes different program instructions under varying degrees of resource contention. Our algorithm takes the prediction metrics from each resource agent and composes the overall workflow performance metrics by utilizing the structure of the corresponding Physical Resource Execution Plan. Keywords— Scientific Workflow, Cloud, Exascale, Machine Learning, Performance Prediction

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

On user-centric QoE prediction for VoIP and video based on machine-learning

Assessing the impact of different network conditions on user experience is important for improving the telecommunication services. We have developed the MLQoE, a modular algorithm for user-centric QoE prediction. The MLQoE employs several machine learning (ML) algorithms and tunes their hyper-parameters. It selects the ML algorithm that exhibits the best performance and its parameters automatic...

متن کامل

Energy Prediction for I/O Intensive Workflow Applications

As workflow-based data-intensive applications have become increasingly popular, the lack of support tools to aid resource provisioning decisions, to estimate the energy cost of running such applications, or simply to support configuration choices has become increasingly evident. Our goal is to design techniques to predict the energy consumption of these workflow-based applications, evaluate dif...

متن کامل

Performance-Aware Workflow Management for Grid Computing

Grid middleware development has advanced rapidly over the past few years to support componentbased programming models and service-oriented architectures. This is most evident with the forthcoming release of the Globus toolkit (GT4), which represents a convergence of concepts (and standards) from both the grid and web-services communities. Grid applications are increasingly modular, composed of ...

متن کامل

E-science Workflow on the Grid

Grid computing, which can be characterized as large-scale distributed resource sharing and cooperation, has quickly become a mainstream technology in distributed computing. In this paper, we present the idea of applying certain grid workflow management techniques to mediate various services for grid-based e-science processes. The techniques of adaptable workflow services, aggressive sub-workflo...

متن کامل

GridFlow: Workflow Management for Grid Computing

Grid computing is becoming a mainstream technology for large-scale distributed resource sharing and system integration. Workflow management is emerging as one of the most important grid services. In this work, a workflow management system for grid computing, called GridFlow, is presented, including a user portal and services of both global grid workflow management and local grid sub-workflow sc...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره abs/1711.05429  شماره 

صفحات  -

تاریخ انتشار 2017