HyperLoom: A Platform for Executing Scientific Pipelines in Distributed Environments (IT4I training)

Name: HyperLoom: A Platform for Executing Scientific Pipelines in Distributed Environments (IT4I training)
Start: 2018-06-04T09:30:00+02:00
End: 2018-06-04T16:30:00+02:00
Location: VŠB - Technical University Ostrava, IT4Innovations building

Monday 4 Jun 2018, 09:30 → 16:30 Europe/Prague

207 (VŠB - Technical University Ostrava, IT4Innovations building)

207

VŠB - Technical University Ostrava, IT4Innovations building

Studentská 6231/1B 708 33 Ostrava–Poruba Czech Republic

Description

Annotation

Real-world applications often encompass end-to-end data processing pipelines composed of a large number (millions) of interconnected computational tasks of various granularity. We introduce HyperLoom as a platform for defining and executing such pipelines in distributed environments using a Python API.

Scientific pipelines such those in machine learning compose of multiple data processing tasks. HyperLoom users can easily define dependencies between computational tasks and create a pipeline which can then be executed on HPC systems. The high-performance core of HyperLoom dynamically orchestrates the tasks over available resources respecting task requirements. The entire system was designed to have minimal overhead and to efficiently deal with varying computational times of the tasks. HyperLoom allows to execute pipelines that contain basic built-in tasks, user-defined Python tasks, tasks wrapping third-party applications or combinations of those.

This course will introduce HyperLoom and possibility of its usage in HPC environments based on our experience with HyperLoom deployment at IT4Innovations national supercomputing center.

Level

beginner - intermediate

Language

English

Purpose of the course (benefits for the attendees)

Attendees will learn the key concepts of HyperLoom, its architecture, and usage explained through practical examples.

About the tutor(s)

Stanislav Böhm is a computer science researcher at Advanced Data Analysis and Simulations Lab at IT4Innovations and Institute of Formal and Applied Linguistics at Charles University. He is interested in distributed systems and verification problems. He received his Ph.D. in 2014. He is the main author and team leader of the following software projects related to HPC: HyperLoom (framework for distributed pipelines), Aislinn (verification tool for MPI programs), Kaira (high-level development environment for MPI programs), and Haydi (combinatorial framework).

Vojtěch Cima is affiliated as a research assistant and Ph.D. student at Advanced Data Analysis and Simulations Lab at IT4Innovations where he actively participates in national and European research projects focusing on workload distribution and machine learning.

All presentations and educational materials of this course are provided under the Creative Commons Attribution-ShareAlike 4.0 International (CC BY-SA 4.0) license.

Support

training@it4i.cz

- 09:30 → 10:00
  
  Registration/presentation 207
  
  207
  
  VŠB - Technical University Ostrava, IT4Innovations building
  
  Studentská 6231/1B 708 33 Ostrava–Poruba Czech Republic
- 10:00 → 11:30
  
  Introduction, overview, getting started 207
  
  207
  
  VŠB - Technical University Ostrava, IT4Innovations building
  
  Studentská 6231/1B 708 33 Ostrava–Poruba Czech Republic
- 11:30 → 12:30
  
  Time for lunch 1h
- 12:30 → 14:00
  
  Hands-on session (distributed parameter search, model cross-validation...) 207
  
  207
  
  VŠB - Technical University Ostrava, IT4Innovations building
  
  Studentská 6231/1B 708 33 Ostrava–Poruba Czech Republic
- 14:00 → 14:30
  
  Coffee break 30m 207
  
  207
  
  VŠB - Technical University Ostrava, IT4Innovations building
  
  Studentská 6231/1B 708 33 Ostrava–Poruba Czech Republic
- 14:30 → 16:00
  
  Bring your own problem, discussion 207
  
  207
  
  VŠB - Technical University Ostrava, IT4Innovations building
  
  Studentská 6231/1B 708 33 Ostrava–Poruba Czech Republic

Choose timezone

HyperLoom: A Platform for Executing Scientific Pipelines in Distributed Environments (IT4I training)

207

VŠB - Technical University Ostrava, IT4Innovations building

Annotation

Level

Language

Purpose of the course (benefits for the attendees)

About the tutor(s)

207

VŠB - Technical University Ostrava, IT4Innovations building

207

VŠB - Technical University Ostrava, IT4Innovations building

207

VŠB - Technical University Ostrava, IT4Innovations building

207

VŠB - Technical University Ostrava, IT4Innovations building

207

VŠB - Technical University Ostrava, IT4Innovations building