5th Users' Conference of IT4Innovations

Name: 5th Users' Conference of IT4Innovations
Start: 2021-11-09T09:00:00+01:00
End: 2021-11-09T16:00:00+01:00
Location: IT4Innovations

9 November 2021

IT4Innovations

Europe/Prague timezone

Support

pr@it4i.cz

HyperQueue: Simplifying Usage of PBS/SLURM Clusters

9 Nov 2021, 14:00

30m

Online (IT4Innovations)

Online

IT4Innovations

Poster Poster session Poster session

Jakub Beránek (IT4Innovations)

In recent years, HPC workloads and communities have undergone substantial paradigm shifts. There is an increasing amount of users that want to leverage HPC clusters to execute many simple and embarrassingly parallel tasks as easily as possible. However, due to the limitations of traditional HPC job managers, these users must often resort to manual aggregation of tasks into a smaller number of jobs to reduce job manager overhead. This approach is both labour-intensive and inefficient, as it lacks dynamic load balancing required to fully utilize computational nodes with tens or hundreds of cores. We introduce HyperQueue - a task scheduling runtime that can execute a large amount of tasks on top of an HPC job manager by automatically aggregating tasks into jobs and dynamically load balancing them across all allocated nodes and CPU cores. HyperQueue is an open-source tool that is designed for ease of use and deployment.

Stanislav Böhm (IT4Innovations) Jakub Beránek (IT4Innovations) Vojtech Cima Roman Machacek Vyomkesh Jha Alfred Koci Jan Martinovic Branislav Jansik

There are no materials yet.

5th Users' Conference of IT4Innovations

Support

HyperQueue: Simplifying Usage of PBS/SLURM Clusters

Online

IT4Innovations

Speaker

Description

Primary authors

Presentation materials

Choose timezone

5th Users' Conference of IT4Innovations

Support

Speaker

Description

Primary authors

Presentation materials