[HYBRID] Moving your AI training jobs to LUMI: A Hands-On Workshop (EuroCC)

Name: [HYBRID] Moving your AI training jobs to LUMI: A Hands-On Workshop (EuroCC)
Start: 2024-11-26T09:00:00+01:00
End: 2024-11-27T17:00:00+01:00
Location: HYBRID (Online, IT4Innovations)

Europe/Prague timezone

Support

training@it4i.cz

This course is organized by the LUMI User Support Team (LUST) and the EuroCC National Competence Centers (NCCs) in Finland and the Czech Republic.

Annotation

Join our two-day workshop, “Getting Started with AI on LUMI,” designed to familiarize you with the capabilities of the LUMI supercomputer for artificial intelligence applications. This workshop is ideal for those looking to transition from smaller-scale computing environments like laptops, workstations, or cloud VMs to the robust, GPU-intensive LUMI platform.

Participants are invited to bring their own AI training scripts to the workshop, where they will receive personalized support to adapt and run them on LUMI’s advanced GPU system. Whether you aim to leverage a single GPU or scale up to multiple GPUs, our workshop will provide valuable insights and practical skills to enhance your AI projects with LUMI’s powerful computing infrastructure.

Learning outcomes

By attending the workshop, you will gain an understanding of the LUMI-G architecture as it relates to AI training, including an introduction to Slurm, ROCm, the Lustre file system and LUMI-O object storage, and the Slingshot 11 interconnect. Specifically, you will:

Learn to utilize existing AI containers on LUMI and build your own using the container build tool, cotainr
Learn how to run training on a single GPU and monitor its efficiency
Learn to distribute AI workloads across multiple GPUs within a single LUMI-G node
Gain insight into advanced topics for optimizing AI training processes on the LUMI supercomputer
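
To give a flavour of the multi-GPU topic, here is a minimal, framework-free sketch of data-parallel sharding, in which each worker process (typically one per GPU) handles a disjoint slice of the dataset. The function name `shard_indices` and the worker count of 8 are illustrative assumptions and are not taken from the workshop material.

```python
from typing import List


def shard_indices(num_samples: int, rank: int, world_size: int) -> List[int]:
    """Return the dataset indices handled by worker `rank` out of `world_size`."""
    if not 0 <= rank < world_size:
        raise ValueError("rank must satisfy 0 <= rank < world_size")
    # Strided assignment: worker r gets samples r, r + world_size, r + 2 * world_size, ...
    return list(range(rank, num_samples, world_size))


if __name__ == "__main__":
    world_size = 8  # illustrative assumption: one worker process per GPU
    for rank in range(world_size):
        print(rank, shard_indices(20, rank, world_size))
```

In practice a framework utility such as PyTorch's `DistributedSampler` does this job and also handles shuffling and uneven dataset sizes, which this sketch does not.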

Agenda

The workshop consists of a mix of short lectures and hands-on exercises that cover the following key topics:

LUMI-G architecture overview and its applications in AI
Introduction to the LUMI web-interface for development and monitoring
Using the AI framework PyTorch on LUMI
Building and deploying custom AI containers on LUMI
Strategies for scaling AI workloads across multiple GPUs
Support for adapting and running your own AI training script on LUMI (on-site participants only)

Each day will run from 9:00 to 17:00 CET, with breaks scheduled throughout.

Requirements

Participants are expected to have basic experience with:

Working on a Linux command line
Using Python and one or more of the Python AI frameworks PyTorch, TensorFlow, or JAX
Training an AI model on at least a single GPU, e.g. using a laptop, workstation, or cloud service
Managing Python environments, e.g. using the Conda and/or pip package managers
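
As a quick self-check before the workshop, the sketch below reports which of the listed frameworks can be imported in your current Python environment. It uses only the standard library; the helper name `installed_frameworks` is a hypothetical name chosen for illustration, and the check says nothing about GPU support.

```python
import importlib.util

# Import names of the frameworks listed in the requirements.
FRAMEWORKS = {"PyTorch": "torch", "TensorFlow": "tensorflow", "JAX": "jax"}


def installed_frameworks() -> dict:
    """Map each framework name to True if its module can be found in this environment."""
    return {
        name: importlib.util.find_spec(module) is not None
        for name, module in FRAMEWORKS.items()
    }


if __name__ == "__main__":
    for name, found in installed_frameworks().items():
        print(f"{name}: {'found' if found else 'not found'}")
```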

Participants are expected to bring a laptop to the workshop, including a charger.

Language

English

Acknowledgements

This project has received funding from the European High-Performance Computing Joint Undertaking (JU) under grant agreement No 101101903. The JU receives support from the Digital Europe Programme and Germany, Bulgaria, Austria, Croatia, Cyprus, Czech Republic, Denmark, Estonia, Finland, Greece, Hungary, Ireland, Italy, Lithuania, Latvia, Poland, Portugal, Romania, Slovenia, Spain, Sweden, France, Netherlands, Belgium, Luxembourg, Slovakia, Norway, Türkiye, Republic of North Macedonia, Iceland, Montenegro, Serbia. This project has received funding from the Ministry of Education, Youth, and Sports of the Czech Republic.

This course was supported by the Ministry of Education, Youth and Sports of the Czech Republic through e-INFRA CZ (ID: 90254).

IT4Innovations Studentská 6231/1B Ostrava – Poruba, 708 00 Czech Republic

Registration

Deadline: 19 November 2024, 16:00 CET

Register for the Workshop: https://ssl.eventilla.com/aitraningjobslumi

Please register by the deadline to secure your place. Spaces are limited and available on a first-come, first-served basis. Participation in the workshop is free and includes a sandwich for lunch as well as coffee, tea, water, and a few snacks on both days.

We have 30 in-person places, allocated on a first-come, first-served basis. In-person participants will receive a prompt confirmation so that they can arrange travel; online participants will receive confirmation shortly after the deadline. If your plans change, we kindly ask you to cancel your registration as soon as possible, ideally before the registration deadline. The email acknowledging your registration will contain a link to manage it.

Users who do not have an account on LUMI yet will receive temporary access for the purpose of the course. The allocated compute time should only be used for the course exercises. Any abuse will result in removal from the allocation for this and future courses.

Online Attendance Option

For those unable to attend in person, we are pleased to offer the option to join the lectures online. While the interactive hands-on exercises and personalized support for implementing your own workflows will be exclusive to in-person attendees, remote participants will still benefit from the comprehensive lectures streamed live from the workshop.

Venue

The workshop will take place at the IT4Innovations offices in Ostrava. Participants are responsible for arranging their own travel and accommodation.

IT4Innovations
Studentská 6231/1B
Ostrava – Poruba, 708 00
Czech Republic
https://maps.app.goo.gl/HryAYm9wRdSZDpXk9

Suggested accommodation and travel information

https://www.it4i.cz/en/how-to-get-to-it4innovations
