[ONLINE] Fundamentals of Deep Learning for Multi-GPUs (EuroCC)

Europe/Prague
[ONLINE]

[ONLINE]

Description

Annotation

The computational requirements of deep neural networks used to enable AI applications like self-driving cars are enormous. A single training cycle can take weeks on a single GPU or even years for larger datasets like those used in self-driving car research. Using multiple GPUs for deep learning can significantly shorten the time required to train lots of data, making solving complex problems with deep learning feasible.

We will teach you how to use multiple GPUs to train neural networks. You'll learn:

  • Approaches to multi-GPUs training

  • Algorithmic and engineering challenges to large-scale training

  • Key techniques used to overcome the challenges mentioned above

This course is only offered to academia (see details below in section Capacity and Fees).

Level

intermediate

Language

English

Purpose of the course (benefits for the attendees)

Upon completion, you'll be able to effectively parallelize training of deep neural networks using Tensorflow 2.x.

About the tutor

Georg Zitzlsberger is a research specialist for Machine and Deep Learning at IT4Innovations. He has for over three years been certified by NVIDIA as a University Ambassador of the NVIDIA Deep Learning Institute (DLI) programme. This certification allows him to offer NVIDIA DLI courses to academic users of IT4Innovations' HPC services. In addition, in collaboration with Bayncore, he is a trainer for Intel HPC and AI workshops and conferences carried out across Europe. He has been contributing to these events, which are held for audiences from industry and academia, for five years.

NVIDIA Deep Learning Institute

The NVIDIA Deep Learning Institute delivers hands-on training for developers, data scientists, and engineers. The program is designed to help you get started with training, optimizing, and deploying neural networks to solve real-world problems across diverse industries such as self-driving cars, healthcare, online services, and robotics.

Acknowledgements

                                         

This event was supported by the EuroCC project. This project has received funding from the European High-Performance Computing Joint Undertaking (JU) under grant agreement No 951732. The JU receives support from the European Union’s Horizon 2020 research and innovation programme and Germany, Bulgaria, Austria, Croatia, Cyprus, the Czech Republic, Denmark, Estonia, Finland, Greece, Hungary, Ireland, Italy, Lithuania, Latvia, Poland, Portugal, Romania, Slovenia, Spain, Sweden, the United Kingdom, France, the Netherlands, Belgium, Luxembourg, Slovakia, Norway, Switzerland, Turkey, Republic of North Macedonia, Iceland, Montenegro. This project has received funding from the Ministry of Education, Youth and Sports of the Czech Republic (ID:MC2101).

 

 

This course is supported by the Ministry of Education, Youth and Sports of the Czech Republic through the e-INFRA CZ (ID:90140).

    • 08:45 09:00
      Time to join the meeting
    • 09:00 09:30
      Intro
    • 09:30 10:00
      Stochastic Gradient Descent
    • 10:00 10:15
      Comfort Break 15m
    • 10:15 11:15
      Hands-On: Stochastic Gradient Descent
    • 11:15 12:00
      Introduction to Distributed Training
    • 12:00 13:00
      Lunch Break 1h
    • 13:00 13:40
      Hands-On: Distributed Training
    • 13:40 14:30
      Algorithmic Challenges of Distributed SGD
    • 14:30 14:45
      Coffee Break 15m
    • 14:45 15:45
      Hands-On: Algorithmic Challenges of Distributed SGD
    • 15:45 16:00
      Q&A