26–27 Apr 2023
ONLINE and onsite
Europe/Prague timezone

Annotation

This course focuses on data analysis and modeling in the R statistical programming language. The first day introduces how to approach a new dataset to gain a better understanding of the data and its features. Modeling with the modern collection of packages known as tidymodels is covered afterward. These packages strive to make modeling in R as simple and reproducible as possible.

The second day focuses on increasing computational efficiency by introducing Rcpp for seamless integration of C++ code into R code. A simple example of CUDA usage with Rcpp will be shown. In the afternoon, a section on parallelizing code with the future package and/or MPI will be presented.

Benefits for the attendees, and what they will learn:

  • How to take the first steps toward understanding a new dataset
  • How to prepare data for modeling
  • How to build a standard modeling workflow using modern R packages
  • How to speed up code by using C++
  • How to parallelize code and run it on a cluster

Level

intermediate

Language

English

Prerequisites

Some experience with programming in R is required; knowledge of dplyr is an advantage.

Tutor

Tomáš Martinovič obtained his Ph.D. in computational sciences at IT4Innovations, VSB - Technical University of Ostrava in 2018. From 2015 to 2018 he was part of a team focused on the analysis of complex dynamical systems, where he worked on scalable implementations of algorithms from the field of nonlinear time series analysis. Since the start of 2022, he has led a team focused on machine learning/AI and operations research, with the defined objective of research and knowledge transfer in cooperation with industry.

Acknowledgments

This project has received funding from the European High-Performance Computing Joint Undertaking (JU) under grant agreement No 101101903. The JU receives support from the Digital Europe Programme and Germany, Bulgaria, Austria, Croatia, Cyprus, Czech Republic, Denmark, Estonia, Finland, Greece, Hungary, Ireland, Italy, Lithuania, Latvia, Poland, Portugal, Romania, Slovenia, Spain, Sweden, France, Netherlands, Belgium, Luxembourg, Slovakia, Norway, Türkiye, Republic of North Macedonia, Iceland, Montenegro, Serbia. This project has received funding from the Ministry of Education, Youth and Sports of the Czech Republic.

This course was supported by the Ministry of Education, Youth and Sports of the Czech Republic through the e-INFRA CZ (ID:90254).

207

Registration

Registration is obligatory. Only registered participants will receive the Zoom link.

Please note that the training is held using Zoom. We advise all participants to download the Zoom application to enjoy full functionality. 

Once the maximum number of registrations has been reached or the registration form has been closed, you can email us at training@it4i.cz to ask to be put on the waiting list (vacancies may occur due to cancellations, etc.).

Practicalities

This training will be a hybrid event. Technical details about joining will be sent to accepted registrants before the event. If you are coming to IT4Innovations to attend in person, please bring your own laptop.

Capacity and Fees

Capacity is limited to 30 participants, online and onsite combined.

The course is free of charge for all participants.
