Data science with R and Python (PTC course)

Europe/Prague
207 (VŠB - Technical University Ostrava, IT4Innovations building)

207

VŠB - Technical University Ostrava, IT4Innovations building

Studentská 6231/1B 708 33 Ostrava–Poruba Czech Republic
Description

Annotation

The R part of course (first day) will be focused on presenting the basics of data analysis in R and visualization of data. The course will cover the introduction to the R statistical language introducing the basic data types and workflow. Afterwards, packages from the “tidyverse” collection will be presented. These includes packages for the loading of data, preprocessing data, basic data exploration, and visualization.

The Python oriented part (second day) will introduce essential data-scientific packages and will be complemented with hands-on exercises that will demonstrate their usage with real world data analytic problems, and showing how to tackle such problems.

The course will be up to 50% hands-on exercises covering all topics to practice the techniques, and patterns gained.

Level

beginner - intermediate

Language

English

Purpose of the course (benefits for the attendees)

Target audience: Users that want to use Python and/or R for data analysis and prototyping. The participants will learn basic and intermediate skills for exploratory data analysis and visualization in the programming languages of R and Python.

About the tutor(s)

Tomáš Martinovič obtained his PhD in computational sciences at IT4Innovations, VSB - Technical University of Ostrava in 2018. From 2015 to 2018 he worked in a team focused on analysis of complex dynamical systems, where he worked on scalable implementations of algorithms from the field of nonlinear time series analysis. Since the start of 2019 he has been working in a team focused on high performance data analysis with the defined objective of research and transfer of knowledge in cooperation with industry.

Stanislav Böhm has a PhD in computer science, and is a researcher at IT4Innovations. He is interested in distributed systems, verification, and scheduling.

Acknowledgements

This event was partially supported by The Ministry of Education, Youth and Sports from the National Programme of Sustainability (NPU II) project „IT4Innovations excellence in science - LQ1602“ and partially by the PRACE-5IP project - the European Union’s Horizon 2020 research and innovation programme under grant agreement No 730913.

  • Wednesday, 24 April
    • 09:30 10:00
      Registration/presentation 207

      207

      VŠB - Technical University Ostrava, IT4Innovations building

      Studentská 6231/1B 708 33 Ostrava–Poruba Czech Republic
    • 10:00 11:30
      Introduction; Data import in R; Tidying data in R; Hands-on 207

      207

      VŠB - Technical University Ostrava, IT4Innovations building

      Studentská 6231/1B 708 33 Ostrava–Poruba Czech Republic
    • 11:30 13:00
      Lunch 1h 30m Restaurant Pustkovecka Basta

      Restaurant Pustkovecka Basta

    • 13:00 14:30
      Exploratory analysis with tidyverse in R; Hands-on 207

      207

      VŠB - Technical University Ostrava, IT4Innovations building

      Studentská 6231/1B 708 33 Ostrava–Poruba Czech Republic
    • 14:30 15:00
      Coffee break 30m 207

      207

      VŠB - Technical University Ostrava, IT4Innovations building

      Studentská 6231/1B 708 33 Ostrava–Poruba Czech Republic
    • 15:00 16:30
      Advanced data visualization and analysis with ggplot2 and trelliscopejs in R; Hands-on; Q & A 207

      207

      VŠB - Technical University Ostrava, IT4Innovations building

      Studentská 6231/1B 708 33 Ostrava–Poruba Czech Republic
  • Thursday, 25 April
    • 09:45 10:45
      Introduction to Pandas 207

      207

      VŠB - Technical University Ostrava, IT4Innovations building

      Studentská 6231/1B 708 33 Ostrava–Poruba Czech Republic
    • 10:45 11:00
      Coffee 15m 207

      207

      VŠB - Technical University Ostrava, IT4Innovations building

      Studentská 6231/1B 708 33 Ostrava–Poruba Czech Republic
    • 11:00 12:00
      Hands-on 207

      207

      VŠB - Technical University Ostrava, IT4Innovations building

      Studentská 6231/1B 708 33 Ostrava–Poruba Czech Republic
    • 12:00 13:00
      Lunch 1h Restaurant Pustkovecka Basta

      Restaurant Pustkovecka Basta

    • 13:00 14:30
      Exploratory analysis in Pandas 207

      207

      VŠB - Technical University Ostrava, IT4Innovations building

      Studentská 6231/1B 708 33 Ostrava–Poruba Czech Republic