Speaker
Mr
Petr Stehlík
(FIT VUT)
Description
In the race toward exascale computing, supercomputing systems are getting more complex. This makes it more difficult to operate the computing resources, infrastructure and software components at the most efficient point. The first step to counteract these trends is to give users and system administrators a cockpit from which they can visually inspect status of the cluster and executed applications.
We present Examon Web -- an open source framework for visualization of performance, power and energy statistics of HPC applications and cluster status. Examon Web is built on top of a fine grain monitoring framework which collects and handle a wide set of sensors and performance counters for the cluster computing resources, job scheduling data and infrastructure metrics all sampled at a fine granularity. Examon Web combines these information on a per job and per cluster base to provide visual insights on applications and cluster performance, power and energy.
Primary author
Mr
Petr Stehlík
(FIT VUT)
Co-author
Dr
Jiri Jaros
(Brno University of Technology)