The HPC cluster will be unavailable during the weekend of the 12th of October for scheduled data center network maintenance.
We will be placing the cluster into draining mode at 17:00 on Friday the 11th, at which point no new …
Our new cluster is now live. The head node is hex.uct.ac.za, for those who remember our rollout of Dell C6145s in 2013.
At present the cluster consists of 16 C6620 nodes, each with 48 cores and 384 GB of RAM. …
Over the past few days the HPC and networks teams relocated three racks' worth of servers to a new location in the upper campus data center. This will free up space for the continuation of the refurbishment project. The end …
The UCT HPC cluster is being migrated to its final location in the newly refurbished ICTS data center. The move will take place in the first week of June, during which time the cluster will be unavailable.
The cluster will …
The HPC cluster will be unavailable during the weekend of the 20th of April due to a network upgrade.
We will be placing the cluster into draining mode at 17:00 on Friday the 19th, at which point no new jobs …
On Monday 5 June at 13:45 an environmental event in the Upper Campus Data Center caused damage to the HPC rack. Currently 5 worker nodes are offline. We have ordered replacement parts from our suppliers; however, the implication is that …
We have added another GPU server to our a100 partition. This server was purchased with funding from several groups as well as ICTS, and additional resources will be dedicated to the shared a100free account.
The server contains four a100-80GB cards…
Being able to analyze the energy usage of every core in every CPU of the cluster enables us to detect jobs that are not making good use of allocated cores over time.
Here is a node that is using 1 …
Our new cluster will use cgroups to control RAM and thread allocation. One of the biggest hassles we’ve faced over the years is code not adhering to the scheduler reservation; in other words, grabbing more cores and more RAM than …
We have moved away from Cacti/Nagios for graphing and now make use of Grafana. Unfortunately there is no public-facing portal for Grafana; however, there is a way to export graphs as static PNG files, so we have set up …