Our current cluster, hex, runs Torque with MAUI as the scheduler. While MAUI is GPU aware it does not allow GPUs to be scheduled. In other words you can list the nodes with GPUs but you cannot submit a job …
New worker nodes
We've installed four new 600 series worker nodes on hex. This increases the core count by 256, and we hope to add a few more to both the hex and hpc clusters shortly. If you notice anything odd please let…
Clusters back on line
Power has been restored and the clusters are back on-line. Please notify us if you spot any irregularities.…
Urgent notification of downtime, Sun 13 Jun
We have been notified that the UCT Upper Campus data centre will be
shut down for emergency electrical work, see notice below. Please
can you checkpoint or shut down your jobs by Saturday 12th July
20:00.
We apologise for this…
Maintenance feedback
What went right?
- The HPC rack was neatened up. This involved moving and consolidating servers, making space for PDU's and removing redundant cables that were impeding airflow. New HPC servers were installed. This task took two entire days as other
Changes to HPC
Our older cluster, hpc.uct.ac.za is undergoing several changes. The 200 series will soon be decomissioned. The nodes are old, have insufficient core density and are inefficient power-wise compared to more modern servers. The space in the racks is required for…
AWS postscript
So after waiting 24 hours and then looking at the AWS billing reports a couple of things stand out. At first we were puzzled as to why we were billed for Run Instances as well as Spot Instances. Turns out
…