21 brand-new Dell C6620 servers, ready for power and networking. Each server has two 24-core Xeon Gold 6442 processors with 384GB of RAM. And to top it all off, a Dell R760XA with four L40S GPU cards. …
IMPORTANT – SOFTWARE INSTALLS: We have noticed an increasing number of people installing software/libraries directly on the head node. Please do not do this. Many of these installs require a large amount of RAM or induce heavy load on the…
Accounts and acknowledgements: If you no longer require your UCT HPC account, or if you have recently submitted a thesis or paper acknowledging usage of the cluster, please let us know. Core reservations: When requesting resources, please do not oversubscribe …
Our new cluster is now live. The head node is hex.uct.ac.za, for those who remember our rollout of Dell C6145s in 2013. At present the cluster consists of 16 C6620 nodes with 48 cores and 384GB of RAM each. …
Over the past few days the HPC and networks team relocated three racks' worth of servers to a new location in the upper campus data centre. This will free up space for the continuation of the refurbishment project. The end…
We have added another GPU server to our a100 partition. This server was purchased with funding from several groups as well as ICTS, and additional resources will be dedicated to the shared a100free account. The server contains four a100-80GB cards…
The HPC cluster has been moved to the new upper campus data centre. The new data centre provides more electrical power and cooling, and also has a new UPS and generators to better withstand load shedding. In addition…
Dear colleagues, As part of the process of ongoing improvement, ICTS will be migrating the High-Performance Computing cluster from its current location to the new data centre. This will result in some downtime for the cluster. How does this affect…
TL;DR – The cluster head node operating system, firmware and storage system have all been patched/upgraded. – The wall time for ada has been increased from 72 to 170 hours. – MATLAB is now available on the cluster. Cluster…
Thermal regulation and airflow are two important parts of any HPC installation. Adding InfiniBand increases the amount of physical material at the back of the servers, which can obstruct fans, reduce airflow and lead to overheating and damaged equipment. …