Intel released its Xeon Phi coprocessor yesterday, which is aimed specifically at highly parallel applications. The processor boasts a whopping 1 teraflop of compute power on a single PCI-E card. Petascale and Exascale computing is definitely around the …
New worker nodes
We've added two new nodes, 213 and 214, to the 200 series. Both nodes have 8GB of RAM. This brings our cluster up to 136 cores.…
RAM upgrade and more worker nodes
The RAM of worker node 209 has been upgraded from 4 to 8GB and the node moved back to the 200 series. We will be deploying a few more 200 series and at least 1 more 400M series worker nodes…
Surviving the power outage
Yesterday's Peninsula-wide blackout fortunately had no effect on our HPC and Grid systems. Both data center generators kept their UPSs running until power was restored after about an hour.
As can be seen from the monitoring dashboard, all devices…
Memory upgrade complete
The cluster memory upgrade seems to have gone OK, and /scratch01 is remounted. Server 209 is still at 4GB as we only budgeted for 9 memory kits. Once we can confirm that the extra 4GB RAM has made a difference for…
RAM upgrade
The ICTS-HPC team will be upgrading the RAM in several of the 200 series servers between 10 and 12 on Friday the 16th of September 2011. As a result, the /scratch01 disk area will be unavailable during this time.
…
RAM upgrade
Our new RAM has arrived. We're taking node 211 offline to test the new chips; this should have no impact on computation or the Gluster file system.
Update: server upgraded from 4 to 8GB of RAM. We'll do the…
New toys for spring
To celebrate spring we've deployed some more worker nodes and disk space. Yet another 200 series server with 4 cores and 4GB of RAM, as well as two new 400 series servers. These last two servers come out of the…
Power disruptions
Power to the Rondebosch area dropped again this morning, once for 2 minutes and then a brief brownout just after 11:30. We are assured that this is not "load shedding" by Eskom, although this is of little relief to anyone…
A stressful day
Patched kernels on HPC servers to 2.6.18-238.1.1.el5; all went fine except for the head node, which has an issue with the latest kernel (it dies at boot with a kernel panic), so we are booting it into the older version 2.6.18-194.1.1.el5 until we can sort…