On the morning of Monday, August 11, 2008, the air conditioning (A/C) equipment in the TSRB Data Center (TSRB MDF) failed, and the room temperature rose to a level at which it was unsafe to continue operating the servers. As a result, at approximately 12:10 PM, several HPC clusters and file servers were powered off. The shutdown would have affected CS, CSE, and IC faculty and students. After the A/C equipment was repaired, the affected servers were powered back on; this was completed shortly before 2:00 PM.
The affected servers/clusters are listed below:
arc - CS/Theory Server
odin - CS/CERCS Server
parliament - IC/Intelligent Systems Server
parliament2 - IC/Intelligent Systems Server
parliament3 - IC/Intelligent Systems Server
hydrogen - IC/Intelligent Systems Server
Loki - CS/CERCS HPC Cluster
Rohan - CS/CERCS HPC Cluster
Thunderbird - IC/RIM HPC Cluster
Topaz - CSE HPC Cluster
Wilks - IC/CPL HPC Cluster
Owner of Alert: TSO