[SOLVED] TSD Colossus Problem: Cluster not available at the moment (05/12-2016 at 13:00)
Due to a failure in the heating system, the Colossus front-end and one of the rack went down this morning (05/12-2016) and the cluster is not available at the moment. We are working to reboot the system at the moment.
Jobs that were running on the rack that went down, unavoidably died while those running on the other rack are most likely running even though the front end is not available.
We apologize for the inconvenience.
Published Dec. 5, 2016 1:12 PM
- Last modified Dec. 5, 2016 4:25 PM