Page tree
Skip to end of metadata
Go to start of metadata

Status

COMPLETED

Points of Contact:

help@pawsey.org.au
Start Date/Time (AWST)

11:35

Responsible:

Mark Gray: Head of Platforms
Estimated End Date/Time (AWST)

15:15

Accountable:

Mark O'Shea: Head of  Supercomputing Operations
End Date/Time (AWST)

15:40

Informed:

Mailman: magnus_users
Summary:Loss of cooling to cabinets
Systems/Services AffectedMagnus


Updates:

  • 11:35 Ongoing re-balancing work of the water cooling systems across the Pawsey data-centre has seen five cabinets in Magnus shutdown
    • Correction: when we said "Ongoing" above, it represented our understanding at the time we issued the Incident report
      We have since been informed the re-balancing work of the water cooling systems across the Pawsey data-centre had ended at 11:00.
  • 11:39 On-site Cray engineer informs us that he will have to shutdown Magnus in order to restore the service
  • 11:40 Commencement of shutdown of Magnus
  • 14:55 We have allowed jobs to start running again
  • 15:40 Jobs appear to  have been running as normal.


Post-Incident Summary: