Status

   COMPLETED

Points of Contact:

help@pawsey.org.au

Start Date/Time (AWST)

16:30

Responsible: Chris Schlipalius
Estimated End Date/Time (AWST)


Accountable: Mark Gray
End Date/Time (AWST)

08:15

Informed: CASDA and MWA Admins
Summary: Pawsey HSM partial outage
Systems/Services Affected: Data Portal, HSM, CASDA and MWA nodes.


Updates:

  • DMF is not processing requests.
  • CASDA is unable to access a filesystem.
  • HPE maintenance has been contacted.

15:45

  • DMF has stopped responding to all requests.
  • Vendor data collected.
  • Kernel dump of main metadata server initiated.
  • Cluster unstable and all clients unable to access any filesystems.

20:30

  • Cluster stable, minus the primary metadata server.
  • Primary metadata server rebooted.
  • Cluster complete and stable.
  • DMF restarted.
  • HSM files being processed.

2021-09-02 08:15

  • HSM stable overnight; returning to service.

Post-Incident Summary: