This incident has been resolved.
Jul 8, 10:44 AWST
Supercomputing clusters: Magnus, Galaxy, Zeus, and Topaz, along with lustre filesystems, have returned to operation.
Jul 3, 16:41 AWST
One of the lustre filesystems (/astro) isn't coming back online cleanly. A call has been lodged with the vendor and we're awaiting an update from them before proceeding. Supercompute services will remain offline until the filesystem issue is addressed.
Jul 2, 21:05 AWST
All the affected services are fed via a single sub-board. CSIRO facilities staff are on their way to site to investigate cause and Pawsey staff will commence service recovery when given the all-clear
Jul 2, 16:25 AWST
We are continuing to investigate this issue.
Jul 2, 15:57 AWST