Our regular first-Tuesday-of-the-month maintenance will not proceed in August. Instead, an extended maintenance period for Setonix is scheduled for August 9th - 14th.
During the shutdown, HPE will replace Setonix's management system, bringing several benefits:
• Moving forward, system updates and patches will happen during regular maintenance, minimising disruptions.
• HPE reports improved system stability with the upgraded management system.
What does this mean?
• Setonix (including the Setonix visualisation nodes) and Garrawarla will be unavailable from August 9th to 14th.
• All Pawsey systems will be subject to regular maintenance on the 13th of August, including disruptive testing on both Acacia clusters, an upgrade to the control plane of Nimbus and patching of the Banksia system.
The replacement of Setonix's management system has been successfully implemented on Pawsey's test and development system. When Setonix is returned to service, the version of Cray Operating System won't have changed nor will the software stack provided by Pawsey. Only security fixes are being applied.
The login and visualisation nodes are being moved to the 100 gigabit network, which means anyone providing external software access to Setonix should allow access from the 146.118.74.0/22 network.
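For anyone maintaining an allowlist on an external service, the following is a minimal sketch of how a source address could be checked against the announced block, using Python's standard `ipaddress` module. The names `ANNOUNCED_CIDR`, `SETONIX_NET` and `is_from_setonix_range` are illustrative only and are not part of any Pawsey tooling. Note that the block as quoted has host bits set for a /22, so the sketch passes `strict=False` and Python normalises it to the enclosing network, 146.118.72.0/22. If the exact range matters for your firewall rules, it is worth confirming it with help@pawsey.org.au.

```python
import ipaddress

# CIDR block exactly as quoted in the notice.
ANNOUNCED_CIDR = "146.118.74.0/22"

# strict=False is needed because the quoted address has host bits set
# for a /22 prefix; ipaddress then normalises it to the enclosing network.
SETONIX_NET = ipaddress.ip_network(ANNOUNCED_CIDR, strict=False)


def is_from_setonix_range(addr: str) -> bool:
    """Return True if addr falls inside the announced block (hypothetical helper)."""
    return ipaddress.ip_address(addr) in SETONIX_NET


if __name__ == "__main__":
    print(SETONIX_NET)                             # normalised network
    print(is_from_setonix_range("146.118.74.10"))  # inside the block
    print(is_from_setonix_range("146.118.80.1"))   # outside the block
```

The same normalised network string can be pasted into most firewall or licence-server allowlists that accept CIDR notation.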
Further updates will be provided on status.pawsey.org.au, and any questions should be directed to help@pawsey.org.au. Posted on
Jul 25, 2024 - 11:15 AWST
Completed -
The scheduled maintenance has been completed.
Jul 23, 17:12 AWST
Update -
We will be undergoing scheduled maintenance during this time.
Jul 19, 08:49 AWST
Scheduled -
During thermographic and ultrasonic testing in the SC Cell underfloor, CBIS contractors identified a hot spot on an isolator.
The isolator and cabling terminations need urgent isolation and replacement, so the Setonix cabinet attached to the isolator is currently being drained. This means the highmem partition is not available, and the work partition has reduced capacity.
Jul 19, 08:49 AWST
Resolved -
The issue has been resolved. Thank you for your patience.
Jul 17, 15:39 AWST
Identified -
The issue has been identified and services have been restarted. One tape library is back running jobs; we are still working on the second tape library.
Jul 17, 08:44 AWST
Investigating -
There appears to be a problem with the scoutam service this morning that is affecting staging. We are working on the issue.
Jul 17, 08:00 AWST