Pawsey Supercomputing Research Centre
Update - All services in the Pawsey Supercomputing Centre were shut down by 5:30 PM (AWST).

CBIS will commence HV maintenance tomorrow morning at 6 AM (AWST).

Feb 09, 2026 - 21:20 AWST
In progress - Scheduled maintenance is currently in progress. We will provide updates as necessary.
Feb 09, 2026 - 11:00 AWST
Scheduled - Mandatory annual testing of the Site Main Electrical ACB Switches and High Voltage equipment will be performed on 10th February 2026.

Pawsey will shut down all services housed in the Pawsey Centre starting at 11am on Monday 9th February 2026 in preparation for the mandatory testing.

Staff will commence returning services and performing routine maintenance activities on Wednesday, 11th February 2026.

Systems will return to service progressively throughout Wednesday 11th February and all services are expected to be returned to researchers by noon Thursday 12th February 2026.

Planned work for this window includes:
• Setonix will have its /software filesystem replaced with an NFS-based filesystem. A final synchronisation of the Lustre filesystem and NFS filesystem will be performed.
• The new visualisation software stack on the Setonix Remote Visualisation nodes will be made default.
• Banksia will have a ScoutAM upgrade.
• The tape libraries supporting Banksia will have a firmware update.
• Acacia Projects will conclude migration work off Puppet infrastructure.
• Patching of core Pawsey services will be undertaken.

The removal of the 2023.08, 2025.03, and 2024.05 software stacks, previously scheduled for the February maintenance, has been rescheduled. The stacks will be removed in that order across the March, April, and May maintenances, to allow researchers more time to migrate to the 2025.08 software stack.

Feb 9, 2026 11:00 - Feb 12, 2026 12:00 AWST
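As a rough aid for planning around this window, the following Python sketch checks whether a job of a given walltime would overlap it. The window times are taken from this notice; the function name `overlaps_window` and the example job times are illustrative assumptions, not part of any Pawsey tool.

```python
from datetime import datetime, timedelta, timezone

# AWST is UTC+8 all year (no daylight saving), so a fixed offset is sufficient.
AWST = timezone(timedelta(hours=8), "AWST")

# Maintenance window as published in this notice.
WINDOW_START = datetime(2026, 2, 9, 11, 0, tzinfo=AWST)
WINDOW_END = datetime(2026, 2, 12, 12, 0, tzinfo=AWST)


def overlaps_window(job_start: datetime, walltime: timedelta) -> bool:
    """Return True if a job starting at job_start and running for walltime
    would still be running at any point inside the maintenance window."""
    job_end = job_start + walltime
    return job_start < WINDOW_END and job_end > WINDOW_START


# Total length of the window in hours.
window_hours = (WINDOW_END - WINDOW_START).total_seconds() / 3600

# Hypothetical example: a 24-hour job starting at midnight on 9 February.
example_start = datetime(2026, 2, 9, 0, 0, tzinfo=AWST)
print(window_hours, overlaps_window(example_start, timedelta(hours=24)))
```

This only compares timestamps; whether the scheduler will actually hold or start a given job near the window is decided by the scheduler itself.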
Setonix Under Maintenance
Login nodes Under Maintenance
Data-mover nodes Under Maintenance
Slurm scheduler Under Maintenance
Setonix work partition Under Maintenance
Setonix debug partition Under Maintenance
Setonix long partition Under Maintenance
Setonix copy partition Under Maintenance
Setonix askaprt partition Under Maintenance
Setonix highmem partition Under Maintenance
Setonix gpu partition Under Maintenance
Setonix gpu high mem partition Under Maintenance
Setonix gpu debug partition Under Maintenance
Lustre filesystems Under Maintenance
/scratch filesystem Under Maintenance
/software filesystem Under Maintenance
/askapbuffer filesystem Under Maintenance
/askapingest filesystem Under Maintenance
Storage Systems Under Maintenance
Acacia Ingest Under Maintenance
Acacia MWA Under Maintenance
Acacia Projects Under Maintenance
Banksia Under Maintenance
Data Portal Systems Under Maintenance
MWA ASVO Under Maintenance
ASKAP Under Maintenance
ASKAP ingest nodes Under Maintenance
ASKAP service nodes Under Maintenance
Central Services Under Maintenance
Authentication and Authorization Under Maintenance
Service Desk Under Maintenance
License Server Under Maintenance
Application Portal Under Maintenance
Origin Under Maintenance
/home filesystem Under Maintenance
/pawsey filesystem Under Maintenance
Central Slurm Database Under Maintenance
Documentation Under Maintenance
Visualisation Services Under Maintenance
Remote Vis Under Maintenance
Vis scheduler Under Maintenance
Setonix vis nodes Under Maintenance
Nebula vis nodes Under Maintenance
Visualisation Lab Under Maintenance
Reservation Under Maintenance
CARTA - Stable Under Maintenance
CARTA - Test Under Maintenance
Pawsey Remote VR Under Maintenance
The Australian Biocommons Under Maintenance
Fgenesh++ Under Maintenance
Nimbus - Legacy Under Maintenance
Ceph storage Under Maintenance
Nimbus instances Under Maintenance
Nimbus dashboard Under Maintenance
Nimbus APIs Under Maintenance
Feb 10, 2026

No incidents reported today.

Feb 9, 2026

Unresolved incident: Pawsey HV Shutdown.

Feb 8, 2026

No incidents reported.

Feb 7, 2026

No incidents reported.

Feb 6, 2026

No incidents reported.

Feb 5, 2026

No incidents reported.

Feb 4, 2026

No incidents reported.

Feb 3, 2026

No incidents reported.

Feb 2, 2026
Resolved - The secondary tape library maintenance was successful on Friday and we've been monitoring it over the weekend.
Feb 2, 09:33 AWST
Identified - Banksia – one of two tape copies unavailable (at risk)
The Banksia service is currently in a degraded, “at risk” state as it is operating with only one tape library instead of the standard two. As a result, the secondary copy of files will be unavailable for staging or archiving until Library 2 is restored to service. The primary copy is still available for all data so this should not impact the service. If you experience any issues accessing data please let us know at help@pawsey.org.au

The issue has been traced to a faulty tape drive in DBA 5. To resolve this, the field engineer will remove the drives from the DBA, replace the faulty unit, and reseat all components. Since DBA 5 is currently blocked, DBA 6 will need to be removed first to allow access for the repair work. This work will take some time: the engineer is hoping for resolution today, but in the worst case it will take until Tuesday.

Jan 30, 09:36 AWST
Feb 1, 2026

No incidents reported.

Jan 31, 2026

No incidents reported.

Jan 30, 2026
Jan 29, 2026

No incidents reported.

Jan 28, 2026

No incidents reported.

Jan 27, 2026
Resolved - HPE believe the issue has been resolved and has closed the support case.

They believe the issue was:
"Global Flow Control was disabled on the E1000's. Once enabled performance was regained. ClusterStor team working on a fix (CSPROD-18819) to make Global Flow Control enabled all the time moving forward."

Jan 27, 10:06 AWST
Update - HPE made a change to the Global Flow Control on scratch and software during January's maintenance, as well as a modification to the configuration of the LAG ports in the Slingshot fabric.

We haven't seen any C_EC_CRIT errors on the login nodes since maintenance, and are continuing to monitor them like hawks.

Jan 19, 08:21 AWST
Update - setonix-04 reported a C_EC_CRIT error yesterday. It is not in the login pool, but HPE are stumped as to why this is happening.
Nov 7, 13:40 AWST
Monitoring - HPE rebooted a number of Slingshot switches during maintenance.

We haven't observed any Slingshot errors on the login, data mover or visualisation nodes for 48 hours.

We will continue to monitor.

Nov 6, 12:29 AWST
Update - HPE have provided no new information.
Oct 31, 08:11 AWST
Update - HPE have provided no new information.
Oct 27, 10:59 AWST
Update - HPE have provided no new information.
Oct 24, 21:01 AWST
Update - HPE have provided no new information.

setonix-08 has Slingshot issues. Pawsey is rebooting it.

Oct 20, 13:25 AWST
Update - setonix-02 and setonix-03 have been added back to the RR DNS.
Oct 16, 14:09 AWST
Investigating - There appears to be an issue with the Slingshot interfaces in the login nodes in Setonix. We appear to be down to 1 login node in the normal pool of login nodes.

We have had a case open with HPE for weeks, but they appear to be no closer to providing any kind of solution.

Please, please, please, please don't run any computationally intensive operations on the login nodes. We have lovely compute nodes for that.

Please be aware that you can log into setonix-workflow.pawsey.org.au and get access to additional "workflow" nodes.

Oct 16, 12:02 AWST
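
To illustrate the request above to keep heavy work off the login nodes, here is a minimal Python sketch that wraps a command in a Slurm batch script so it runs on compute nodes instead. The `work` partition name comes from the component list on this page; the account `project123`, the command `./my_analysis`, and the helper names are hypothetical placeholders. The `#SBATCH` directives and `srun` are standard Slurm, but the right values for a given project are an assumption to check against Pawsey's documentation.

```python
import subprocess


def build_batch_script(command: str, account: str, partition: str = "work",
                       ntasks: int = 1, walltime: str = "01:00:00") -> str:
    """Return the text of a Slurm batch script that runs command via srun."""
    lines = [
        "#!/bin/bash",
        f"#SBATCH --account={account}",      # hypothetical project account
        f"#SBATCH --partition={partition}",  # 'work' is listed on this page
        f"#SBATCH --ntasks={ntasks}",
        f"#SBATCH --time={walltime}",
        f"srun {command}",
    ]
    return "\n".join(lines) + "\n"


def submit(script: str) -> str:
    """Submit the script by piping it to sbatch on stdin.

    Only works on a system where Slurm's sbatch is installed.
    """
    result = subprocess.run(["sbatch"], input=script, text=True,
                            capture_output=True, check=True)
    return result.stdout.strip()


# Build (but do not submit) a script for a placeholder command.
script = build_batch_script("./my_analysis", account="project123")
print(script)
```

Calling `submit(script)` from a login node would hand the work to the scheduler, so the login node only does the lightweight submission step.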