Pawsey Scheduled Maintenance (May)

Scheduled Maintenance Report for Pawsey Supercomputing Research Centre

Completed

All Pawsey services have been returned to service. Please note that the next maintenance is currently scheduled for the *last* Tuesday in June due to HPE resourcing issues. We will provide further updates in the near future.

Work completed:
• HPE have installed RAID cards in the SU Leader nodes in Setonix.
• The NetApp providing the /home filesystem has had an ONTAP upgrade.
• Banksia has had new batteries installed in the storage controllers.
• Temporary bridge interfaces on Acacia MWA hosts have been removed.
• ASKAP Buffer has had disk firmware updates applied.
• Operating System updates to the SLURM database daemon and SLURM controller for ASKAP Ingest have been applied.
• Visualisation services have been patched.
• Core Pawsey services have been patched.

Thank you to all Pawsey and HPE staff involved.

As always, be kind, and e-mail our friendly help desk staff (help@pawsey.org.au) if you encounter any issues.
Posted May 07, 2025 - 10:22 AWST

Verifying

HPE handed Setonix to Pawsey at 4 AM this morning.

Pawsey have booted Setonix into a testing state and are currently running ReFrame.
Posted May 07, 2025 - 08:46 AWST

Update

Apparently HPE have not "resolved the issue" and refuse to tell Pawsey what the issue is.
Posted May 06, 2025 - 21:31 AWST

Update

HPE have "resolved the issue"; however, they will require at least 4 hours before they can return Setonix to Pawsey.

At this stage, Pawsey estimate that they won't be able to return Setonix to researchers until tomorrow.
Posted May 06, 2025 - 18:00 AWST

Update

We (Pawsey) are still waiting for HPE to hand Setonix back to Pawsey. We have no ETA. We wait.
Posted May 06, 2025 - 16:39 AWST

Update

Core services have been patched.

ASKAP Ingest has been handed back to ASKAP.

HPE are still struggling with the SU Leader nodes. We have no ETA on when Setonix will be handed back to Pawsey.
Posted May 06, 2025 - 14:07 AWST

Update

Banksia is up and staging has resumed.
Posted May 06, 2025 - 10:16 AWST

In progress

Scheduled maintenance is currently in progress. We will provide updates as necessary.
Posted May 06, 2025 - 08:00 AWST

Scheduled

Maintenance will be carried out on Pawsey systems on Tuesday the 6th of May to apply required patches and updates to improve the systems' stability, security, and performance. This maintenance window will also be used to undertake other tasks that require downtime.

Planned work for this window includes:
• HPE will be installing RAID cards in the SU Leader nodes in Setonix. This will allow HPE to provide operations staff with advanced monitoring as part of their HPE Cray EX platform.
• The NetApp providing the /home filesystem will have an ONTAP upgrade.
• Banksia will have new batteries installed in the storage controllers.
• Operating system upgrades for Acacia.
• Remove temporary additional bridge interfaces on Acacia MWA hosts.
• ASKAP Buffer will have disk firmware updates applied.
• Operating System updates to the SLURM database daemon and SLURM controller for ASKAP Ingest.
• Patching of visualisation services will be undertaken.
• Patching of core Pawsey services will be undertaken.

We expect to be able to bring all services back by the end of the day (in the case of Setonix, sometime in the evening). If you have any questions, please contact help@pawsey.org.au.
Posted Apr 29, 2025 - 12:46 AWST
This scheduled maintenance affected: ASKAP (ASKAP ingest nodes, ASKAP service nodes), Central Services (Authentication and Authorization, Service Desk, License Server, Application Portal, Origin, /home filesystem, /pawsey filesystem, Central Slurm Database, Documentation), The Australian Biocommons (Fgenesh++), Storage Systems (Acacia - Projects, Banksia, Data Portal Systems, MWA Nodes, CASDA Nodes, Acacia - Ingest, MWA ASVO), Lustre filesystems (/scratch filesystem, /software filesystem, /askapbuffer filesystem, /askapingest filesystem), Setonix (Login nodes, Data-mover nodes, Slurm scheduler, Setonix work partition, Setonix debug partition, Setonix long partition, Setonix copy partition, Setonix askaprt partition, Setonix highmem partition, Setonix gpu partition, Setonix gpu high mem partition, Setonix gpu debug partition), and Visualisation Services (Remote Vis, Vis scheduler, Setonix vis nodes, Nebula vis nodes, Visualisation Lab, Reservation, CARTA - Stable, CARTA - Test, Pawsey Remote VR).