Pawsey Scheduled Maintenance (July)
Scheduled Maintenance Report for Pawsey Supercomputing Research Centre
Completed
Scheduled maintenance has been completed.

All Supercomputing, networking and Cloud work was successful. Banksia updates were cancelled.

An extended outage is planned in August.

Important Notice: Our usual first-Tuesday maintenance in August is cancelled. Instead, a prolonged downtime is scheduled for August 9th - 14th for significant upgrades.

During the shutdown, HPE will upgrade Setonix's management system, bringing several benefits to you and your research:
• Smoother updates: Moving forward system updates and patches will happen during regular maintenance, minimising disruptions to your research.
• Increased uptime: HPE reports improved system stability with the upgraded management system, meaning more time for your research.
• Enhanced Scratch filesystem: The upgrade will facilitate initiatives designed to improve the performance and stability of the Scratch filesystem.

What this means for you:
• Setonix and Garrawarla Downtime between August 9th - 14th

Why are we doing this?
This upgrade is planned to improve Setonix services, ensuring we continue to deliver the best possible experience for your research. We appreciate your patience and support as we implement these changes.
Posted Jul 02, 2024 - 16:49 AWST
In progress
Scheduled maintenance is currently in progress. We will provide updates as necessary.
Posted Jul 02, 2024 - 07:00 AWST
Scheduled
Maintenance will be carried out on Pawsey systems on Tuesday the 2nd July to apply required patches and updates to improve the systems stability, security, and performance. This maintenance window will also be used to undertake other tasks which require down-time to achieve. Due to the firewall testing all services will be at risk during the window, including Acacia which isn’t having any specific work done on it.

Planned work for this window includes:
• Setonix will have a replacement to some power equipment in one of the racks.
• The services team have multiple improvements to the installed software including AMBER 24 with GPU support, improvements to multinode Fluent support and improvements to Spack recipes for MWA software.
• Banksia will upgrade ScoutAM and the Tape Library Firmware
• Nimbus will rebuild the compute nodes to new version.
• Garrawarla has an update of firmware on node management controllers.
• ASKAP Ingest Cluster has an update to firmware on nodes and node management controllers.
• Networking will perform a Firewall Failover test
• There will be patching of core Pawsey services

We expect to be able to bring all services back by the end of the day.
Posted Jun 25, 2024 - 13:58 AWST
This scheduled maintenance affected: ASKAP (ASKAP ingest nodes, ASKAP service nodes), Central Services (Authentication and Authorization, Service Desk, License Server, Application Portal, Origin, /home filesystem, /pawsey filesystem, Central Slurm Database, Documentation), Nimbus (Ceph storage, Nimbus instances, Nimbus dashboard, Nimbus APIs), Garrawarla (Garrawarla workq partition, Garrawarla gpuq partition, Garrawarla asvoq partition, Garrawarla copyq partition, Garrawarla login node, Slurm Controller (Garrawarla)), The Australian Biocommons (Fgenesh++), Storage Systems (Acacia - Projects, Banksia, Data Portal Systems, MWA Nodes, CASDA Nodes, Acacia - Ingest, MWA ASVO), Lustre filesystems (/scratch filesystem (new), /software filesystem, /askapbuffer filesystem, /askapingest filesystem), Setonix (Login nodes, Data-mover nodes, Slurm scheduler, Setonix work partition, Setonix debug partition, Setonix long partition, Setonix copy partition, Setonix askaprt partition, Setonix highmem partition, Setonix gpu partition, Setonix gpu high mem partition, Setonix gpu debug partition), and Visualisation Services (Remote Vis, Vis scheduler, Setonix vis nodes, Nebula vis nodes, Visualisation Lab, Reservation, CARTA - Stable, CARTA - Test, Pawsey Remote VR).