Pawsey Supercomputing Research Centre
All Systems Operational
Setonix Operational
Login nodes ? Operational
Data-mover nodes ? Operational
Slurm scheduler ? Operational
Setonix work partition Operational
Setonix debug partition Operational
Setonix long partition Operational
Setonix copy partition Operational
Setonix askaprt partition Operational
Setonix highmem partition Operational
Setonix gpu partition Operational
Setonix gpu high mem partition Operational
Setonix gpu debug partition Operational
Lustre filesystems Operational
/scratch filesystem (new) ? Operational
/software filesystem ? Operational
/askapbuffer filesystem ? Operational
/askapingest filesystem ? Operational
Storage Systems Operational
Acacia - Projects ? Operational
Banksia ? Operational
Data Portal Systems ? Operational
MWA Nodes Operational
CASDA Nodes Operational
Acacia - Ingest ? Operational
MWA ASVO ? Operational
ASKAP Operational
ASKAP ingest nodes ? Operational
ASKAP service nodes Operational
Garrawarla Operational
Garrawarla workq partition ? Operational
Garrawarla gpuq partition ? Operational
Garrawarla asvoq partition ? Operational
Garrawarla copyq partition ? Operational
Garrawarla login node Operational
Slurm Controller (Garrawarla) Operational
Nimbus Operational
Ceph storage ? Operational
Nimbus instances ? Operational
Nimbus dashboard ? Operational
Nimbus APIs ? Operational
Central Services Operational
Authentication and Authorization ? Operational
Service Desk Operational
License Server Operational
Application Portal ? Operational
Origin ? Operational
/home filesystem Operational
/pawsey filesystem Operational
Central Slurm Database ? Operational
Documentation ? Operational
Visualisation Services Operational
Remote Vis ? Operational
Vis scheduler ? Operational
Setonix vis nodes ? Operational
Nebula vis nodes ? Operational
Visualisation Lab Operational
Reservation ? Operational
CARTA - Stable ? Operational
CARTA - Test ? Operational
Pawsey Remote VR Operational
The Australian Biocommons Operational
Fgenesh++ ? Operational
Operational
Degraded Performance
Partial Outage
Major Outage
Maintenance
Scheduled Maintenance
Pawsey Scheduled Maintenance (December) Dec 3, 2024 05:00-20:00 AWST
Maintenance will be carried out on Pawsey systems on Tuesday the 3rd December to apply required patches and updates to improve the systems stability, security, and performance. This maintenance window will also be used to undertake other tasks which require down-time to achieve.

The day before maintenance the firewall will be updated to patch a security issue. Due to its high availability design, this update should be transparent to researchers.

Planned work for this window includes:
• HPE will be shutting down the Slingshot fabric to integrate four additional switches into the fabric.
• HPE will updating the Cabinet Controllers, Node Controllers and BIOS on CPU and GPU nodes to the latest supported version.
• Banksia will have a ScoutAM upgrade.
• Spectralogic drive firmware upgrade.
• Spectralogic Library Control Module (LCM) replacement.
• Patching of visualisation services will be undertaken.
• Patching of core Pawsey services will be undertaken.

When Setonix is returned to service:
• the gpu-dev partition will contain 10 nodes with a limit of 1 running job and 4 queued jobs per user (and 2 nodes per job).
• the slurm module will no longer be loaded by default, as all the environment variables it set are already in the pawsey module.
• the slurm controller configuration will be tweaked to weight the size of the job (Pawsey will monitor this change and determine the final weightings in the new year).

We expect to be able to bring all services back by the end of the day. If you have any questions, please contact help@pawsey.org.au.

Posted on Nov 25, 2024 - 15:51 AWST
Allocated Cores (Setonix)
Fetching
Allocated Nodes (Setonix work partition)
Fetching
Allocated nodes (Setonix askaprt partition) ?
Fetching
Allocated nodes (Garrawarla workq partition)
Fetching
Active Instances (Nimbus)
Fetching
Active Cores (Nimbus)
Fetching
Past Incidents
Dec 2, 2024

No incidents reported today.

Dec 1, 2024

No incidents reported.

Nov 30, 2024

No incidents reported.

Nov 29, 2024

No incidents reported.

Nov 28, 2024

No incidents reported.

Nov 27, 2024

No incidents reported.

Nov 26, 2024

No incidents reported.

Nov 25, 2024

No incidents reported.

Nov 24, 2024

No incidents reported.

Nov 23, 2024

No incidents reported.

Nov 22, 2024

No incidents reported.

Nov 21, 2024

No incidents reported.

Nov 20, 2024

No incidents reported.

Nov 19, 2024

No incidents reported.

Nov 18, 2024

No incidents reported.