Pawsey Supercomputing Research Centre

Minor Service Outage

Setonix Operational
Login nodes Operational
Data-mover nodes Operational
Slurm scheduler Operational
Setonix work partition Operational
Setonix debug partition Operational
Setonix long partition Operational
Setonix copy partition Operational
Setonix askaprt partition Operational
Setonix highmem partition Operational
Setonix gpu partition Operational
Setonix gpu high mem partition Operational
Setonix gpu debug partition Operational
Lustre filesystems Operational
/scratch filesystem Operational
/software filesystem Operational
/askapbuffer filesystem Operational
/askapingest filesystem Operational
Storage Systems Partial Outage
Acacia - Projects Partial Outage
Banksia Operational
Data Portal Systems Operational
MWA Nodes Operational
CASDA Nodes Operational
Acacia - Ingest Operational
MWA ASVO Operational
ASKAP Operational
ASKAP ingest nodes Operational
ASKAP service nodes Operational
Central Services Operational
Authentication and Authorization Operational
Service Desk Operational
License Server Operational
Application Portal Operational
Origin Operational
/home filesystem Operational
/pawsey filesystem Operational
Central Slurm Database Operational
Documentation Operational
Visualisation Services Operational
Remote Vis Operational
Vis scheduler Operational
Setonix vis nodes Operational
Nebula vis nodes Operational
Visualisation Lab Operational
Reservation Operational
CARTA - Stable Operational
CARTA - Test Operational
Pawsey Remote VR Operational
The Australian Biocommons Operational
Fgenesh++ Operational
Nimbus - Legacy Operational
Ceph storage Operational
Nimbus instances Operational
Nimbus dashboard Operational
Nimbus APIs Operational
Jun 8, 2025

No incidents reported today.

Jun 7, 2025

No incidents reported.

Jun 6, 2025

No incidents reported.

Jun 5, 2025

No incidents reported.

Jun 4, 2025

No incidents reported.

Jun 3, 2025

No incidents reported.

Jun 2, 2025

No incidents reported.

Jun 1, 2025

No incidents reported.

May 31, 2025

No incidents reported.

May 30, 2025

No incidents reported.

May 29, 2025
Resolved - This incident has been resolved.
May 29, 12:48 AWST
Monitoring - Firewall rules have been reinstated, and all daemons are responding correctly. The service is restored to operation.

The wrong cluster was initially identified: this incident affected the Acacia Ingest cluster, not Acacia Projects. We apologise for the confusion.

May 29, 12:17 AWST
Update - We are continuing to work on a fix for this issue.
May 29, 12:07 AWST
Identified - Firewall rules that were inadvertently changed stopped the machines from communicating with each other. We are fixing the rules.
May 29, 12:06 AWST
Investigating - Storage daemons on a number of Acacia Ingest machines are not operating properly. Some read and write operations still succeed; others fail if they are directed toward those machines.

We are investigating the cause.

May 29, 11:51 AWST

May 28, 2025

No incidents reported.

May 27, 2025

No incidents reported.

May 26, 2025

No incidents reported.

May 25, 2025

No incidents reported.