Pawsey Supercomputing Centre
Investigating - Automated alerting has just indicated a possible high-speed network issue on Galaxy. It looks like one of the blades (containing nid00500 to nid00503) went offline around 6 pm Perth time.

Staff will investigate in the morning, but jobs running at that time may be impacted.
Oct 21, 18:10 AWST
Magnus Operational
Magnus Compute nodes Operational
Magnus login nodes Operational
Slurm Controller (Magnus) Operational
Galaxy Operational
Galaxy Compute nodes Operational
Galaxy login nodes Operational
Slurm Controller (Galaxy) Operational
Topaz Operational
Slurm Controller (Topaz) Operational
GPU partition Operational
Garrawarla Operational
Garrawarla compute nodes Operational
Slurm Controller (Garrawarla) Operational
Zeus Operational
Zeus login node Operational
Zeus Compute nodes Operational
Galaxy ingest nodes Operational
Data Mover nodes (CopyQ) Operational
Slurm Controller (Zeus) Operational
Central Slurm Database Operational
Lustre filesystems Operational (99.96 % uptime over the past 90 days)
/scratch filesystem Operational (100.0 % uptime over the past 90 days)
/group filesystem Operational (100.0 % uptime over the past 90 days)
/astro filesystem Operational (99.86 % uptime over the past 90 days)
/askapbuffer filesystem Operational (99.99 % uptime over the past 90 days)
Nimbus Operational
Ceph storage Operational
Nimbus instances Operational
Nimbus dashboard Operational
Storage Systems Operational
Data Portal Systems Operational
Hierarchical Storage Management Systems Operational
MWA Nodes Operational
CASDA Nodes Operational
Central Services Operational
Authentication and Authorization Operational
Service Desk Operational
License Server Operational
Past Incidents
Nov 25, 2020

No incidents reported today.

Nov 24, 2020

No incidents reported.

Nov 23, 2020

No incidents reported.

Nov 22, 2020

No incidents reported.

Nov 21, 2020

No incidents reported.

Nov 20, 2020

No incidents reported.

Nov 19, 2020

No incidents reported.

Nov 18, 2020

No incidents reported.

Nov 17, 2020

No incidents reported.

Nov 16, 2020

No incidents reported.

Nov 15, 2020

No incidents reported.

Nov 14, 2020

No incidents reported.

Nov 13, 2020

No incidents reported.

Nov 12, 2020

No incidents reported.

Nov 11, 2020

No incidents reported.