Skip to content

Latest commit

 

History

History
42 lines (27 loc) · 1.29 KB

changelog.md

File metadata and controls

42 lines (27 loc) · 1.29 KB

Status & Outages

Outages

Current

Upcoming

When Duration What
Oct 21 2024 10 hrs (planned) Upgrade to all S3DF Weka clusters. We do NOT anticipate service interruptions.

Past

When Duration What
Oct 3 2024 1.5 hrs (unplanned) Storage issue impacted home directory access and SSH logins
Jul 10 2024 4 days (planned) Urgent electrical maintenance is required in SRCF datacenter
Jun 26 2023 5 days (planned) Everything down due to power outage
Jan 15 2023 2 days (unplanned) Fix: one weka server rebooted. Underlying issue under investigation. Symptom: sdfdata hanging on several nodes.

Monitoring

Grafana

Ganglia

Nagios

Roadmap :id=roadmap

Please see our Technology Migration Timeline (Select the TIMELINE tab)

Slurm Dashboard

sdf-slurm-summary