University of Cambridge Computer Laboratory - GPU VM cluster maintenance – Maintenance details

Network experiencing partial outage

GPU VM cluster maintenance

Completed
Scheduled for 22 June, 2025 at 13:00 – 14:50

Affects

Virtual Machine Hosting

Under maintenance from 1:00 PM to 2:50 PM

GPUs

Under maintenance from 1:00 PM to 2:50 PM

Updates
  • Completed
    22 June, 2025 at 14:50
    Completed
    22 June, 2025 at 14:50
    Maintenance has completed successfully.
  • Update
    22 June, 2025 at 14:33
    Update
    22 June, 2025 at 14:33

    Some capacity to run user VMs is now back online. You may try to start your VM again via Xen Orchestra if you need it. If it fails to start, try again after half an hour when there should be more capacity available.

  • Update
    22 June, 2025 at 13:51
    Update
    22 June, 2025 at 13:51

    Storage server maintenance is complete. The shared server dev-gpu-2 is coming back up. VM hypervisor upgrades are now beginning so personal dev-* VMs will remain down.

  • In progress
    22 June, 2025 at 13:00
    In progress
    22 June, 2025 at 13:00
    Maintenance is now in progress
  • Planned
    22 June, 2025 at 13:00
    Planned
    22 June, 2025 at 13:00

    The GPU VM cluster which hosts dev-gpu-* and dev-cpu-* virtual machines, and the associated storage server, requires some urgent software updates and hardware maintenance in order to rectify a couple of known problems. We propose to do this on Sunday; however this could be rescheduled if this would be particularly disruptive (contact mas90 ASAP if so).

    All dev-gpu-* and dev-cpu-* VMs plus the shared servers dev-gpu-1, dev-gpu-2, dev-cpu-1, dev-gpu-acs and dev-cpu-acs must be shut down during this maintenance, as the storage server that holds VM disks and home directories will be unavailable for a short time. Capacity to host VMs will gradually be restored during the maintenance as each VM host is updated and brought back online.