University of Cambridge Computer Laboratory - Notice history

100% - uptime

Caelum Console (server management) - Operational

100% - uptime
Jan 2025 · 100.0%Feb · 99.92%Mar · 100.0%
Jan 2025
Feb 2025
Mar 2025

Request Tracker - Operational

100% - uptime
Jan 2025 · 100.0%Feb · 100.0%Mar · 100.0%
Jan 2025
Feb 2025
Mar 2025

Other Internal Services - Operational

100% - uptime
Jan 2025 · 100.0%Feb · 99.98%Mar · 100.0%
Jan 2025
Feb 2025
Mar 2025

External Services - Operational

100% - uptime
Jan 2025 · 100.0%Feb · 100.0%Mar · 100.0%
Jan 2025
Feb 2025
Mar 2025

Network - Operational

100% - uptime
Jan 2025 · 100.0%Feb · 100.0%Mar · 99.95%
Jan 2025
Feb 2025
Mar 2025
100% - uptime

GN09 - Operational

100% - uptime
Jan 2025 · 100.0%Feb · 100.0%Mar · 99.21%
Jan 2025
Feb 2025
Mar 2025

WCDC - Operational

100% - uptime
Jan 2025 · 100.0%Feb · 100.0%Mar · 100.0%
Jan 2025
Feb 2025
Mar 2025
100% - uptime

Main VM Pool (WCDC) - Operational

100% - uptime
Jan 2025 · 100.0%Feb · 100.0%Mar · 100.0%
Jan 2025
Feb 2025
Mar 2025

GPUs - Operational

100% - uptime
Jan 2025 · 100.0%Feb · 100.0%Mar · 100.0%
Jan 2025
Feb 2025
Mar 2025

Secondary VM Hosts - Operational

100% - uptime
Jan 2025 · 100.0%Feb · 100.0%Mar · 100.0%
Jan 2025
Feb 2025
Mar 2025

Xen Orchestra - Operational

100% - uptime
Jan 2025 · 100.0%Feb · 100.0%Mar · 100.0%
Jan 2025
Feb 2025
Mar 2025
100% - uptime

Filer - Operational

100% - uptime
Jan 2025 · 100.0%Feb · 100.0%Mar · 100.0%
Jan 2025
Feb 2025
Mar 2025

Archive Server - Operational

100% - uptime
Jan 2025 · 100.0%Feb · 100.0%Mar · 100.0%
Jan 2025
Feb 2025
Mar 2025

Data Replication - Operational

100% - uptime
Jan 2025 · 100.0%Feb · 100.0%Mar · 100.0%
Jan 2025
Feb 2025
Mar 2025

Other Secondary Storage Systems - Operational

100% - uptime
Jan 2025 · 100.0%Feb · 100.0%Mar · 100.0%
Jan 2025
Feb 2025
Mar 2025
100% - uptime

Third Party: Fastmail → General Availability - Operational

Third Party: Fastmail → Mail delivery - Operational

Third Party: Fastmail → Web client and mobile app - Operational

Third Party: Fastmail → Mail access (IMAP/POP) - Operational

Third Party: Fastmail → Login & sessions - Operational

Third Party: Fastmail → Contacts (CardDAV) - Operational

Notice history

Mar 2025

Chiller fault
  • Resolved
    Resolved

    This incident has been resolved. GN09 is fully operational. Most servers that were previously running have been restarted.

    If you have a physical server that is not running, you may be able to start it yourself via https://console.caelum.cl.cam.ac.uk as usual, or contact service-desk@cst.cam.ac.uk.

    VMs that were not set to start automatically have not been restarted. You can start VMs when you need them via https://xo.cl.cam.ac.uk as usual.

    Contact service-desk@cst.cam.ac.uk if there are any remaining issues.

  • Update
    Update

    Cooling has been restored and is expected to remain stable. The cause of the chiller shutting down was the chilled water circulation pumps stopping for some other reason, which will be investigated next week but which we expect to have been an isolated incident. The chiller still has one alarm present which is not preventing operation but is still being investigated.

    We are taking the opportunity of GN09 being shut down to perform some routine firmware and software updates on network hardware and storage systems, so we will not start turning servers back on quite yet, but expect to be able to do so shortly.

  • Update
    Update

    Progress has been made; the chiller is running again but there is a problem still under investigation. We are hopeful that servers can be turned back on again today, but will await the all-clear from the chiller technician.

  • Update
    Update

    Most servers in GN09 are now off, and must remain off until further notice. The emergency technician has arrived and is investigating.

  • Identified
    Identified

    The William Gates Building's chiller has a fault and has stopped running. Temperatures in our on-site data centre GN09 are rising rapidly. Engineers have been called out but it is likely that we will have to start shutting down servers in order to protect them.

Jan 2025

WCDC power distribution unit replacement
  • Completed
    January 30, 2025 at 5:25 PM
    Completed
    January 30, 2025 at 5:25 PM
    Maintenance has completed successfully.
  • In progress
    January 30, 2025 at 2:30 PM
    In progress
    January 30, 2025 at 2:30 PM
    Maintenance is now in progress
  • Planned
    January 30, 2025 at 2:30 PM
    Planned
    January 30, 2025 at 2:30 PM

    We will be replacing a power distribution unit (PDU) in our core infrastructure rack in the West Cambridge Data Centre, which powers the 1Gbps switches and a small number of other infrastructure systems. No user impact is expected, except for the following cases:

    • User servers tfc-app1, tfc-app2, tfc-app4 will lose networking for approximately half an hour

    • Verex access control management (card access updates etc.) will be unavailable for approximately half an hour

    • Minor delays in authenticating to Active Directory are possible, as one of the three domain controllers (adsrv07) will be turned off for approximately 45 minutes

    • BMC and serial console access to other systems in WCDC will be unavailable for approximately 30 minutes

    One of the two DHCP servers (sxp12) will also be turned off, but the other server should seamlessly handle all DHCP requests.

    This work is not related to the Estates electrical work happening in WCDC on the same day, but we have scheduled our work to take place during the same vulnerable period. Our PDU replacement will not reduce resilience any further.

Mailing lists rejecting email
  • Resolved
    Resolved

    We believe that Mimecast has unblocked us.

    There are some unrelated issues with some mailing lists still under investigation, not connected in any way (as far as we know) with the Mimecast problem; if you experience any more problems please contact service-desk@cst.cam.ac.uk.

  • Identified
    Identified

    We believe that UIS has successfully worked around this issue, and email sent to mailing lists from departmental addresses should now work.

    However, we now also believe that this was a symptom of a broader problem with email to one particular email anti-spam service provider, Mimecast. Email to other institutions which also use Mimecast may also be affected. We are working on getting this resolved.

    If you do encounter the issue, you may be able to get email through successfully by sending from an @cam.ac.uk address.

  • Update
    Update

    As a workaround, messages should get through if you send mail using Outlook from your @cam.ac.uk address to the relevant internal @lists.cam.ac.uk address for the mailing list; if you don't know what that address is for a particular list, contact service-desk@cst.cam.ac.uk.

  • Investigating
    Investigating

    We are aware that UIS's mailing list service is rejecting some email sent from Exchange Online to University mailing lists via cl.cam.ac.uk/cst.cam.ac.uk aliases. We have asked UIS to investigate.

Jan 2025 to Mar 2025

Next