Resolved: 2025-02-07: KVM@TACC issues affecting instance launch

As of 11am central time, we’re announcing a sev2 outage affecting KVM@TACC.

Users are sometimes receiving the message “No valid host was found. There are not enough hosts available.” when attempting to launch an instance.

We’re investigating, and suspect this is related to an error with the rabbitmq message bus which occurred around 1am,

1 Like

This was resolved around 1pm the same day.
https://chameleoncloud.org/user/outages/kvmtacc-issues-launching-instances/

The issue turned out to be that some compute nodes had stale information about the message queue, after said queue had been converted from a single instance to a clustered configuration.

1 Like