Google Cloud MAJOR
Increased VM failure rates in a subset of Google Cloud zones
September 17, 2021 · 04:00 PM UTC – 07:25 PM UTC · Duration: 3h 25min
Affected Services
Google Compute Engine
Timeline
12:33 AM
Incident Start: 17 September 2021 08:00
Incident End: 17 September 2021 11:25
Duration: 3 hours, 25 minutes
Affected Services and Features:
Google Compute Engine
Regions/Zones: us-central1-f
Description:
Google Compute Engine reported elevated instance failures. Preliminary analysis initially pointed to an issue with a node software rollout, but this was subsequently ruled out. Due to the potential impact, we proactively notified customers on the Cloud Status Dashboard. However, further analysis concluded that the error rates were negligible and not a cause for concern.
Customer Impact:
Analysis determined that this incident had no customer impact.
Additional details:
We are continuing to enhance our detection mechanisms to avoid false positives.
10:20 PM
The issue with Google Compute Engine is believed to be affecting a very small number of customers, and our Engineering Team is working on it.
If you have questions or are impacted, please open a case with the Support Team and we will work with you until this issue is resolved.
No further updates will be provided here.
Thank you for your patience while we work to resolve the issue.
08:00 PM
Summary: Increased VM failure rates in a subset of Google Cloud zones
Description: Mitigation work is still underway with our engineering team.
We do not have an ETA for mitigation at this point.
As a workaround, customers can use alternative zones.
We will provide more information by Friday, 2021-09-17 14:15 US/Pacific.
Diagnosis: Customers may be experiencing higher VM failure rates.
Workaround: Use alternative zones.
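The workaround above can be sketched as a small zone-selection helper. This is a minimal illustration under stated assumptions, not an official Google Cloud procedure: the names `region_of` and `pick_alternative_zones` and the candidate zone list are hypothetical, and the affected set is only a sample copied from the zone list in the 03:39 PM update. The sketch filters zone names only and makes no API calls.

```python
# Sample of affected zones copied from the 03:39 PM update (not the full list).
AFFECTED_ZONES = {
    "us-central1-c",
    "us-central1-f",
    "us-west2-a",
    "us-west2-b",
    "us-west2-c",
    "europe-west1-c",
}


def region_of(zone: str) -> str:
    """Return the region part of a zone name, e.g. 'us-central1-f' -> 'us-central1'."""
    return zone.rsplit("-", 1)[0]


def pick_alternative_zones(current_zone, candidate_zones, affected=AFFECTED_ZONES):
    """Return candidates outside the affected set, same-region zones first."""
    healthy = [z for z in candidate_zones if z not in affected]
    region = region_of(current_zone)
    # sorted() is stable, so zones keep their input order within each group.
    return sorted(healthy, key=lambda z: region_of(z) != region)


# Hypothetical candidate list; a real one would come from your own configuration.
candidates = ["us-central1-a", "us-central1-c", "us-east1-b", "us-central1-b"]
print(pick_alternative_zones("us-central1-f", candidates))
```

Preferring zones in the same region is a design assumption intended to keep latency and data locality similar; workloads with other constraints may want a different ordering.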
06:18 PM
Summary: Increased VM failure rates in a subset of Google Cloud zones
Description: Mitigation work is still underway with our engineering team.
We do not have an ETA for mitigation at this point.
We will provide more information by Friday, 2021-09-17 12:15 US/Pacific.
Diagnosis: Customers may be experiencing higher VM failure rates.
Workaround: None at this time.
04:52 PM
Summary: Increased VM failure rates in a subset of Google Cloud zones
Description: Mitigation work is still underway with our engineering team.
We do not have an ETA for mitigation at this point.
We will provide more information by Friday, 2021-09-17 10:15 US/Pacific.
Diagnosis: Customers may be experiencing higher VM failure rates.
Workaround: None at this time.
04:06 PM
Summary: Increased VM failure rates in a subset of Google Cloud zones
Description: Mitigation work is currently underway with our engineering team.
We do not have an ETA for mitigation at this point.
We will provide more information by Friday, 2021-09-17 09:00 US/Pacific.
Diagnosis: Customers may be experiencing higher VM failure rates.
Workaround: None at this time.
03:39 PM
Summary: Increased VM failure rates in a subset of Google Cloud zones
Description: We are experiencing an intermittent issue with Google Compute Engine in the following zones:
asia-east1-c
asia-east2-c
asia-northeast1-c
asia-northeast2-c
asia-northeast3-c
asia-south1-c
asia-south2-c
asia-southeast1-c
asia-southeast2-c
australia-southeast1-c
australia-southeast2-c
europe-central2-c
europe-north1-c
europe-west1-c
europe-west2-c
europe-west3-c
europe-west4-c
europe-west6-c
northamerica-northeast1-c
northamerica-northeast2-c
southamerica-east1-c
us-central1-c
us-central1-f
us-east1-c
us-east4-c
us-west1-c
us-west2-a
us-west2-b
us-west2-c
us-west3-c
us-west4-c
Our engineering team continues to investigate the issue.
We will provide an update by Friday, 2021-09-17 08:10 US/Pacific with current details.
Diagnosis: Customers may be experiencing higher VM failure rates.
Workaround: None at this time.