Site shows 504 errors

Incident Report for IncentFit

Postmortem

~2:30PM: A new release is deployed. It had many different changes to various bugs throughout the system, including some updates to a new feature.

~2:55pm: Release succeeds.

3:00pm: Operations team begins to notice general slowness throughout site.

3:04pm: Issue is escalated, 502's begin appearing on webapp

~3:10pm: Initial investigation determines that a bad database update was made. Problematic entry is removed servers are restarted.

~3:15pm The 502 Errors continue.

3:38pm: It is determined that a new piece of code that looks up data for the aforementioned new feature is slowing down the site.

3:40pm: New Deploy is sent. Some site functionality is restored.

4:16pm: We determine that some of this bad code is still live running in production.

4:18pm: We stop all processes for this code on all servers.

4:24pm: Deploy is successful

Posted Feb 14, 2023 - 17:19 EST

Resolved

This incident has been resolved.
Posted Feb 14, 2023 - 16:49 EST

Monitoring

A fix has been implemented and we are monitoring the results.
Posted Feb 14, 2023 - 16:04 EST

Identified

We've identified the issue and are deploying a fix.-
Posted Feb 14, 2023 - 15:41 EST
This incident affected: Web Application, Mobile Application, and API.