Our team has analyzed the outage on 11/2/2022 and determined the root cause. We have also determined and implemented a solution to prevent this occurrence in the future.
IncentFit uses AWS’s auto-scaling architecture to dynamically scale our systems based on user demand on our site and application. On 11/2 we had an extremely brief, but high spike in traffic and the system that automatically scales the servers did not properly handle it. Our team responded within minutes to the problem; unfortunately it took about 2 hours to fully diagnose the problem and an additional hour to fix it. Our Product team has already implemented a solution to the automatic scaling logic so that if this type of activity were to happen in the future, it would not affect user experience.