I have a large AWS CloudFormation (CF) template that I have worked on for about a year, with hundreds of deploys.
Today I changed the key on a DynamoDB table and the whole stack went into ROLLBACK_COMPLETE. The only way out was to delete the stack and then manually delete a lot of resources as well, including an S3 bucket with loads of data. The account is only used for the dev environment, so no real harm was done, but I really don’t want it to happen again.
Should the CF stack have other parameters for “on fail” or are there other solutions for this?
I still can’t understand why a small change like this could ruin the whole stack…