We are running an ECS Cluster with a service running 2 tasks.
At 10:05 the service stopped showing any metrics.
At 13:45 I forced the service to restart and at that point the service gave metrics again.
- All tasks stayed in a running state
- All tasks stayed healthy
- Logs keep going during the “outages”
- I couldn’t access the app but the logs show, some people were able to get in the app
- Tasks were stable since the day before.
Does anyone have an idea why this could have happened?