How to Scale Spring Boot Application in GKE Based on Current HTTP Requests
I have deployed a Spring Boot JAR application in Google Kubernetes Engine (GKE). I need to scale it based on the number of HTTP requests. Here are my constraints and requirements:
Scale up Springboot jar container with http request
I have deployed a spring boot jar application in GKE. I want it should scale on the basis of http request. I cannot use HPA based on CPU and Memory because there is very less CPU usage and My application cannot release the heap memory. Currently we are using a group of VM instance which run these jars and they scale up and down on the basis of http request. Also I want to scale on the basis of current request not pending requests. 1 Pod can take only 10 http request and pass the other request to other pod If there is not enough pods it should scale the pod. I don’t have any Idea of Managed promemtheus also I don’t want it should cost me a lot of money. I want minimal cost setup. So is there any tool or way available in GCP so that I can Implement this.