Azure Databricks: Error when scaling from zero

I’ve deployed a serving endpoint for a RAG chain in Databricks, with scale_to_zero_enabled=True. The problem is that scaling up from zero is unreliable: sometimes it works fine, and sometimes it results in an error. Interestingly, despite the exception in the logs, the serving endpoint state never changes to Error; it stays at Ready (Scaling from zero).
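
For context, a setup like this might be expressed roughly as the request body below. This is only a sketch, not the actual configuration from the deployment: the endpoint name, entity name, version, and workload size are hypothetical placeholders, and the field layout (config, served_entities) is an assumption based on the Databricks serving endpoints REST API. Only scale_to_zero_enabled=True comes from the description above.

```python
import json


def build_endpoint_config(endpoint_name, entity_name, entity_version,
                          workload_size="Small", scale_to_zero=True):
    """Build a request body for creating a serving endpoint.

    All argument values are illustrative; the field names are assumed
    to match the Databricks serving endpoints REST API.
    """
    return {
        "name": endpoint_name,
        "config": {
            "served_entities": [
                {
                    "entity_name": entity_name,
                    "entity_version": entity_version,
                    "workload_size": workload_size,
                    # The setting in question: lets the endpoint scale
                    # down to zero replicas when it receives no traffic.
                    "scale_to_zero_enabled": scale_to_zero,
                }
            ]
        },
    }


# Hypothetical names, used only to show the shape of the payload.
payload = build_endpoint_config(
    endpoint_name="rag-chain-endpoint",
    entity_name="main.default.rag_chain",
    entity_version="1",
)
print(json.dumps(payload, indent=2))
```

With scale-to-zero enabled, the first request after an idle period has to wait for the endpoint to start up again, which is the phase where the intermittent error appears.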