How to avoid CUDA Our of Memory Error when deploying Rhea-72b-v0.5 or other big LLMS on Azure AI Studio
I am using Azure AI Studio and would like to deploy this model davidkim205-rhea-72b-v0.5
.
I am using Azure AI Studio and would like to deploy this model davidkim205-rhea-72b-v0.5
.