Tag Archive for amazon-sagemaker

Serverless SageMaker inference endpoint failed

I am trying to deploy a SageMaker model using a serverless endpoint. The model is stored in ECR, and I created the model and endpoint configuration successfully. However, when I try to deploy the serverless endpoint, I encounter the following error in CloudWatch:

SageMaker custom model metrics from eval pipeline step

I’ve created a SageMaker pipeline with a register model step, shown below. model_metrics is produced by an eval step. It’s based on the SageMaker tutorials, but all the examples I can find use an SKLearn estimator, whereas my model is PyTorch-based (it’s a sentence-transformers model evaluated using their information retrieval evaluator).
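For context, the tutorials referred to above usually have the eval step write an evaluation.json file, which the register step then points at through ModelMetrics and MetricsSource from sagemaker.model_metrics. The following is only a rough sketch of how the eval script might write that file for retrieval scores, not the asker's actual code. The function names, the "information_retrieval_metrics" group key, and the metric names in the usage note are all hypothetical, and the nested {"value": ...} layout is borrowed from the tutorial reports as an assumption.

```python
import json
import math
from pathlib import Path


def build_evaluation_report(metrics, group="information_retrieval_metrics"):
    """Wrap flat metric scores in a nested {"value": ...} layout.

    `group` is a hypothetical key name chosen for this sketch.
    """
    report = {group: {}}
    for name, score in metrics.items():
        score = float(score)
        # A NaN score usually means the evaluation went wrong, so fail early.
        if math.isnan(score):
            raise ValueError(f"metric {name!r} is NaN")
        report[group][name] = {"value": score}
    return report


def write_evaluation_report(report, output_dir):
    """Write the report as evaluation.json and return the file path."""
    out = Path(output_dir)
    out.mkdir(parents=True, exist_ok=True)
    path = out / "evaluation.json"
    path.write_text(json.dumps(report, indent=2))
    return path
```

Inside the eval script this might be called as write_evaluation_report(build_evaluation_report({"ndcg@10": 0.5}), output_dir), where output_dir is whatever directory the processing step uploads as its output. Nothing here depends on the estimator type, so the same file could be written from a PyTorch evaluation just as from an SKLearn one.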