I have a semantic segmentation model which I deployed on ml.m4.xlarge I am using invoke_endpoint from inside an AWS Lambda function using the following bit of code.
with open('\tmp\image.jpg', 'rb') as imfile:
imbytes = imfile.read()
response = runtime.invoke_endpoint(EndpointName = 'xyx', ContentType = 'image/jpeg',
Body = imbytes)
This is when I get the error as mentioned above
Your invocation timed out while waiting for a response from container primary
Does it mean my datapoint is reaching the model endpoint but it's taking too long to do the inference or is my data not even transferring over to the endpoint?