Questions tagged [amazon-sagemaker]

Amazon SageMaker is a fully-managed AWS service that enables developers and data scientists to quickly and easily build, train, and deploy machine learning models at any scale.

Amazon SageMaker is Amazon's official system for developing machine learning systems in the cloud. It is based on Jupyter notebooks and has the ability to talk with other AWS services in order to explore data, automate processes and conduct various kinds of analysis.

It provides ready-to-use frameworks and algorithms to deal with different use cases and gives the ability to develop fully custom models.

One of the most important capabilities of SageMaker is the ability to deploy models in fully scalable and secure environments.

The only prerequisites for using Amazon SageMaker are an AWS account and an IAM (Identity and Access Management) admin user.


Tagging Recommendation:

Use the tag for all SageMaker-related questions. If it is a question about must also be redirected to this tag as a subset.

For all other SageMaker sub-services, if there is not yet a dedicated tag, it is good to redirect them to SageMaker only if they are closely related.


FAQ:

There are some recurring and important questions that one is faced with when starting to use SageMaker that should not be duplicated:


References


Free SageMaker Programming Books


SageMaker Online Courses


SageMaker Video Tutorials


Official Logo

SageMaker logo

2832 questions
63
votes
9 answers

Load S3 Data into AWS SageMaker Notebook

I've just started to experiment with AWS SageMaker and would like to load data from an S3 bucket into a pandas dataframe in my SageMaker python jupyter notebook for analysis. I could use boto to grab the data from S3, but I'm wondering whether there…
34
votes
1 answer

How to use SageMaker Estimator for model training and saving

The documentations of how to use SageMaker estimators are scattered around, sometimes obsolete, incorrect. Is there a one stop location which gives the comprehensive views of how to use SageMaker SDK Estimator to train and save models?
mon
  • 18,789
  • 22
  • 112
  • 205
26
votes
5 answers

Which is lower cost, Sagemaker or EC2?

For example, ml.p2.8xlarge for training job at ap-northeast on Sagemaker takes 16.408 USD / hour, but p2.8xlarge for on-demand at ap-northeast on Ec2 takes 12.336 USD/hour. Is it cheap to just train the DL models on Ec2 rather than Sagemaker if we…
Yu Sato
  • 391
  • 1
  • 3
  • 7
26
votes
3 answers

The current AWS identity is not a role for sagemaker?

I am getting error when i call get_execution_role() from sagemaker in python. I have attached the error for the same. I have added the SagemakerFullAccess Policy to role and user both.
Karan Nadagoudar
  • 434
  • 1
  • 5
  • 10
23
votes
4 answers

Download an entire folder from AWS sagemaker to laptop

I have a folder with predicted masks on AWS Sagemaker. ( It has 4 folders inside it and lot of files inside those folders. ) I want to download the entire folder to my laptop. This might sound so simple and easy, but I could not find a way to do…
Newbie
  • 530
  • 1
  • 10
  • 21
20
votes
6 answers

Automatically "stop" Sagemaker notebook instance after inactivity?

I have a Sagemaker Jupyter notebook instance that I keep leaving online overnight by mistake, unnecessarily costing money... Is there any way to automatically stop the Sagemaker notebook instance when there is no activity for say, 1 hour? Or would…
17
votes
3 answers

How to fix aws region error "ValueError: Must setup local AWS configuration with a region supported by SageMaker"

I am running sagemaker for the first time from my laptop. When I try to start the session I get this error ValueError: Must setup local AWS configuration with a region supported by SageMaker Local config is set to eu-west-1 which is supported by…
user2957674
  • 171
  • 1
  • 1
  • 5
15
votes
2 answers

How to pass dependency files to sagemaker SKLearnProcessor and use it in Pipeline?

I need to import function from different python scripts, which will used inside preprocessing.py file. I was not able to find a way to pass the dependent files to SKLearnProcessor Object, due to which I am getting ModuleNotFoundError. Code: from…
shaik moeed
  • 5,300
  • 1
  • 18
  • 54
15
votes
1 answer

how to connect private github repo in sagemaker

Does anyone know how to set create the jupyternotebook to connect to the private repo as now it says 'unable to access the repo'?
user5768866
14
votes
1 answer

How to delete a Sagemaker Ground Truth Labeling Job?

How can I delete an Amazon Sagemaker Ground Truth Labeling Job? Can't find that option on the console.
Franco Piccolo
  • 6,845
  • 8
  • 34
  • 52
14
votes
2 answers

Memory error in amazon sagemaker

Memory error occurs in amazon sagemaker when preprocessing 2 gb of data which is stored in s3. No problem in loading the data. Dimension of data is 7 million rows and 64 columns. One hot encoding is also not possible. Doing so results in memory…
VaRun Sabu
  • 143
  • 1
  • 1
  • 5
14
votes
2 answers

Why did Cloudwatch stop logging Sagemaker?

I have a Sagemaker instance running for a while now. I didn't change anything in between, but now I can't see new logs on Cloudwatch anymore. The old logs are still there, but no new ones since 2 days. The Sagemaker instance is still running. It's…
Martin Thoma
  • 124,992
  • 159
  • 614
  • 958
14
votes
5 answers

How to schedule tasks on SageMaker

I have a notebook on SageMaker I would like to run every night. What's the best way to schedule this task. Is there a way to run a bash script and schedule Cron job from SageMaker?
13
votes
4 answers

SageMaker and TensorFlow 2.0

What is the best way to run TensorFlow 2.0 with AWS Sagemeker? As of today (Aug 7th, 2019) AWS does not provide TensorFlow 2.0 SageMaker containers, so my understanding is that I need to build my own. What is the best Base image to use? Example…
Anton
  • 3,587
  • 2
  • 12
  • 27
13
votes
4 answers

How do I print debugging info from Sagemaker training?

I have a jupyter notebook script that just launches a training script, presumably in a docker container. I added some print statements in that training script but it's not showing up in the notebook or CloudWatch. I'm using regular print()…
kane
  • 5,465
  • 6
  • 44
  • 72
1
2 3
99 100