2

I can no longer view projects, pipelines or create new note books. The complaint appears to be credential related.

Failed to launch app [None]. Error when retrieving credentials from container-role: Error retrieving metadata: Received non 200 response (400) from ECS metadata: ResourceNotFoundException

Has anyone run into this before? Nothing should have changed in the environment.

enter image description here

Additionally, when I view my user in SageMaker console, it doesn't have an arn listed under the Execution Role section: enter image description here

Editing the user and re-saving it doesn't populate the role either. The role exists in IAM and it appears in the wizard when I edit the user details: enter image description here

Ray Hogan
  • 319
  • 3
  • 12
  • Do you have access to the sagemaker service? You could try to restart the server. Also you can click in file -> exit -> suspend all application and server and enter again, that would restart the instance of the server. – Jose Jun 20 '23 at 16:01
  • I have access to the SageMaker console in AWS if that's what you mean? Where can I restart the service from within there? In SageMaker Studio when I go to File-Shutdown it throws a similar error "Error message: Missing credentials in config, if using AWS_CONFIG_FILE, set AWS_SDK_LOAD_CONFIG=1. Args: undefined" – Ray Hogan Jun 20 '23 at 16:04
  • Yes, I meant the console in AWS. I think you can do it in Domains -> User Profile, and if you click in the user you can check all the instances that are running and you can try to restart them or delete them (it won't delete your data, only the instance running). – Jose Jun 20 '23 at 19:16

2 Answers2

3

Creating a more intuitive UI based solution:

  1. go to sage maker: Go to sage maker

  2. go to the affected domain: enter image description here

  3. go to the affected user profile enter image description here

  4. delete this. enter image description here

then relaunch studio as you normally would. If you deleted the right server it should say that it's creating a new one. Your files will remain.

Warlax56
  • 1,170
  • 5
  • 30
2

We resolved this issue by deleting the JupyterServer from the user profile and then re-launching Studio which will automatically create a new one (user data isn't affected as it is stored in EFS). One caveat is that we were unable to use the UI to delete the JupyterServer. The delete option was there but nothing happened when clicked, instead we had to use the CLI to remove it.

This had to be done on every user profile.

Ray Hogan
  • 319
  • 3
  • 12