Auto3dSeg Swin-Unet Shared Memory issues #1105
Unanswered
peterhessey
asked this question in
Q&A
Replies: 2 comments 5 replies
-
if this is within a docker container, perhaps you can set |
Beta Was this translation helpful? Give feedback.
5 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
When trying to run the
AutoRunner
in AzureML I am running into lack of shared memory errors during the Swin-Unet training stage. The node has 448GB of shared memory which I thought would be more than sufficient, especially as I am only using a single image per batch. Interestingly I don't run into this error when running on my local cluster with the same GPUs + RAM available.Any ideas on the causes and potential fixes for this issue?
Stack trace for reference:
Beta Was this translation helpful? Give feedback.
All reactions