You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I use 8 40G or 64G graphics cards to train, batchsize is set to 1, and then oom will still appear during the training process.
I've seen that most time memory usage during training probably stays around 30G, but at some point it exceeds the memory capacity.
The text was updated successfully, but these errors were encountered:
I use 8 40G or 64G graphics cards to train, batchsize is set to 1, and then oom will still appear during the training process.
I've seen that most time memory usage during training probably stays around 30G, but at some point it exceeds the memory capacity.
The text was updated successfully, but these errors were encountered: