I've read the Google team's implementation too, and they use gamma normalization to prevent NaNs during training.
When I use your code, the first several epochs are fine, but then torch.autograd detects an anomaly during training. I believe this comes from the RG-LRU. Is there any way to avoid the NaNs, or should we use some form of normalization?
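For reference, here is a minimal PyTorch sketch of what that gamma normalization could look like, assuming the RG-LRU recurrence from the Griffin paper, h_t = a_t * h_{t-1} + sqrt(1 - a_t^2) * x_t. The `SqrtBoundDerivative` name and the clamp value are illustrative, modeled on the gradient-clipped square root in Google's code; the bare `sqrt` has an unbounded gradient as its argument approaches 0 (which happens when a_t is near 1), and that is a plausible source of the autograd anomaly:

```python
import torch

class SqrtBoundDerivative(torch.autograd.Function):
    """Square root whose backward pass clips the gradient near zero.

    d/dx sqrt(x) = 1 / (2 * sqrt(x)) blows up as x -> 0, which is where
    the NaNs tend to appear; clamping the denominator keeps it finite.
    """

    @staticmethod
    def forward(ctx, x):
        ctx.save_for_backward(x)
        return torch.sqrt(x)

    @staticmethod
    def backward(ctx, grad_output):
        (x,) = ctx.saved_tensors
        # The 1e-6 floor is an arbitrary choice for this sketch.
        return grad_output / (2.0 * torch.sqrt(x.clamp(min=1e-6)))


def rglru_step(h_prev, x_t, log_a_t):
    """One hypothetical RG-LRU step with gamma normalization.

    h_t = a_t * h_{t-1} + sqrt(1 - a_t^2) * x_t
    Working with log_a_t (<= 0) keeps a_t in (0, 1] and lets us form
    1 - a_t^2 as 1 - exp(2 * log_a_t) in a numerically stable way.
    """
    a_t = torch.exp(log_a_t)
    gamma = SqrtBoundDerivative.apply(1.0 - torch.exp(2.0 * log_a_t))
    return a_t * h_prev + gamma * x_t
```

The point of the sqrt(1 - a_t^2) factor is that a_t^2 + (1 - a_t^2) = 1, so if the input has unit variance the hidden state's variance stays bounded no matter how close a_t gets to 1.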