NaN in gradients #6

Closed · SebastianM-C opened this issue Apr 6, 2024 · 15 comments · Fixed by #8

@SebastianM-C (Collaborator)

When working locally on this I initially encountered an issue where the gradient would always be NaN, which I think is what's causing #5. I enabled NaN-safe mode and that seemed to fix the issue. Is that a bug, or should we just document this?

Also, what's the best way of setting this up in CI?
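
(For reference, a minimal sketch of enabling ForwardDiff's NaN-safe mode via Preferences.jl, assuming the "nansafe_mode" preference key from the ForwardDiff docs; the resulting LocalPreferences.toml can be committed so CI picks it up as well:)

using Preferences, ForwardDiff

# Writes a LocalPreferences.toml entry; restart Julia for it to take effect.
Preferences.set_preferences!(ForwardDiff, "nansafe_mode" => true)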

@ChrisRackauckas (Member)

What do you mean by NaN safe mode?

@SebastianM-C (Collaborator, Author)

@ChrisRackauckas (Member)

You shouldn't need that here.

@ChrisRackauckas (Member)

Why do you get a NaN in the first place?

@SebastianM-C added the bug (Something isn't working) label Apr 7, 2024
@SebastianM-C (Collaborator, Author)

SebastianM-C commented Apr 7, 2024

It looks like the problem comes from the loss computation:

using ForwardDiff
using OrdinaryDiffEq
using SciMLStructures: SciMLStructures, Tunable

# seed the tunable parameters with dual numbers carrying a single partial
x0′ = ForwardDiff.Dual{:tag}.(x0, 1)

test_p = SciMLStructures.replace(Tunable(), prob.p, x0′)
test_prob = remake(prob, p = test_p)

test_sol = solve(test_prob, Rodas4(autodiff = false), saveat = sol_ref.t)
sum(sqrt.(abs2.(get_vars(test_sol, 1) .- get_refs(sol_ref, 1))))

gives

Dual{:tag}(0.0,NaN)

I can also see where the NaNs appear if I log from inside the loss loop:

for i in eachindex(new_sol.u)
    loss += sum(sqrt.(abs2.(get_vars(new_sol, i) .- get_refs(sol_ref, i))))
    if any(isnan.(ForwardDiff.partials(loss)))
        @info i
    end
end

@ChrisRackauckas (Member)

What's the first spot of NaN?

@SebastianM-C (Collaborator, Author)

It's due to sqrt.

@ChrisRackauckas (Member)

What are the values?

@SebastianM-C (Collaborator, Author)

julia> sqrt.(abs2.(get_vars(test_sol, 1) .- get_refs(sol_ref, 1)))
2-element Vector{ForwardDiff.Dual{:tag, Float64, 1}}:
 Dual{:tag}(0.0,NaN)
 Dual{:tag}(0.0,NaN)

@ChrisRackauckas (Member)

Yes but what are the values that go in?

@SebastianM-C (Collaborator, Author)

julia> get_vars(test_sol, 1)
2-element Vector{ForwardDiff.Dual{:tag, Float64, 1}}:
 Dual{:tag}(3.1,0.0)
 Dual{:tag}(1.5,0.0)

julia> get_refs(sol_ref, 1)
2-element Vector{Float64}:
 3.1
 1.5

Hmm, let me check why they are the same 🤔

@SebastianM-C (Collaborator, Author)

Ah, it's because we start with the same initial conditions.

sum(sqrt.(abs2.(get_vars(test_sol, 2) .- get_refs(sol_ref, 2))))

gives Dual{:tag}(0.2685941909005718,0.08984350230039442)

@ChrisRackauckas (Member)

ChrisRackauckas commented Apr 7, 2024

Yeah the gradient at zero is NaN for sqrt. That seems like a loss function issue.
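
(A standalone illustration, with an arbitrary value, of why a zero residual under sqrt ∘ abs2 produces a NaN partial: the chain rule divides a zero inner derivative by a zero sqrt value.)

julia> using ForwardDiff

julia> ForwardDiff.derivative(x -> sqrt(abs2(x - 3.1)), 3.1)  # residual is exactly zero here
NaN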

@SebastianM-C (Collaborator, Author)

SebastianM-C commented Apr 7, 2024

I started with the same initial conditions as in https://docs.sciml.ai/Overview/stable/showcase/missing_physics/, which means that at the very first time point the residual is exactly 0, so its sqrt gets a NaN partial, and that NaN ends up poisoning the whole loss.
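
(One possible way around it, sketched here with the same get_vars/get_refs helpers as above and not necessarily what #8 ends up doing, is to accumulate abs2 directly and skip the per-point sqrt, so the partials stay finite when a residual is exactly zero:)

loss = 0.0
for i in eachindex(new_sol.u)
    # abs2 is smooth at zero, so the dual partials stay finite even when
    # the solution and the reference coincide at a time point
    loss += sum(abs2.(get_vars(new_sol, i) .- get_refs(sol_ref, i)))
end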

@SebastianM-C (Collaborator, Author)

So not a bug, but we should document this.

@SebastianM-C removed the bug (Something isn't working) label Apr 7, 2024
@SebastianM-C mentioned this issue Apr 7, 2024