Different order of parameters in MCMC chain vs. MAP result #1322

ElOceanografo · 2020-06-10T19:13:27Z

First off, the new MAP functionality is great--something I've been waiting for a while, thanks!

I was playing around with the example in the docs and noticed that the MAP result places the variables in a different order than what is returned in the MCMC chain:

using Turing, Optim
@model function gdemo(x)
    s ~ InverseGamma(2, 3)
    m ~ Normal(0, sqrt(s))

    for i in eachindex(x)
        x[i] ~ Normal(m, sqrt(s))
    end
end

model = gdemo(randn(10))
map_estimate = optimize(model, MAP())
names(coef(map_estimate))[1] # s, m
chain1 = sample(model, NUTS(), 1000)
names(DataFrame(chain)) # m, s

It looks like they are alphabetized in the chain, but printed in order of definition in the model in the MAP result. This caused me some confusion till I noticed it; I think it would probably be better to have them in the same order...

harryscholes · 2020-06-29T13:20:56Z

This doesn't solve your problem, but the behaviour of DataFrame(chain) has changed in MCMCChains v4.0.0. TuringLang/MCMCChains.jl#193 implemented the Tables interface. At the same time, the dataframes-compat.jl code was removed, which allowed the user to select which sections of parameters to use as columns in a DataFrame.

On Turing master branch the behaviour is now:

using Turing, Optim, StatsBase, DataFrames

@model function gdemo(x)
    s ~ InverseGamma(2, 3)
    m ~ Normal(0, sqrt(s))

    for i in eachindex(x)
        x[i] ~ Normal(m, sqrt(s))
    end
end

model = gdemo(randn(10))
map_estimate = optimize(model, MAP())
names(coef(map_estimate))[1] # `s, m`
chain = sample(model, NUTS(), 100)
names(DataFrame(chain)) # NB not `m, s` anymore

julia> names(DataFrame(chain)) # NB not `m, s` anymore
16-element Array{String,1}:
 "iteration"
 "chain"
 "acceptance_rate"
 "hamiltonian_energy"
 "hamiltonian_energy_error"
 ⋮
 "nom_step_size"
 "numerical_error"
 "s"
 "step_size"
 "tree_depth"

devmotion · 2020-06-29T13:36:19Z

Part of the redesign of MCMCChains tries to remove the amount of "automagic" that was present in MCMCChains, which could lead to surprising behaviour and type instabilities. Instead users have to subset the chains now in some cases explicitly. Usually that's possible without much additional code and IMO highlights more clearly what you want to achieve. Moreover, it allows (and slightly enforces) to write more efficient code since now subsets are not computed anymore everytime you calculate something but instead you can/should subset the chains once and then perform your calculations without any additional modification of the underlying arrays.

In this case you might want to only extract the parameters by writing

chain = sample(model, NUTS(), 100)
chains_only_params = Chains(chain, :parameters)
names(DataFrame(chains_only_params))

devmotion · 2020-06-29T13:56:56Z

I think the main issue here (different order of parameters in MCMC chain vs MAP result) could be fixed by using natural sort order for MAP results as well (which could be disabled optionally by setting sorted = false, analogously to the behaviour of sample).

Red-Portal · 2023-07-24T09:24:52Z

I'll close this since this seems to have been resolved.

Red-Portal closed this as completed Jul 24, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Different order of parameters in MCMC chain vs. MAP result #1322

Different order of parameters in MCMC chain vs. MAP result #1322

ElOceanografo commented Jun 10, 2020

harryscholes commented Jun 29, 2020 •

edited

Loading

devmotion commented Jun 29, 2020

devmotion commented Jun 29, 2020

Red-Portal commented Jul 24, 2023

Different order of parameters in MCMC chain vs. MAP result #1322

Different order of parameters in MCMC chain vs. MAP result #1322

Comments

ElOceanografo commented Jun 10, 2020

harryscholes commented Jun 29, 2020 • edited Loading

devmotion commented Jun 29, 2020

devmotion commented Jun 29, 2020

Red-Portal commented Jul 24, 2023

harryscholes commented Jun 29, 2020 •

edited

Loading