Added plotting utilities and refactored hyperparams #143

ThibeauWouters · 2024-02-03T12:29:11Z

This PR adds a few basic functionalities to simplify the treatment of the hyperparameters (now moved to a separate file) and by adding a few functionalities that allow to easily plot a few key quantities such as acceptance rates by a simple function call from any flowMC or jim script.

src/flowMC/sampler/Sampler.py

src/flowMC/utils/hyperparameters.py

src/flowMC/utils/__init__.py

src/flowMC/sampler/Sampler.py

kazewong · 2024-02-12T18:36:29Z

@ThibeauWouters The checks seem to have failed due to some missing hyperparameters, would you mind have a look of that?

ThibeauWouters · 2024-02-13T09:25:41Z

@kazewong I have pushed some updates for the failed checks, and have also added some additional postprocessing analysis code to check the Gelman-Rubin R statistic.

ThibeauWouters · 2024-02-13T10:16:50Z

My tests show something is still off, I need to double check things

ThibeauWouters · 2024-02-13T12:16:14Z

It should be fixed now

kazewong

I would not include r_hat related functionality in this PR. I am more inclined to have that as a notebook in example, since statistical metric like r_hat or correlation length needs to be used with context of user understanding the meaning of what they are measuring or monitoring.

kazewong · 2024-02-05T17:14:41Z

src/flowMC/utils/postprocessing.py

+import jax.numpy as jnp
+from flowMC.sampler.Sampler import Sampler
+
+def plot_summary(sampler: Sampler, which: str = "training", **plotkwargs) -> None:


Instead of taking the entire Sampler as input, these post-processing function should only interact with serialized output from the sampler to avoid run time complication.

This might need a bit more work in terms of tidying up what kind of output we are serialize from the sampler. I would suggest holding these modifications off or open a separate issue. I will get to it soonish

kazewong · 2024-02-13T22:22:35Z

src/flowMC/sampler/Sampler.py

@@ -113,11 +128,16 @@ def __init__(
        production["log_prob"] = jnp.empty((self.n_chains, 0))
        production["local_accs"] = jnp.empty((self.n_chains, 0))
        production["global_accs"] = jnp.empty((self.n_chains, 0))
+
+        if self.track_gelman_rubin:


Since R_hat can be compute in post-processing, I don't think we should have it in the main sampler.

kazewong · 2024-02-13T22:22:59Z

src/flowMC/sampler/Sampler.py

@@ -264,6 +284,15 @@ def sampling_loop(
                global_acceptance[:, 1::self.output_thinning],
                axis=1,
            )
+
+        if self.track_gelman_rubin:


Same as above. Please remove this from the main sampler

kazewong · 2024-02-13T22:25:08Z

src/flowMC/utils/postprocessing.py

+        plt.ylim(0-eps, 1+eps)
+    plt.savefig(f"{outdir}{name}_{which}.png", bbox_inches='tight')
+
+def gelman_rubin(chains: Float[Array, "n_chains n_steps n_dim"], discard_fraction: float = 0.1) -> Array:


Arviz actually support r_hat and many more different statistical metric. I would suggest to use that instead of writing our own function.

kazewong

I took away the gelman rubin part of the previous PR. Will merge it if the test does not fail

added plotting utilities and refactored hyperparams

571d61f

kazewong self-requested a review February 4, 2024 13:52

kazewong reviewed Feb 4, 2024

View reviewed changes

ThibeauWouters added 3 commits February 5, 2024 00:20

fixed hyperparams

95f3ce8

restructured plotting functionalities

7a8cf97

typo

e20fb57

kazewong reviewed Feb 5, 2024

View reviewed changes

src/flowMC/sampler/Sampler.py Outdated Show resolved Hide resolved

ThibeauWouters and others added 4 commits February 6, 2024 00:16

revert sampler state arg

e77c57b

revert summary dict initialization code

ca10ff1

removed matplotlib import

ffb1462

Merge branch 'main' into plotting

3918734

Thibeau Wouters added 4 commits February 13, 2024 09:27

fixed global sampler failure

3a92843

added computation for gelman-rubin R

dcf695c

typo

c92b2c9

added code to track Gelman-Rubin R

978301b

added gelman rubin key for plotting

02fcb1f

fixed test runs

ad1a32d

kazewong requested changes Feb 13, 2024

View reviewed changes

kazewong added 2 commits February 16, 2024 14:04

Update postprocessing.py

3741e22

Update Sampler.py

dc2319f

kazewong approved these changes Feb 16, 2024

View reviewed changes

kazewong merged commit 716d9ef into kazewong:main Feb 16, 2024
3 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Added plotting utilities and refactored hyperparams #143

Added plotting utilities and refactored hyperparams #143

ThibeauWouters commented Feb 3, 2024

kazewong commented Feb 12, 2024

ThibeauWouters commented Feb 13, 2024

ThibeauWouters commented Feb 13, 2024

ThibeauWouters commented Feb 13, 2024

kazewong left a comment

kazewong Feb 5, 2024

kazewong Feb 13, 2024

kazewong Feb 13, 2024

kazewong Feb 13, 2024

kazewong left a comment

Added plotting utilities and refactored hyperparams #143

Added plotting utilities and refactored hyperparams #143

Conversation

ThibeauWouters commented Feb 3, 2024

kazewong commented Feb 12, 2024

ThibeauWouters commented Feb 13, 2024

ThibeauWouters commented Feb 13, 2024

ThibeauWouters commented Feb 13, 2024

kazewong left a comment

Choose a reason for hiding this comment

kazewong Feb 5, 2024

Choose a reason for hiding this comment

kazewong Feb 13, 2024

Choose a reason for hiding this comment

kazewong Feb 13, 2024

Choose a reason for hiding this comment

kazewong Feb 13, 2024

Choose a reason for hiding this comment

kazewong left a comment

Choose a reason for hiding this comment