Added qmc_quad based method for estimation of the constrained normalization factor #839

JasperMartins · 2024-10-25T14:46:42Z

This PR implements a new method to estimate the normalization factor for constrained priors.
The changes are two-fold:

The integration is performed with scipy.integrate.qmd_quad, a quasi-Monte Carlo-based integration routine that is expected to yield better results than regular Monte Carlo integration. However, the routine requires a rescaling step from the unit cube rather than direct sampling.
The termination of the integration is based on its relative statistical error rather than the number of accepted samples.

I have tested the two implementations with a relatively easy scenario: A 2D uniform prior on the [-1,1] cube, constrained to an inscribed circle with different radii:

The new method is significantly faster for high normalization factors, and the relative errors show a similar spread.

The implementation is marked as a draft because of the requirement of a rescale-method of the priors. Thus, it could be nice to keep the old method as a fallback. Also, the relative-error termination criterion could be applied just as well to the old implementation.

Related issue: #835

ColmTalbot · 2024-10-25T16:32:11Z

The implementation is marked as a draft because of the requirement of a rescale-method of the priors. Thus, it could be nice to keep the old method as a fallback. Also, the relative-error termination criterion could be applied just as well to the old implementation.

We currently have a (soft) requirement that all priors should implement a rescale method (currently it will just return None, which is not ideal, https://github.com/bilby-dev/bilby/blob/main/bilby/core/prior/base.py#L137-L153), so this approach should be safe.

Even if we make the base class raise an error when you attempt to rescale it will still be possible to use some samplers in that case.

It's possible that people will implement their own prior subclasses that don't support the rescale, so I'm not opposed to keeping a fallback. It may make everything easier if we change bilby.core.prior.base.Prior.rescale to raise a NotImplementedError, but that should probably get some more eyes and be done in a separate PR.

ColmTalbot · 2024-10-25T17:09:38Z

I think that actually the existing method won't work if the new prior doesn't implement rescale as sample. I think it's sufficiently unlikely that people are manually defining sample without rescale.

JasperMartins · 2024-10-30T14:51:00Z

I have updated the PR quite a bit. The core logic of the integration of the normalization factor is now handled by one of two functions: either MC-Integration based on samples from PriorDict.sample, or quasi MC-Integration based on the rescale method. The user can choose which is used, but the code also checks if the rescale method is implemented if qmc_quad is used and will default to from_samples if not. For both methods, termination of the integration is handled vi a bound of the estimated relative error. For both methods, a max_trials kwarg can be used to limit the number of probability evaluations.

I have also optimized the from_samples implementation. Before, every time min_accept was not reached, new samples were added to a list, and the constrained was applied to the full list - yielding a steep increase in runtime with the number of iterations while it would have been sufficient to check the new samples.

For the example I gave above, the qmc-based implementation is now actually slower than the from_samples method due to a higher overhead. Priors that implement sample by just calling rescale on unit-samples should perform much closer.
I still selected qmc_quad as the default because, as the attached plot shows, for normalization factors close to 1 (which is more likely in most applications), the relative error is smaller.

I have also improved the robustness against bugs by checking if the chosen keys are sufficient to compute the constrained, and if the PriorDict is constrained in the first place.

ColmTalbot

This is looking good! I just have a few specific comments/questions.

bilby/core/prior/dict.py

ColmTalbot · 2024-10-30T15:07:59Z

bilby/core/prior/dict.py

+                if np.any(np.isnan(samples[key])):
+                    print("The rescale method appears to be not working. Switching to 'sample_based'.")
+                    integrator = self._integrate_normalization_factor_from_samples
+        except Exception:


I'd rather not use a plain Exception. Do you have an example of a case that failed and what error is raised? I would imagine it would be something like NotImplementedError or AttributeError?

It's probably better completely removed. I thought about user-side errors in the implementation of custom priors, but such cases should probably fail loudly rather than silently.

Maybe catching NotImplementedError could be added to prepare for later changes that add NotImplementedErrors to the base classes, but that would also render the check if some priors yield None unnecessary.

This sounds good to me.

I updated the PR so that both cases are accounted for

bilby/core/prior/dict.py

…zation factor

…nst bugs

…putaional overhead and repeateability

…f rescale method is save

ColmTalbot

This is in great shape, just two last documentation based comments.

bilby/core/prior/dict.py

ColmTalbot · 2024-11-12T15:35:44Z

bilby/core/prior/dict.py

+            The normalization factor, rounded to the number of significant digits based on the standard deviation of
+            the integration estimate.
+
+        """


Thanks for adding this docstring, the argument description is nice, since we're adding can I suggest also adding a notes section with a seealso directive to point to Halton and a versionchanged directive set to 2.4.0.

I hope I did this in the right way :)

I think you could do, e.g., :func:scipy.integrate.qmc_quad, but I'm not sure how to link to the external package. I would probably go for just :code:scipy.integrate.qmc_quad. The "Scipy's scipy...." is kind of redundant and can just be ":code:scipy...."

This is how it currently renders (you can download the produced as a CI artefact if you can't build the docs locally.)

ColmTalbot linked an issue Oct 25, 2024 that may be closed by this pull request

Improve perfomance of the normalization factor estimation for constraint priors #835

Open

ColmTalbot added enhancement New feature or request >100 lines priors labels Oct 30, 2024

ColmTalbot requested changes Oct 30, 2024

View reviewed changes

JasperMartins added 5 commits November 12, 2024 15:51

Added qmc_quad based method for estimation of the constrained normali…

68eabcd

…zation factor

added unit test

474d5c9

Refactor normalization factor estimation and make it more robust agai…

8db3dc1

…nst bugs

Applied changes discussed on Github and updated qmc_quad for less com…

a8cf17b

…putaional overhead and repeateability

Added check if constraint estimation is necessary and updated check i…

98bd7c6

…f rescale method is save

JasperMartins force-pushed the improved_normalization_factor_estimation branch from 8e73e20 to 98bd7c6 Compare November 12, 2024 15:15

ColmTalbot reviewed Nov 12, 2024

View reviewed changes

ColmTalbot added this to the 2.4.0 milestone Nov 12, 2024

JasperMartins added 2 commits November 12, 2024 17:23

switched to logger.info()

11b32f7

updated documentation to include versionchanged and seealso

4b3cdcb

JasperMartins changed the title ~~Draft: Added qmc_quad based method for estimation of the constrained normalization factor~~ Added qmc_quad based method for estimation of the constrained normalization factor Nov 12, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Added qmc_quad based method for estimation of the constrained normalization factor #839

Added qmc_quad based method for estimation of the constrained normalization factor #839

JasperMartins commented Oct 25, 2024 •

edited

Loading

ColmTalbot commented Oct 25, 2024

ColmTalbot commented Oct 25, 2024

JasperMartins commented Oct 30, 2024 •

edited

Loading

ColmTalbot left a comment

ColmTalbot Oct 30, 2024

JasperMartins Nov 1, 2024

ColmTalbot Nov 5, 2024

JasperMartins Nov 12, 2024

ColmTalbot left a comment

ColmTalbot Nov 12, 2024

JasperMartins Nov 12, 2024

ColmTalbot Nov 12, 2024

Added qmc_quad based method for estimation of the constrained normalization factor #839

Are you sure you want to change the base?

Added qmc_quad based method for estimation of the constrained normalization factor #839

Conversation

JasperMartins commented Oct 25, 2024 • edited Loading

ColmTalbot commented Oct 25, 2024

ColmTalbot commented Oct 25, 2024

JasperMartins commented Oct 30, 2024 • edited Loading

ColmTalbot left a comment

Choose a reason for hiding this comment

ColmTalbot Oct 30, 2024

Choose a reason for hiding this comment

JasperMartins Nov 1, 2024

Choose a reason for hiding this comment

ColmTalbot Nov 5, 2024

Choose a reason for hiding this comment

JasperMartins Nov 12, 2024

Choose a reason for hiding this comment

ColmTalbot left a comment

Choose a reason for hiding this comment

ColmTalbot Nov 12, 2024

Choose a reason for hiding this comment

JasperMartins Nov 12, 2024

Choose a reason for hiding this comment

ColmTalbot Nov 12, 2024

Choose a reason for hiding this comment

JasperMartins commented Oct 25, 2024 •

edited

Loading

JasperMartins commented Oct 30, 2024 •

edited

Loading