Try to update calc on init #92

jobrachem · 2023-09-07T12:22:45Z

This is my proposal for #79

In this PR, a calculator will try to update its value during initialization. If the update fails, a warning with the title of the error is logged. Also, the full traceback of the error is logged to the debug logger.

Example 1:

>>> import liesel.model as lsl
>>> 
>>> a = lsl.Data(1.0)
>>> b = lsl.Calc(lambda x: x / 0, a)
liesel.model.nodes - WARNING - Calc(name="") was not updated during initialization, because the following exception occured: RuntimeError('Error while updating Calc(name="").'). See debug log for the full traceback.

Example 2:

>>> import logging
>>> import liesel.model as lsl
>>> 
>>> logger = logging.getLogger("liesel")
>>> logger.handlers[0].setLevel(logging.DEBUG)
>>> 
>>> a = lsl.Data(1.0)
>>> b = lsl.Calc(lambda x: x / 0, a)
liesel.model.nodes - WARNING - Calc(name="") was not updated during initialization, because the following exception occured: RuntimeError('Error while updating Calc(name="").'). See debug log for the full traceback.
liesel.model.nodes - DEBUG - Calc(name="") was not updated during initialization, because the following exception occured:
Traceback (most recent call last):
  File "/Users/johannesbrachem/Documents/git/liesel/liesel/model/nodes.py", line 526, in update
    self._value = self.function(*args, **kwargs)
  File "<string>", line 1, in <lambda>
ZeroDivisionError: float division by zero

The above exception was the direct cause of the following exception:

Traceback (most recent call last):
  File "/Users/johannesbrachem/Documents/git/liesel/liesel/model/nodes.py", line 500, in __init__
    self.update()
  File "/Users/johannesbrachem/Documents/git/liesel/liesel/model/nodes.py", line 528, in update
    raise RuntimeError(f"Error while updating {self}.") from e
RuntimeError: Error while updating Calc(name="").

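The behavior described above can be sketched without Liesel. This is a minimal, self-contained illustration and not the code in nodes.py: the class `SketchCalc` and the logger name `sketch.nodes` are hypothetical. Only the pattern comes from the description: try the update in `__init__`, log a short warning, and log the full traceback at debug level.

```python
import logging

logger = logging.getLogger("sketch.nodes")  # hypothetical logger name


class SketchCalc:
    """Hypothetical stand-in for a calculator node; not Liesel's Calc."""

    def __init__(self, function, *inputs, name=""):
        self.function = function
        self.inputs = inputs
        self.name = name
        self.value = None

        try:
            self.update()
        except Exception as e:
            # Short message for the warning log, full traceback for the debug log.
            msg = f"{self} was not updated during initialization"
            logger.warning("%s, because the following exception occurred: %r.", msg, e)
            logger.debug("%s:", msg, exc_info=e)

    def __repr__(self):
        return f'SketchCalc(name="{self.name}")'

    def update(self):
        try:
            self.value = self.function(*self.inputs)
        except Exception as e:
            # Re-raise with context, mirroring the chained traceback shown above.
            raise RuntimeError(f"Error while updating {self}.") from e
        return self


ok = SketchCalc(lambda x: x + 1.0, 1.0)
broken = SketchCalc(lambda x: x / 0, 1.0)  # logs a warning, does not raise
```

Because the exception is caught in `__init__`, `broken` is still a usable object whose value is simply unset; calling `broken.update()` later raises the chained error as usual.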
jobrachem · 2023-09-07T12:26:07Z

@wiep this is something we discussed briefly during the weekly on August 30. You had some reservations about the general idea to try to update a calculator on initialization - so to enrich the conversation about this proposal, I created this PR which shows how I think the update could be implemented without breaking the calculator's ability to be initialized without a working update. What do you think?

jobrachem · 2023-09-11T09:50:55Z

Converted back to draft, because I just saw that with this implementation, we get the following warnings during model initialization:

liesel.model.nodes - WARNING - Calc(name="_model_log_prior") was not updated during initialization, because the following exception occured: RuntimeError('Error while updating Calc(name="_model_log_prior").'). See debug log for the full traceback.
liesel.model.nodes - WARNING - Calc(name="_model_log_prob") was not updated during initialization, because the following exception occured: RuntimeError('Error while updating Calc(name="_model_log_prob").'). See debug log for the full traceback.

This is not pretty: failing to update at this point is intended behavior, and I think users should not be warned about it. Before this PR is ready for review, I would like to come up with a nicer solution for this case.

wiep · 2023-09-12T00:50:35Z

i'm still not super happy with the default warnings. so far, we say that with liesel you can build a partial graph, but that would now trigger warnings, which is not really what should happen. usually i do not like this kind of magic, but here it might be justified: a context manager that turns the updates on (or off). i think that is better than fiddling with the log levels.

with lsl.instant_evaluation():
    a = lsl.Data(1.0)
    b = lsl.Calc(lambda x: x / 0, a)
# warnings

a = lsl.Data(1.0)
b = lsl.Calc(lambda x: x / 0, a)
# no warnings

jobrachem · 2023-09-12T07:31:33Z

I don't really think the magic context manager would bring us a lot of joy here. Manually, the desired functionality can already be achieved by

a = lsl.Data(1.0)
b = lsl.Calc(lambda x: x / 0, a).update()

This makes it more obvious what is happening than the context manager.

The idea of this PR is mostly to make it easier for users to spot errors in their code early without introducing additional manual work. If you don't know the internals of how, when and why exactly a calculator gets updated, it is currently easy to get tripped up. At least that is what happened to me.

I think an alternative to the work in this PR could lie in documentation.

> i think that is better than fiddling with the log levels.

You only need to change the log levels if you have no other way of accessing the original traceback. When you are working interactively, say in Quarto, you can just run b.update() manually once to get the error message. If you can't do that, activating the "debug" log level for debugging is reasonable, I think.

What is true is that this code looks a little fiddly:

logger = logging.getLogger("liesel")
logger.handlers[0].setLevel(logging.DEBUG)

This is a result of how we set up our logging. I think we could (and should) change the setup such that you can set the log level like this:

logger = logging.getLogger("liesel")
logger.setLevel(logging.DEBUG)

But that is a different issue.
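Why the first snippet has to touch the handler can be shown with the standard library alone. This is a sketch under assumptions: the logger name `sketch` and the handler setup are made up for illustration and do not reproduce Liesel's logging configuration. A record has to pass the logger's level and then each handler's level, so lowering only one of them has no effect while the other is stricter.

```python
import io
import logging

stream = io.StringIO()
handler = logging.StreamHandler(stream)
handler.setLevel(logging.WARNING)  # handler-level filter

logger = logging.getLogger("sketch")  # hypothetical logger name
logger.addHandler(handler)
logger.propagate = False  # keep the example isolated from the root logger
logger.setLevel(logging.DEBUG)  # logger-level filter is now permissive

logger.debug("first")  # passes the logger, is dropped by the handler
handler.setLevel(logging.DEBUG)
logger.debug("second")  # passes both filters

output = stream.getvalue()
```

Here only the second message reaches the stream. A setup in which the handler imposes no level of its own would let `logger.setLevel(logging.DEBUG)` suffice.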

wiep · 2023-09-17T18:14:38Z

Being able to build a non-initialized model without warnings is imho a crucial aspect of Liesel. However, the proposed implementation may require users to turn off logging, build the non-initialized model, and then turn the logging back on. I find that not ideal. Checking the calculations during initialization is often beneficial from a user's perspective, though. Therefore, I think it may be necessary to have two different modes to cater to both scenarios.

One possible solution is to enable updates by default and disable them only when a particular setting is altered. For example, a context manager could be used to adjust the setting temporarily.

lsl.update_on_construction = False
# build model

# or

with lsl.disable_updates_on_construction():
    ...  # build model

We can discuss this further during one of our upcoming weekly meetings.

jobrachem · 2023-11-01T13:38:43Z

We could also simply add an init parameter to calculators:

a = lsl.Data(1.0)
b = lsl.Calc(lambda x: x / 0, a, update_on_init=False)

This could be set to True by default.
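As a rough sketch of how such a parameter might behave (the class `SketchCalc` is hypothetical and does not reproduce Liesel's `Calc`; only the parameter name `update_on_init` comes from the proposal above), the flag simply decides whether the first update is attempted in `__init__`:

```python
class SketchCalc:
    """Hypothetical calculator with an opt-out for the initial update."""

    def __init__(self, function, *inputs, update_on_init=True):
        self.function = function
        self.inputs = inputs
        self.value = None
        if update_on_init:
            self.update()

    def update(self):
        self.value = self.function(*self.inputs)
        return self


eager = SketchCalc(lambda x: x * 2, 1.0)  # evaluated on construction
lazy = SketchCalc(lambda x: x / 0, 1.0, update_on_init=False)  # deferred
```

Compared with a global switch, the choice is visible at each call site, at the cost of repeating the argument for every node that should stay uninitialized.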

jobrachem · 2024-01-17T15:27:42Z

We will go with the init argument, which defaults to True. We can add a context manager later, if it turns out we want one.

jobrachem · 2024-01-31T15:07:30Z

@wiep if you got a notification for this PR, you can ignore it. @GianmarcoCallegher is doing the review.

GianmarcoCallegher · 2024-02-09T21:28:04Z

LGTM

jobrachem added 2 commits September 5, 2023 12:08

correct sampling for sigma_sq

2ed4609

try to update calc on init

03b7b45

Update nodes.py

jobrachem added the "enhancement" label Sep 7, 2023

jobrachem requested a review from wiep September 7, 2023 12:22

jobrachem self-assigned this Sep 7, 2023

fix test

8bf4cd4

Due to the exception being caught and re-raised in the update() method, we do not get the actual error text here.

jobrachem linked an issue Sep 7, 2023 that may be closed by this pull request

Call Calc.update() once as the last step of initialization #79

Closed

Merge pull request #85 from liesel-devs/fix-linreg-tutorial

332c9de

Correct sampling for sigma_sq

jobrachem marked this pull request as draft September 11, 2023 09:49

Add Goose-based initialization strategies / jittering (#72)

ddb6d15

Co-authored-by: Hannes Riebl <[email protected]>

jobrachem and others added 13 commits September 27, 2023 15:22

Cap BlackJAX version (#95)

f0d14b9

Fix PyMC tutorial (#96)

060b879

Co-authored-by: wiep <[email protected]>

Fix mypy errors (#97)

97ad3b4

Improve import name

3d50398

Fix title in plot_scatter(), closes #98

33ae765

Adapt to BlackJAX 1.0.0 (#100)

6a05035

Co-authored-by: Hannes Riebl <[email protected]>

increase timeout

6ffdc03

Update metadata v0.2.5

15f960a

update metadata 0.2.6-dev

be72c5c

Fix Mypy errors (#102)

b8303c8

Co-authored-by: Gianmarco Callegher <[email protected]>

More Efficient MVN Degenerate (#101)

6275b74

* Efficient MVN Degenerate * Fixed blackjax refactor error * Refactor * indent more code * Update CHANGELOG.md --------- Co-authored-by: Gianmarco Callegher <[email protected]> Co-authored-by: Johannes Brachem <[email protected]>

Fix #103 (#109)

4f8524b

* Update summary_m.py * Update CHANGELOG.md

Update deprecation message

0eb30cd

update deprecation messages

3a9ae8c

wiep and others added 14 commits November 16, 2023 08:36

change type to be consistent with Goose

18b1de3

Merge pull request #156 from Seb-Lorek/change-seed

8a284f6

Allow passing key as seed

pin pymc version

727bd85

Implements NamedTupleInterface (#151)

3ad50be

* Adds NamedTupleInterface * Format * Add documentation * Export NamedTupleInterface in goose.__init__.py * Format * Update documentation

Update CHANGELOG.md

5f45cb0

Updated version

98ef516

Merge remote-tracking branch 'origin/main' into main

ab44292

Update CHANGELOG.md

15e393e

fix name basis_matrix in 07-groups.qmd (#166)

0a46ef5

Better data generation for location-scale regression tutorial

204b290

Fixed deprecated warnings

9335b7c

Update docs (#169)

5b0fdd1

Update model.rst

8176c13

Update __version__.py

12b3ba9

GianmarcoCallegher and others added 5 commits January 31, 2024 15:18

try to update calc on init

d54aecc

Update nodes.py

fix test

7b8c50b

Due to the exception being caught and re-raised in the update() method, we do not get the actual error text here.

implement init argument

0a4f227

Merge branch 'update-calc-on-init' of https://github.com/liesel-devs/…

b25a307

…liesel into update-calc-on-init

jobrachem marked this pull request as ready for review January 31, 2024 14:38

jobrachem requested a review from GianmarcoCallegher January 31, 2024 14:38

jobrachem added 2 commits January 31, 2024 15:43

fix Calc docs

f39fa9f

Update CHANGELOG.md

496bef2

Removed duplicated logger in nodes module

0fe859e

GianmarcoCallegher merged commit df53f7d into main Feb 9, 2024
3 checks passed

GianmarcoCallegher deleted the update-calc-on-init branch February 9, 2024 21:32

jobrachem mentioned this pull request Apr 16, 2024

Turn off update_on_init for special model nodes #186

Merged
