- Adding norm_to_scale_identity_weight_per_block to multiply and update_cache methods of estimator, which allows the identity_weight to be scaled differently for each block according to some kind of norm (or norm-like function) of the curvature for that block.

- Fixing minor bug that would cause some curvature blocks to use an improperly scaled damping when multiplying with power=1 and use_cached=True, for classes that have non-trivial state_dependent_scale methods.

- Adding whitespace to improve readability.
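
A minimal sketch of the per-block scaling idea from the first item, in plain NumPy rather than the library's own code. All names here (`per_block_damping`, `multiply_damped_inverse`, `mean_diag`) are hypothetical and only illustrate the concept; the actual estimator methods and the values accepted by norm_to_scale_identity_weight_per_block may differ. The assumption is that each block's damping is identity_weight times a norm-like function of that block's curvature matrix.

```python
import numpy as np


def per_block_damping(blocks, identity_weight, norm_fn=None):
    """Return one damping value per block (hypothetical helper)."""
    if norm_fn is None:
        # No norm given: every block shares the same identity_weight.
        return [float(identity_weight) for _ in blocks]
    # Assumed behaviour: scale identity_weight by a norm of each block.
    return [float(identity_weight) * float(norm_fn(b)) for b in blocks]


def multiply_damped_inverse(blocks, vectors, identity_weight, norm_fn=None):
    """Apply (block + damping * I)^-1 to each vector, block by block."""
    dampings = per_block_damping(blocks, identity_weight, norm_fn)
    return [
        np.linalg.solve(b + d * np.eye(b.shape[0]), v)
        for b, d, v in zip(blocks, dampings, vectors)
    ]


def mean_diag(block):
    # Norm-like function: average diagonal entry (trace / dimension).
    return np.trace(block) / block.shape[0]


# Two blocks whose curvature differs in scale by a factor of 100.
blocks = [np.diag([1.0, 3.0]), np.diag([100.0, 300.0])]
vectors = [np.ones(2), np.ones(2)]

dampings = per_block_damping(blocks, 0.1, mean_diag)
result = multiply_damped_inverse(blocks, vectors, 0.1, mean_diag)
```

With a shared identity_weight of 0.1, the second block would be almost undamped relative to its curvature. With the norm-based scaling in this sketch, both blocks are damped by the same relative amount, so the second result is the first divided by 100.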

PiperOrigin-RevId: 571364604
james-martens authored and KfacJaxDev committed Oct 11, 2023
1 parent 9a65189 commit 5565e32
Showing 5 changed files with 421 additions and 48 deletions.