Skip to content

Commit

Permalink
- Adding norm_to_scale_identity_weight_per_block to multiply and upda…
Browse files Browse the repository at this point in the history
…te_cache methods of estimator which allows the identity_weight to be scaled differently for each block according to some kind of norm (or norm-like function) of the curvature for that block.

- Fixing minor bug that would cause some curvature blocks to use an improperly scaled damping when multiplying with power=1 and use_cached=True, for classes that have non-trivial state_dependent_scale methods.

- Adding whitespace to improve readability.

PiperOrigin-RevId: 577679697
  • Loading branch information
james-martens authored and KfacJaxDev committed Oct 29, 2023
1 parent e637eea commit bacdf8e
Show file tree
Hide file tree
Showing 5 changed files with 382 additions and 50 deletions.
Loading

0 comments on commit bacdf8e

Please sign in to comment.