Add `Flow.metadata` attribute used in `Flow.update_metadata` method #679

janosh · 2024-09-18T21:50:07Z

`Flow.update_metadata` now closely mirrors `Job.update_metadata` and has reached feature parity with it:

Lines 922 to 930 in 367836b

    
    def update_metadata(
        self,
        update: dict[str, Any],
        name_filter: str = None,
        function_filter: Callable = None,
        dict_mod: bool = False,
        dynamic: bool = True,
    ):
        """

codecov · 2024-09-18T21:51:42Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 99.24%. Comparing base (a740c6c) to head (367836b).
Report is 14 commits behind head on main.

Additional details and impacted files

@@           Coverage Diff           @@
##             main     #679   +/-   ##
=======================================
  Coverage   99.23%   99.24%           
=======================================
  Files          21       21           
  Lines        1573     1583   +10     
  Branches      427      431    +4     
=======================================
+ Hits         1561     1571   +10     
  Misses         10       10           
  Partials        2        2

| Files with missing lines | Coverage Δ | |
|---|---|---|
| src/jobflow/core/flow.py | `100.00% <100.00%> (ø)` | |
| src/jobflow/core/job.py | `99.15% <100.00%> (-0.01%)` | ⬇️ |

janosh · 2024-09-27T21:33:19Z

@utf i'm sure you're strapped for time but even partial feedback here would be much appreciated!

utf

Thanks @janosh. My only concern is about breaking the API of update_metadata between Flows and Jobs. If we add the target option to Job.update_metadata that should make this function easier to use and the implementation cleaner.

utf · 2024-09-30T09:16:02Z

src/jobflow/core/flow.py

+                self.metadata.update(update)
+
+        if target in ["jobs", "both"]:
+            for job in self:


Here job could actually be a nested flow. So you should still iterate over these if target = "flow".

janosh · 2024-10-02

perhaps the naming could be improved. target = "flow" isn't meant to update the metadata of all nested flows (to the exclusion of jobs) but rather only the flow itself, not any of its jobs or nested flows.

how about we rename from target: Literal["flow", "jobs", "both"] = "both" to target: Literal["self", "nested", "both"] = "both"? unless you think it's important to have more control, i.e. be able to only update nested Flow or Job metadata. there might be use cases for that. in which case maybe we want target: Literal["self", "nested", "jobs", "flows", "both"] = "both"?

to clarify this could be the doc string:

    target
        Specifies where to apply the metadata update. Options are:
        - "self": Update only the Flow's own metadata
        - "nested": Update only the metadata of Jobs/Flows within the Flow
        - "jobs": Update only the metadata of Jobs within the Flow
        - "flows": Update only the metadata of Flows within the Flow
        - "all": Update both the Flow's metadata and nested Job+Flow metadata (default)

utf · 2024-10-02

I think this is overcomplicating it. If you want to select specific flows or jobs then you can pass in a name or class filter. So no need for the extra options. The main thing is the API should be consistent between jobs/flows.

janosh · 2024-10-02

makes sense. then the easiest thing would be to get rid of target altogether and use the "all" behavior?
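A minimal sketch of what the "all" behavior could look like. `ToyJob` and `ToyFlow` are hypothetical stand-ins, not jobflow's real `Job` and `Flow`. The update goes to every child, whether a job or a nested flow, and to the flow itself, so `name_filter` is the only way to narrow the selection. Checking the flow's own name against `name_filter` is an assumption here, not something settled in this thread.

```python
from __future__ import annotations

from dataclasses import dataclass, field
from typing import Any


@dataclass
class ToyJob:
    """Hypothetical stand-in for a job; not the real jobflow class."""

    name: str
    metadata: dict[str, Any] = field(default_factory=dict)

    def update_metadata(self, update: dict[str, Any], name_filter: str | None = None) -> None:
        # Apply the update only if the (optional) name filter matches.
        if name_filter is None or name_filter in self.name:
            self.metadata.update(update)


@dataclass
class ToyFlow:
    """Hypothetical stand-in for a flow holding jobs and nested flows."""

    name: str
    children: list[ToyJob | ToyFlow] = field(default_factory=list)
    metadata: dict[str, Any] = field(default_factory=dict)

    def __iter__(self):
        return iter(self.children)

    def update_metadata(self, update: dict[str, Any], name_filter: str | None = None) -> None:
        # A child may be a job or a nested flow; both expose the same method,
        # so the call recurses through nested flows without a `target` option.
        for child in self:
            child.update_metadata(update, name_filter=name_filter)
        # Assumption: the flow's own metadata obeys the same name filter.
        if name_filter is None or name_filter in self.name:
            self.metadata.update(update)


inner = ToyFlow("relax flow", [ToyJob("relax")])
outer = ToyFlow("outer", [ToyJob("static"), inner])

outer.update_metadata({"material_id": 42})  # reaches every job and flow
outer.update_metadata({"tag": "r"}, name_filter="relax")  # only matching names
```

With identical signatures on both classes, the recursive call needs no type check, which is the API consistency argued for above.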

utf · 2024-10-02

That's an interesting point. I think the benefit of target is that it would be a shortcut for class=Flow vs class=Job. I would be happy either having the target option or specifying the shortcut in the docstring.

janosh · 2024-10-02

different issue but while we're on the topic of API design, what's your take on adding a callback_filter: Callable[[Flow | Job], bool]? users would pass in a function which takes the Flow or Job instance on which you invoke update_metadata (or update_config, ...) and returns True if updates should be applied. perhaps more prone to user error but also very versatile. usage example:

    Flow().update_metadata(
        {"material_id": 42},
        callback_filter=lambda flow: SomeMaker in map(type, flow)
        and flow.name == "flow name"
    )
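A rough sketch of how such a `callback_filter` might behave, again with hypothetical stand-in classes (`ToyJob`, `ToyFlow`) instead of the real jobflow ones. It assumes the filter is forwarded to every nested job and flow, which the proposal above does not spell out. Under that assumption the callback receives both kinds of object and has to guard against the type it does not expect.

```python
from __future__ import annotations

from dataclasses import dataclass, field
from typing import Any, Callable


@dataclass
class ToyJob:
    """Hypothetical stand-in for a job; not the real jobflow class."""

    name: str
    metadata: dict[str, Any] = field(default_factory=dict)

    def update_metadata(
        self,
        update: dict[str, Any],
        callback_filter: Callable[[Any], bool] | None = None,
    ) -> None:
        # Apply the update only when no filter is given or the filter accepts this job.
        if callback_filter is None or callback_filter(self):
            self.metadata.update(update)


@dataclass
class ToyFlow:
    """Hypothetical stand-in for a flow holding jobs and nested flows."""

    name: str
    children: list[ToyJob | ToyFlow] = field(default_factory=list)
    metadata: dict[str, Any] = field(default_factory=dict)

    def __iter__(self):
        return iter(self.children)

    def update_metadata(
        self,
        update: dict[str, Any],
        callback_filter: Callable[[Any], bool] | None = None,
    ) -> None:
        # Assumption: the same callback is passed down to every child.
        for child in self:
            child.update_metadata(update, callback_filter=callback_filter)
        if callback_filter is None or callback_filter(self):
            self.metadata.update(update)


flow = ToyFlow("flow name", [ToyJob("relax"), ToyJob("static")])

# The isinstance guard keeps the callback from matching jobs,
# so only the flow's own metadata is updated.
flow.update_metadata(
    {"material_id": 42},
    callback_filter=lambda obj: isinstance(obj, ToyFlow) and obj.name == "flow name",
)
```

In this sketch an `isinstance` check inside the callback does the job the `target` option was meant to do: it limits the update to flows or to jobs.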

janosh · 2024-10-02 (edited)

> I think the benefit of target is that it would be a shortcut for class=Flow vs class=Job. I would be happy either having the target option or specifying the shortcut in the docstring.

i don't follow. by class=Job|Flow, did you mean function_filter=Job|Flow? the job variant function_filter=Job doesn't seem to work the way you're suggesting so maybe you meant something else?

EDIT: i guess you meant class_filter on Maker.update_kwargs but that isn't implemented for Flow/Job.update_metadata

utf · 2024-10-02

Yes, I was thinking of class_filter. I think the callback_filter you proposed sounds very flexible and would be useful!

janosh added 3 commits September 18, 2024 17:42

add attributes metadata and metadata_updates to Flow class, now used in update_metadata method

b4b349d

test Flow.metadata and Flow.update_metadata

afdbb1f

minor Job.metadata refactor to use equivalent code as Flow

367836b

janosh added the enhancement New feature or request label Sep 18, 2024

janosh mentioned this pull request Sep 18, 2024

How to add metadata to flows docs? Matgenix/jobflow-remote#124

Open

janosh requested a review from utf September 18, 2024 21:52

janosh changed the title ~~Add Flow.metadata attribute now used in Flow.update_metadata method~~ Add Flow.metadata attribute used in Flow.update_metadata method Sep 18, 2024

utf reviewed Sep 30, 2024

janosh added 3 commits September 30, 2024 12:47

tweaks

e38237d

drop target: Literal["flow", "jobs", "both"] = "both" from Flow.update_metadata

9969640

remove target kwarg from Flow tests, skipping 2 that need further discussion

c6a1732
