Decouple ray submitter, worker, and head resources #2924

Open

Sovietaced wants to merge 3 commits into flyteorg:master from Sovietaced:issue-5666

+108 −6

Sovietaced commented Nov 13, 2024

Tracking issue

Related to flyteorg/flyte#5666

Why are the changes needed?

These changes update the flytekit-ray package so that users can configure pod specs for the worker and head workloads.

What changes were proposed in this pull request?

Update the version of flyteidl and plumb k8s pods through the worker and head configs.

How was this patch tested?

See unit tests.

Check all the applicable boxes

I updated the documentation accordingly.
All new and existing tests passed.
All commits are signed-off.

Related PRs

flyteorg/flyte#5933

Sovietaced changed the title ~~wip~~ Decouple ray submitter, worker, and head resources


          Update ray task to use latest updates in flyteidl

c621359

Signed-off-by: Jason Parraga <[email protected]>

Sovietaced force-pushed the issue-5666 branch from 5e8306a to c621359 Compare

November 13, 2024 04:56

codecov bot commented Nov 13, 2024

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 79.07%. Comparing base (3f0ab84) to head (2db498c).
Report is 2 commits behind head on master.

Additional details and impacted files

@@            Coverage Diff             @@
##           master    #2924      +/-   ##
==========================================
+ Coverage   76.33%   79.07%   +2.73%     
==========================================
  Files         199      199              
  Lines       20840    20840              
  Branches     2681     2681              
==========================================
+ Hits        15908    16479     +571     
+ Misses       4214     3622     -592     
- Partials      718      739      +21


Sovietaced commented

plugins/flytekit-ray/flytekitplugins/ray/models.py


		from flytekit.models import common as _common


		class K8sObjectMetadata(_common.FlyteIdlEntity):

Author

Sovietaced Nov 13, 2024

I copied these from flytekit/models/task.py instead of reusing them. I'm happy to just reuse the other models, but I wasn't sure whether we would want to simplify these for the ray use case or keep them decoupled so there are no unintended regressions.
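To make the "decoupled copy" option concrete, here is a minimal sketch of what plugin-local pod models could look like. It is not the code in this PR: the real models extend `_common.FlyteIdlEntity` and convert to protobuf, whereas this stand-in uses plain dataclasses and dicts. The `labels` and `annotations` fields and the `to_dict` / `from_dict` helpers are assumptions made for illustration; only `metadata` and `pod_spec` appear in the diff.

```python
from dataclasses import dataclass, field
from typing import Any, Dict, Optional


@dataclass
class K8sObjectMetadata:
    """Stand-in for the plugin-local metadata model (field names assumed)."""

    labels: Dict[str, str] = field(default_factory=dict)
    annotations: Dict[str, str] = field(default_factory=dict)

    def to_dict(self) -> Dict[str, Any]:
        return {"labels": dict(self.labels), "annotations": dict(self.annotations)}

    @classmethod
    def from_dict(cls, data: Dict[str, Any]) -> "K8sObjectMetadata":
        return cls(
            labels=dict(data.get("labels", {})),
            annotations=dict(data.get("annotations", {})),
        )


@dataclass
class K8sPod:
    """Stand-in pod model: optional metadata plus a free-form pod spec."""

    metadata: Optional[K8sObjectMetadata] = None
    pod_spec: Optional[Dict[str, Any]] = None

    def to_dict(self) -> Dict[str, Any]:
        return {
            "metadata": self.metadata.to_dict() if self.metadata else None,
            "pod_spec": self.pod_spec,
        }

    @classmethod
    def from_dict(cls, data: Dict[str, Any]) -> "K8sPod":
        metadata = data.get("metadata")
        return cls(
            metadata=K8sObjectMetadata.from_dict(metadata) if metadata else None,
            pod_spec=data.get("pod_spec"),
        )


# Hypothetical values, used only to exercise the round trip.
pod = K8sPod(
    metadata=K8sObjectMetadata(labels={"team": "ml"}),
    pod_spec={"containers": [{"name": "ray-head"}]},
)
restored = K8sPod.from_dict(pod.to_dict())
```

Keeping a separate copy like this means the ray models can change without touching the core task models, at the cost of some duplication.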


          Update docs

c2e7f36

Signed-off-by: Jason Parraga <[email protected]>

Sovietaced commented

plugins/flytekit-ray/flytekitplugins/ray/models.py Outdated

Sovietaced commented

plugins/flytekit-ray/flytekitplugins/ray/models.py

                   @property
                   def ray_start_params(self):
                       """
-                      The ray start params of worker node group.
+                      The ray start params of head node group.

Author

Sovietaced Nov 13, 2024

lil typo

Sovietaced marked this pull request as ready for review

November 13, 2024 05:07

Sovietaced requested review from wild-endeavor, kumare3, eapolinario, pingsutw, cosmicBboy, samhita-alla, thomasjpfan and Future-Outlier as code owners

November 13, 2024 05:07

kumare3 reviewed

plugins/flytekit-ray/flytekitplugins/ray/models.py Outdated

kumare3 reviewed

plugins/flytekit-ray/flytekitplugins/ray/models.py Outdated

+                      self,
+                      metadata: K8sObjectMetadata = None,
+                      pod_spec: typing.Dict[str, typing.Any] = None,
+                      data_config: typing.Optional[DataLoadingConfig] = None,

Contributor

kumare3 Nov 13, 2024

Drop this. Also, why add this K8s pod? We don't need that, since the Ray config will simply float through as JSON. So just use the K8s pod object?

Contributor

kumare3 Nov 13, 2024

We can keep the k8s pod too; I see that you are actually setting the pod properties.
This way we could also use a pod template?

Author

Sovietaced Nov 13, 2024

Yeah we had a discussion about this on the backend change here: flyteorg/flyte#5933 (comment)

It started as just adding support for resources, but we realized it would be more flexible to add support for something similar to a pod template, since that is ultimately what the KubeRay contract is.
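As a rough illustration of why a pod-template-like field is more flexible than a resources-only field, the sketch below gives the head and worker groups independent pod specs. The classes are simplified stand-ins rather than the plugin's real ones: `ray_start_params` and `k8s_pod` on the head config come from the diff, while `group_name`, `replicas`, the container names, and the `first_container_resources` helper are assumptions for illustration.

```python
from dataclasses import dataclass, field
from typing import Any, Dict, Optional


@dataclass
class K8sPod:
    """Simplified stand-in: only the free-form pod spec is modelled."""

    pod_spec: Optional[Dict[str, Any]] = None


@dataclass
class HeadNodeConfig:
    ray_start_params: Dict[str, str] = field(default_factory=dict)
    k8s_pod: Optional[K8sPod] = None


@dataclass
class WorkerNodeConfig:
    group_name: str  # assumed field
    replicas: int  # assumed field
    k8s_pod: Optional[K8sPod] = None


def first_container_resources(k8s_pod: Optional[K8sPod]) -> Dict[str, Any]:
    """Return the resources block of the first container, or {} when unset."""
    if k8s_pod is None or not k8s_pod.pod_spec:
        return {}
    containers = k8s_pod.pod_spec.get("containers") or []
    return containers[0].get("resources", {}) if containers else {}


# Hypothetical sizing: a small head pod and larger worker pods.
head = HeadNodeConfig(
    ray_start_params={"num-cpus": "0"},
    k8s_pod=K8sPod(
        pod_spec={
            "containers": [
                {"name": "ray-head", "resources": {"requests": {"cpu": "2", "memory": "4Gi"}}}
            ]
        }
    ),
)
worker = WorkerNodeConfig(
    group_name="workers",
    replicas=2,
    k8s_pod=K8sPod(
        pod_spec={
            "containers": [
                {"name": "ray-worker", "resources": {"requests": {"cpu": "8", "memory": "32Gi"}}}
            ]
        }
    ),
)
```

Because each group carries a whole pod spec, the same field could later hold tolerations, node selectors, or volumes without another config field being added.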

kumare3 reviewed

plugins/flytekit-ray/flytekitplugins/ray/task.py

                       ray_job = RayJob(
                           ray_cluster=RayCluster(
                               head_group_spec=(
-                                  HeadGroupSpec(cfg.head_node_config.ray_start_params) if cfg.head_node_config else None
+                                  HeadGroupSpec(cfg.head_node_config.ray_start_params, cfg.head_node_config.k8s_pod)

Contributor

kumare3 Nov 13, 2024

IMO this is not great.
cc @EngHabu @pingsutw
I would have loved for us to model it more like JSON, so that modifying it would be faster and would not need a protobuf change.

But I see what you are doing now.
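The trade-off raised here can be shown with a small, self-contained sketch. It assumes the pod spec is carried as a plain dict and serialized with the standard `json` module; this is not how flytekit or flyteidl actually encode the field, and the `encode_pod_spec` / `decode_pod_spec` helpers are hypothetical names.

```python
import json
from typing import Any, Dict


def encode_pod_spec(pod_spec: Dict[str, Any]) -> str:
    """Serialize a free-form pod spec; keys are not validated against a schema."""
    return json.dumps(pod_spec, sort_keys=True)


def decode_pod_spec(payload: str) -> Dict[str, Any]:
    """Parse the payload back into a plain dict."""
    return json.loads(payload)


# Hypothetical pod spec for a worker group.
pod_spec: Dict[str, Any] = {
    "containers": [{"name": "ray-worker", "resources": {"requests": {"cpu": "4"}}}]
}

# A new field passes through unchanged, with no model or schema update.
pod_spec["tolerations"] = [{"key": "gpu", "operator": "Exists"}]

round_tripped = decode_pod_spec(encode_pod_spec(pod_spec))
```

The flip side is that a free-form payload gives up the type checking that a dedicated message provides, which is the tension the two review comments describe.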


          drop data loading config

2db498c

Signed-off-by: Jason Parraga <[email protected]>


Reviewers

kumare3: left review comments
wild-endeavor: awaiting requested review (code owner)
eapolinario: awaiting requested review (code owner)
pingsutw: awaiting requested review (code owner)
cosmicBboy: awaiting requested review (code owner)
samhita-alla: awaiting requested review (code owner)
thomasjpfan: awaiting requested review (code owner)
Future-Outlier: awaiting requested review (code owner)

At least 1 approving review is required to merge this pull request.

Labels

None yet