Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Allows workers and rsets to be set by detected GPUs. #939

Draft
wants to merge 1 commit into
base: develop
Choose a base branch
from

Conversation

shuds13
Copy link
Member

@shuds13 shuds13 commented Dec 18, 2022

Decide option name and type set_workers_by_gpus.
Sets resource_sets to no. of GPUs, and currently sets workers to rsets+1. But maybe we don't want to expose users to resource_sets terminology, as intended to be simplified.

But consider setting rsets and honor workers if exists. Only for local comms - unless honoring workers. rsets is more elegant as can establish basic configuration and then just have a multiplier.

Could allow the user to set up resources in their calling script, similar to the executor. This could include the configuration of a resource set. The user could also use the returned attributes in calling script if want - including nworkers (when set by detection). If set by user, would not get re-calculated within libE. This would also allow checks before calling libE. You could specify system / code / env variables here - or get back whether a known system is detected. It may be also that you want to access some shared configuration file - and that could also be specified here (or a common location could be looked for). Would need to be added to YAML parsing - as with any additions. Then, do you set the above here, instead of libE_specs?

Uses set_workers_by_gpus.
But consider setting rsets and honor workers if exists.
Only for local comms - unless honor workers.
@shuds13 shuds13 self-assigned this Feb 8, 2023
@shuds13 shuds13 mentioned this pull request Dec 15, 2023
20 tasks
@shuds13 shuds13 mentioned this pull request Feb 5, 2024
4 tasks
@jmlarson1 jmlarson1 changed the base branch from feature/gpu_detect to develop February 8, 2024 17:12
@shuds13 shuds13 mentioned this pull request Mar 20, 2024
19 tasks
@shuds13 shuds13 mentioned this pull request Apr 25, 2024
12 tasks
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant