
PERF-#239: Improve task scheduler for MPI backend #245

Open · arunjose696 wants to merge 15 commits into master from scheduling_MPI

Conversation

@arunjose696 (Contributor) commented Feb 15, 2023

What do these changes do?

Improves the task scheduler for the MPI backend: the round-robin algorithm for submitting tasks is replaced with selecting the worker rank that holds the maximum share of the task's input data. Data is pushed to workers in the put method. Data IDs and data sizes are stored in the object store.
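A rough sketch of the scheduling idea described above (all names here are illustrative assumptions, not the PR's actual API): track how many bytes of a task's arguments each worker already holds, then schedule the task on the worker with the largest share.

import collections

def choose_destination_rank_sketch(data_ids, data_owners, data_sizes, worker_ranks):
    # Sum the bytes of the task's arguments already resident on each worker.
    shares = collections.defaultdict(int)
    for data_id in data_ids:
        shares[data_owners[data_id]] += data_sizes[data_id]
    # Pick the worker holding the largest share of the input data.
    return max(worker_ranks, key=lambda rank: shares[rank])

# Example: rank 2 holds 100 bytes of the arguments and rank 3 holds 40,
# so the task is scheduled on rank 2.
print(choose_destination_rank_sketch(
    ["a", "b"], {"a": 2, "b": 3}, {"a": 100, "b": 40}, [1, 2, 3]
))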

@arunjose696 force-pushed the scheduling_MPI branch 3 times, most recently from d4c6ecb to b619245 on February 27, 2023 16:58
else:
    data_id = object_store.generate_data_id(garbage_collector)
    dest_rank = RoundRobin.get_instance().schedule_rank()
    object_store.put(data_id, data)
Collaborator:

Why do we need to put data into the object store of the master process? We want to push data to a worker and save the owner of the data into the object store of the main process. To achieve this, we should probably rework the push_data method.

Contributor (author):

I have added an additional function (push_data_directly_to_worker) to push data directly without storing it in the object store. I did not modify the push_data method because the new method also needs the actual data as an argument, while push_data relies on the object_store.
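A minimal sketch of what such a helper could look like, assuming an mpi4py-style communicator and a matching recv on the worker side (the actual signature in the PR may differ):

from mpi4py import MPI

def push_data_directly_to_worker(dest_rank, data_id, data):
    # Send the raw data straight to the worker; unlike push_data,
    # nothing is cached in the master's object store.
    MPI.COMM_WORLD.send((data_id, data), dest=dest_rank)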

# dest_rank = RoundRobin.get_instance().schedule_rank()
dest_rank = choose_destination_rank(collected_data_ids)
print(dest_rank, collected_data_ids)
push_data_owners(dest_rank, collected_data_ids)
Collaborator:

As far as I understand, there is a case where we push the data to, for instance, rank 2 during ref = unidist.put(data) and then push the data owner (the same worker with rank 2) to that very worker during foo.remote(ref). We should probably avoid such cases, i.e., not push the data owner if the worker is the data owner itself.

# simple example
import unidist

unidist.init()

o_ref = unidist.put(1)

@unidist.remote
def foo(obj):
    return obj

foo.remote(o_ref) # here we push the owner (rank 2) to the worker with rank 2
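One way to implement the suggested guard (a sketch with assumed names and an mpi4py-style send; the PR's push_data_owners may look different):

from mpi4py import MPI

def push_data_owners(dest_rank, data_ids, data_owners):
    comm = MPI.COMM_WORLD
    for data_id in data_ids:
        owner_rank = data_owners[data_id]
        # Skip the push when the destination is the owner itself,
        # which avoids the self-push case described above.
        if owner_rank == dest_rank:
            continue
        comm.send((data_id, owner_rank), dest=dest_rank)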

Comment on lines +323 to +357
if isinstance(value, (list, tuple)):
    for v in value:
        collect_all_data_id_from_args(v, collected_data_ids)
elif isinstance(value, dict):
    for v in value.values():
        collect_all_data_id_from_args(v, collected_data_ids)
elif is_data_id(value):
Collaborator:

Is it possible that we don't get into any of the if branches? You could raise an exception at the end of the function to check this when testing with Modin.

Contributor (author):

Here the plan is just to collect data_ids recursively, so if the args are not data_ids (e.g., functions, plain Python objects, etc.), it is expected that none of the if branches is entered. I will add an exception for the case where the object store does not contain an owner.
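For reference, a self-contained version of the traversal being discussed (DataID and is_data_id are stand-ins for the helpers used in the diff above):

class DataID:
    # Stand-in for unidist's internal DataID type.
    pass

def is_data_id(value):
    # Stand-in for the helper used in the diff above.
    return isinstance(value, DataID)

def collect_all_data_id_from_args(value, collected_data_ids):
    # Recursively gather DataIDs from a remote call's arguments.
    if isinstance(value, (list, tuple)):
        for v in value:
            collect_all_data_id_from_args(v, collected_data_ids)
    elif isinstance(value, dict):
        for v in value.values():
            collect_all_data_id_from_args(v, collected_data_ids)
    elif is_data_id(value):
        collected_data_ids.append(value)
    # Any other value (a function, a plain Python object, ...) is a
    # regular argument and deliberately falls through all branches.

ids = []
collect_all_data_id_from_args([DataID(), {"k": DataID()}, 42], ids)
print(len(ids))  # 2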

int
    A rank number.
"""
if not data_ids:
Collaborator:

When is data_ids an empty list?

Contributor (author):

This is for the case where the remote function has no arguments. data_ids would then be empty and no worker would hold any data share, so such tasks are submitted using round robin, e.g.:

@unidist.remote
def foo():
    return 1

o = foo.remote()
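In code, the described fallback could look roughly like this (a sketch; RoundRobin and choose_destination_rank are the names visible in the diff above):

def choose_destination_rank(data_ids):
    if not data_ids:
        # The remote function has no arguments, so no worker holds
        # any relevant data; fall back to round-robin scheduling.
        return RoundRobin.get_instance().schedule_rank()
    # Otherwise pick the rank with the maximum data share, as
    # described in the PR summary above.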

Collaborator:

Put a comment describing this into the code.

# Data sizes
self._data_sizes = defaultdict(int)
# Mapping from python identity id() to DataID {id() : DataID}
self._identity_data_id_map = defaultdict(lambda: None)
Collaborator:

Why do we need this map?

Contributor (author):

This is no longer required; I will remove it.

The original purpose of this map was to ensure the same Python object is not sent to workers multiple times, since put was planned to be used in the submit method for all arguments, but this was later changed.
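The deduplication described here could have looked roughly like this (a sketch reconstructed from the comment above, not the actual removed code; object_store and garbage_collector mirror the names in the diff):

_identity_data_id_map = {}

def put_with_dedup(data, object_store, garbage_collector):
    # Reuse the existing DataID if this exact Python object
    # (compared by identity via id()) was already put.
    data_id = _identity_data_id_map.get(id(data))
    if data_id is None:
        data_id = object_store.generate_data_id(garbage_collector)
        _identity_data_id_map[id(data)] = data_id
        object_store.put(data_id, data)
    return data_id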

@arunjose696 force-pushed the scheduling_MPI branch 2 times, most recently from 9edb5b7 to 569e88b on March 6, 2023 15:03
Successfully merging this pull request may close these issues: [MPI] Improve task scheduler for MPI backend