You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Given an array of node addresses as input, this function returns an array of launched worker PIDs. However, the returned pids do not necesssarily match the order of input addresses.
For example, the outcome of (p1, p2) = addprocs([machine1, machine2]) may be p1 running on machine2 and p2 running on machine1, or vice versa.
The cause of such underministic behavior is that launch(manager::SSHManager, ...) launches workers in parallel. Upon launching each worker, the pid of that worker will be pushed to launched, where no synchronization / ordering is performed:
Given an array of node addresses as input, this function returns an array of launched worker PIDs. However, the returned pids do not necesssarily match the order of input addresses.
For example, the outcome of
(p1, p2) = addprocs([machine1, machine2])
may bep1
running onmachine2
andp2
running onmachine1
, or vice versa.The cause of such underministic behavior is that
launch(manager::SSHManager, ...)
launches workers in parallel. Upon launching each worker, the pid of that worker will be pushed tolaunched
, where no synchronization / ordering is performed:Distributed.jl/src/managers.jl
Lines 177 to 185 in fd9d120
While this is not a bug, this undeterministic behavior seems counter-intuitive and is error-prone.
The text was updated successfully, but these errors were encountered: