Improve array_on_device
performance via subloc-aware comm calls
#16242
Job | Run time |
---|---|
2m 34s | |
14m 45s | |
17m 38s | |
12m 24s | |
9m 19s | |
12m 35s | |
1h 9m 15s |