-
Notifications
You must be signed in to change notification settings - Fork 41
RMA WG 11 02 2020
Min Si edited this page Nov 3, 2020
·
3 revisions
- Go over all active topics listed on the RMA WG wiki page.
- Add the link to the corresponding ticket for each topic.
Did not record.
- New proposal
shmem_quiet_all
(Naveen): may be added to RMA WG or Threads WG. #453 -
SHMEM_TEAM_PTR
(Jim): Prefer to hand over to someone. Please contact Jim or follow up on the ticket if you are interested. #319 - Noncontiguous communication (Jim):
- Have several proposals listed at #365
- Have read the Block Interleaved API, need more time to study other options.
- One concern is no strong application support. Might find examples from NVSHMEM customers.
- Signal API (Jim):
- Functionally identical to
shmem_put_signal
with 0 byte transfer, but the new API is more straightforward - Naveen: has concern related to multi-rail network. E.g.,
shmem_put_signal
will use a single NIC, how to deal with separateput
+signal
? - Jim: has to require the user to call
fence
(orquiet
) explicitly to ensure orderedput
+signal
- Manju: need stronger motivation for the proposal. E.g., the user has to call extra
fence
beforesignal
. - Jim: also benefits performance as it saves a conditional check; inlining might get rid of the check but it may also cause some overhead
- Min: we have observed significant instruction cache misses when do full-inlining with MPICH.
- Functionally identical to
- GPU (Jim):
- Memory space is a prerequisite
- Need coordinate with Khaled to get generic API proposal that covers both NVSHMEM and ROC_SHMEM
- Potential proposal would be to add multithreading resource input in existing RMA API. This will allow the runtime to parallelize large PUT/GET by using available threads
- Need more understanding on utilization of context
- Min: might need a separate GPU chapter since the new API might not benefit CPU-oriented programs.
- Wasi will present performance study regarding user-level threads
-
Working Groups
-
Errata