Skip to content

For a 2-D grid kernel launch, does blockIdx.y change faster than blockIdx.x? #1058

Discussion options

You must be logged in to vote

Hello, @zhaolianshuizls! There isn't a guarantee on the order. The thread blocks are scheduled independently. I think due to the current num_items restriction, we haven't actually tested the scenario where blockIdx.y is larger than 1. We should take a look once we get to supporting 64-bit offsets in select / partition. If this is critical to you, please, feel free to open an issue against https://github.com/NVIDIA/cccl.

Replies: 0 comments 1 reply

Comment options

You must be logged in to vote
0 replies
Answer selected by zhaolianshuizls
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
CUB
Labels
None yet
2 participants