Skip to content

what's the purpose of CUB_SUBSCRIPTION_FACTOR #1060

Discussion options

You must be logged in to vote

The work / CTA partitioning in our reduce is static, meaning it's a function of problem size/architecture (no work stealing). The subscription factor is intended to improve load balancing. Imagine a GPU with 2 SMs, each holding only one CTA at a time. If we launch only 2 CTAs and one of them gets all the simple work, it'll finish early, keeping GPU underutilized. On the other hand, if we had another CTA available, it'd replace the finished one.

Replies: 0 comments 5 replies

Comment options

You must be logged in to vote
4 replies
@zhaolianshuizls
Comment options

@gevtushenko
Comment options

@zhaolianshuizls
Comment options

@gevtushenko
Comment options

Answer selected by zhaolianshuizls
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
CUB
Labels
None yet
2 participants