Skip to content

grid_dim.y boundary check #1059

Discussion options

You must be logged in to vote

Hello, @zhaolianshuizls! The code you mentioned is written in terms of tiles. Tile size depends on the data type, but let's say it's 128 threads, each processing ten items, so the tile size is 1280 (~2^10) items. To calculate the number of tiles, select first checks the maximum x-dimension of a grid of thread blocks. It should be around 2^31 blocks. This already converts to 2^41 items without increasing the number of blocks along the Y dimension. If you are using our device API, the num_items type is currently limited to be an int, so it should be impossible to get scan_grid_size.y > 1. If you are using the dispatch layer directly, providing OffsetT different from int, it should be possib…

Replies: 0 comments 1 reply

Comment options

You must be logged in to vote
0 replies
Answer selected by zhaolianshuizls
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
CUB
Labels
None yet
2 participants