Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[BACKEND] Add folder for addptr(ptr, 0) -> ptr #5166

Merged
merged 3 commits into from
Nov 15, 2024
Merged

Conversation

peterbell10
Copy link
Contributor

I noticed this rather obvious pattern was missing. It might come up for example if you have an expression like:

ptrs = ptr + y_stride * tl.arange(0, YBLOCK)[:, None]

and the YBLOCK is set to 1 during autotuning.

I noticed this rather obvious pattern was missing. It might come up for
example if you have an expression like:
```python
ptrs = ptr + y_stride * tl.arange(0, YBLOCK)[:, None]
```
and the `YBLOCK` is set to 1 during autotuning.
Copy link
Collaborator

@ThomasRaoux ThomasRaoux left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@peterbell10 peterbell10 enabled auto-merge (squash) November 15, 2024 17:46
@peterbell10 peterbell10 merged commit 9883a9b into main Nov 15, 2024
7 checks passed
@peterbell10 peterbell10 deleted the pb/addptr-folder branch November 15, 2024 17:53
hmalgewatta pushed a commit to hmalgewatta/triton that referenced this pull request Nov 15, 2024
I noticed this rather obvious pattern was missing. It might come up for
example if you have an expression like:
```python
ptrs = ptr + y_stride * tl.arange(0, YBLOCK)[:, None]
```
and the `YBLOCK` is set to 1 during autotuning.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants