-
Notifications
You must be signed in to change notification settings - Fork 44
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
PyTorch UT regression (1/2) #2579
Comments
bisected to this commit:
https://github.com/intel/intel-xpu-backend-for-triton/tree/d651a8444fbf096253b2bc090cc6039be29784a8 As you can tell, this change is from upstream. @guangyey do you know if upstream Triton and upstream PyTorch have the same issue? |
The upstream PyTorch uses commit cf34004b8a67d290a962da166f5aa2fc66751326 as its CI commit pin. I looked through the commit history, and the bad commit seems not to be included. |
@guangyey I installed pytorch with
When I run your reproducer I get the below error.(Code fails at line 45 Error Could you see if your reproducer works with torch==2.5.1. at your end? |
@arunjose696 It couldn't be reproduced with the 2.5.1 release branch because of the changing internal API in inductor. You have to use main branch with these different triton commits. |
May I know if there is any update? |
This was both caused by and fixed by upstream: (see corresponding issue in the inductor release thread: pytorch/pytorch#139348 I have asked @anmyachev and @whitneywhtsang to prioritize syncing the relevant commits. |
The two corresponding commits are now merged in our Intel repo. |
I have verified that this tests passes:
with Triton commit 0d9c0d3 and PyTorch commit pytorch/pytorch@78a8f7f. Based on that, I believe this ticket can be closed. |
[TL,DR]
The following case is a PyTorch UT reproducer. It has regression with the commit b6cdccd compared with 91b14bf. It means that
buf0
hasNaN
value with the newer commit b6cdccd.This impacts PyTorch upstream update triton commit pin, see pytorch/pytorch#137886. The original PyTorch Inductor UT failure log is here
Please help take a look.
The text was updated successfully, but these errors were encountered: