forked from tensorflow/tensorflow
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
PR tensorflow#9150: [NVIDIA XLA GPU] Make CUTLASS gemmDus fusion accept optional bitcast

Imported from GitHub PR openxla/xla#9150

The CUTLASS gemmDus fusion used to require an intermediate bitcast node. This change removes that constraint, since in some cases a 2-D gemm directly updates a 2-D weight.

Copybara import of the project:

--
610efc47b040a3ce9d1a2a2ec5fad8a5688cb172 by TJ <[email protected]>:

cutlass gemm dus fusion supports optional bitcast

Merging this change closes tensorflow#9150

PiperOrigin-RevId: 604573353
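To illustrate the relaxed constraint, here is a minimal sketch of a matcher that accepts a gemm feeding a dynamic-update-slice either directly or through one bitcast. It is not the code from this commit and does not use the XLA pattern-matcher API: `Node`, `Op`, `SkipOptionalBitcast`, and `MatchGemmDus` are hypothetical names for a toy graph. It assumes the update value is operand 1 of the dynamic-update-slice.

```cpp
#include <cassert>
#include <vector>

// Toy stand-in for an HLO instruction (hypothetical; NOT xla::HloInstruction).
enum class Op { kParameter, kDot, kBitcast, kDynamicUpdateSlice };

struct Node {
  Op op;
  std::vector<const Node*> operands;
};

// Looks through a single bitcast if one is present; otherwise returns the
// node unchanged. This is what makes the bitcast optional in the pattern.
const Node* SkipOptionalBitcast(const Node* node) {
  if (node->op == Op::kBitcast && node->operands.size() == 1) {
    return node->operands[0];
  }
  return node;
}

// Returns the dot (gemm) that produces the update operand of a
// dynamic-update-slice, or nullptr if the root does not match.
// Assumption: operand 0 is the buffer being updated, operand 1 is the update.
const Node* MatchGemmDus(const Node* root) {
  if (root->op != Op::kDynamicUpdateSlice || root->operands.size() < 2) {
    return nullptr;
  }
  const Node* update = SkipOptionalBitcast(root->operands[1]);
  return update->op == Op::kDot ? update : nullptr;
}
```

With this shape, both `dot -> bitcast -> dynamic-update-slice` and `dot -> dynamic-update-slice` match, while a dynamic-update-slice whose update does not come from a dot is rejected.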
1 parent 6e1fb80, commit e382119
Showing 2 changed files with 135 additions and 7 deletions.