Skip to content

Commit

Permalink
Refactor CUDA kernel launch string to include stream parameter
Browse files Browse the repository at this point in the history
  • Loading branch information
LeiWang1999 authored and LeiWang1999 committed May 6, 2024
1 parent 1704958 commit 90fce29
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion python/bitblas/wrapper/general.py
Original file line number Diff line number Diff line change
Expand Up @@ -287,7 +287,7 @@ def legalize_c(p):
# Determine the shared memory size, defaulting to 0 if not specified
smem_str = 0 if self.dynamic_smem_buf is None else self.dynamic_smem_buf
# Format the CUDA kernel launch string
call_str = "{}<<<{}, {}, {}, 0>>>({});".format(function_name, grid_str, block_str, smem_str,
call_str = "{}<<<{}, {}, {}, stream>>>({});".format(function_name, grid_str, block_str, smem_str,
call_args)
# Create the host function wrapper for the CUDA kernel
host_func = """
Expand Down

0 comments on commit 90fce29

Please sign in to comment.