Is this a serious program? #251
Comments
root@1a89b5aa5fce:/opt/hipBLASLt/build/release# ./clients/staging/hipblaslt-test
Query device success: there are 1 devices
Device ID 0 : AMD Radeon VII gfx906:sramecc+:xnack-
@idreamerhx hipblaslt currently supports only gfx90a devices.
@idreamerhx Can you please test with the latest ROCm 6.1.2 to see if your issue still exists? If not, please close the ticket. Thanks!
root@1a89b5aa5fce:/opt/hipBLASLt/build/release# ./clients/staging/hipblaslt-bench -m 2048 -n 2048 -k 2048 --precision f32_r -v 1 --activation_type relu
Query device success: there are 1 devices
Device ID 0 : AMD Radeon VII gfx906:sramecc+:xnack-
with 17.2 GB memory, max. SCLK 1801 MHz, max. MCLK 1000 MHz, compute capability 9.0
maxGridDimX 2147483647, sharedMemPerBlock 65.5 KB, maxThreadsPerBlock 1024, warpSize 64
rocblaslt warning: No paths matched /opt/hipBLASLt/build/release/library/../Tensile/library/gfx906co. Make sure that HIPBLASLT_TENSILE_LIBPATH is set correctly.
transA,transB,grouped_gemm,batch_count,M,N,K,alpha,lda,stride_a,beta,ldb,stride_b,ldc,stride_c,ldd,stride_d,d_type,compute_type,activation_type,bias_vector,hipblaslt-Gflops,us,CPU-Gflops,CPU-us,norm_error_1
N,N,0,1,2048,2048,2048,1,2048,4194304,0,2048,4194304,2048,4194304,2048,4194304,f32_r,f32_r,relu,0, 2.72763e+06, 6.3,4.47063,3.84376e+06,1.08487
root@1a89b5aa5fce:/opt/hipBLASLt/build/release# ./clients/staging/hipblaslt-bench -m 1024 -n 1024 -k 1024 --precision f32_r -v 1 --activation_type relu
Query device success: there are 1 devices
Device ID 0 : AMD Radeon VII gfx906:sramecc+:xnack-
with 17.2 GB memory, max. SCLK 1801 MHz, max. MCLK 1000 MHz, compute capability 9.0
maxGridDimX 2147483647, sharedMemPerBlock 65.5 KB, maxThreadsPerBlock 1024, warpSize 64
rocblaslt warning: No paths matched /opt/hipBLASLt/build/release/library/../Tensile/library/gfx906co. Make sure that HIPBLASLT_TENSILE_LIBPATH is set correctly.
transA,transB,grouped_gemm,batch_count,M,N,K,alpha,lda,stride_a,beta,ldb,stride_b,ldc,stride_c,ldd,stride_d,d_type,compute_type,activation_type,bias_vector,hipblaslt-Gflops,us,CPU-Gflops,CPU-us,norm_error_1
N,N,0,1,1024,1024,1024,1,1024,1048576,0,1024,1048576,1024,1048576,1024,1048576,f32_r,f32_r,relu,0, 279030, 7.7,4.39526,488829,1.12318
root@1a89b5aa5fce:/opt/hipBLASLt/build/release# ^C
root@1a89b5aa5fce:/opt/hipBLASLt/build/release# ./clients/staging/hipblaslt-bench -m 102^C-n 1024 -k 1024 --precision f32_r -v 1 --activation_type relu
root@1a89b5aa5fce:/opt/hipBLASLt/build/release# ./clients/staging/hipblaslt-bench --precision f32_r -v 1
Query device success: there are 1 devices
Device ID 0 : AMD Radeon VII gfx906:sramecc+:xnack-
with 17.2 GB memory, max. SCLK 1801 MHz, max. MCLK 1000 MHz, compute capability 9.0
maxGridDimX 2147483647, sharedMemPerBlock 65.5 KB, maxThreadsPerBlock 1024, warpSize 64
rocblaslt warning: No paths matched /opt/hipBLASLt/build/release/library/../Tensile/library/gfx906co. Make sure that HIPBLASLT_TENSILE_LIBPATH is set correctly.
transA,transB,grouped_gemm,batch_count,M,N,K,alpha,lda,stride_a,beta,ldb,stride_b,ldc,stride_c,ldd,stride_d,d_type,compute_type,activation_type,bias_vector,hipblaslt-Gflops,us,CPU-Gflops,CPU-us,norm_error_1
N,N,0,1,128,128,128,1,128,16384,0,128,16384,128,16384,128,16384,f32_r,f32_r,none,0, 776.723, 5.4,4.06425,1032,1.07202
How can this GPU reach over 200 TFLOPS? The 1024x1024x1024 run reports 279030 Gflops.
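For anyone checking the arithmetic, here is a minimal sketch that recomputes the Gflops column from the M, N, K and us columns in the logs above. It assumes the bench counts 2*M*N*K floating-point operations per GEMM; that formula is an assumption rather than something taken from the hipBLASLt source, and `gemm_gflops` is a hypothetical helper name. Under that assumption the computed values agree with the reported ones to within rounding of the printed time.

```python
def gemm_gflops(m: int, n: int, k: int, microseconds: float) -> float:
    """GFLOP/s for an m x n x k GEMM, assuming 2*m*n*k floating-point operations."""
    flops = 2.0 * m * n * k
    seconds = microseconds * 1e-6
    return flops / seconds / 1e9


# (M, N, K, reported time in us, reported hipblaslt-Gflops), copied from the logs above
runs = [
    (2048, 2048, 2048, 6.3, 2.72763e06),
    (1024, 1024, 1024, 7.7, 279030.0),
    (128, 128, 128, 5.4, 776.723),
]

for m, n, k, us, reported in runs:
    computed = gemm_gflops(m, n, k, us)
    print(f"{m}x{n}x{k}: computed {computed:.6g} Gflops, reported {reported:.6g} Gflops")
```

The reported time stays between 5.4 and 7.7 us while the amount of work grows by a factor of 4096 from the 128 case to the 2048 case. Together with the "No paths matched" warning and a norm_error_1 close to 1 in every run, this suggests, but does not prove, that no real GEMM kernel ran on gfx906 and the Gflops figure only reflects a near-constant launch time.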