Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add sycl-bench benchmarks #2047

Open
wants to merge 28 commits into
base: main
Choose a base branch
from
Open

Conversation

mateuszpn
Copy link

Add unisa-hpc/sycl-bench SYCL benchmarks

@mateuszpn mateuszpn marked this pull request as ready for review September 5, 2024 08:58
@mateuszpn mateuszpn requested a review from a team as a code owner September 5, 2024 08:58
@mateuszpn mateuszpn marked this pull request as draft September 5, 2024 09:15
Copy link

github-actions bot commented Sep 5, 2024

Compute Benchmarks level_zero run (with params: ):
https://github.com/oneapi-src/unified-runtime/actions/runs/10719179498

This comment was marked as outdated.

@mateuszpn mateuszpn marked this pull request as ready for review September 5, 2024 13:30
median_result.unit = benchmark.unit()
median_result.name = benchmark.name()
median_result.unit = benchmark.unit()
median_result.name = label
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Why did you change this? Now, we have 'hashtable' instead of 'VelocityBench Hashtable'

@igchor
Copy link
Member

igchor commented Sep 5, 2024

I think we should add a 'unit' to the results table - it's not clear when looking at it if the result is in milliseconds or something else.

Also, if it is in milliseconds, some of the benchmarks take very little time - can we increase number of iterations/size so that we can hopefully get more stable results?

Copy link

github-actions bot commented Sep 9, 2024

Compute Benchmarks level_zero run (with params: ):
https://github.com/oneapi-src/unified-runtime/actions/runs/10772124181

This comment was marked as outdated.

Copy link

Compute Benchmarks level_zero run (with params: ):
https://github.com/oneapi-src/unified-runtime/actions/runs/10792817112

Copy link

Compute Benchmarks level_zero run ():
https://github.com/oneapi-src/unified-runtime/actions/runs/10792817112
Job status: failure. Test status: skipped.

Copy link

Compute Benchmarks level_zero run (with params: ):
https://github.com/oneapi-src/unified-runtime/actions/runs/10793178302

This comment was marked as outdated.

Copy link

Compute Benchmarks level_zero run (with params: ):
https://github.com/oneapi-src/unified-runtime/actions/runs/10793921214

Copy link

Compute Benchmarks level_zero run ():
https://github.com/oneapi-src/unified-runtime/actions/runs/10793921214
Job status: cancelled. Test status: skipped.

Copy link

Compute Benchmarks level_zero run (with params: ):
https://github.com/oneapi-src/unified-runtime/actions/runs/10793998950

Copy link

Compute Benchmarks level_zero run ():
https://github.com/oneapi-src/unified-runtime/actions/runs/10793998950
Job status: success. Test status: success.

Summary

result is better

Benchmark This PR baseline
Velocity-Bench Hashtable 265.78607 178.291413
api_overhead_benchmark_sycl SubmitKernel out of order - 50.631
api_overhead_benchmark_sycl SubmitKernel in order - 49.385
api_overhead_benchmark_ur SubmitKernel out of order - 31.93
api_overhead_benchmark_ur SubmitKernel in order - 28.586
memory_benchmark_sycl QueueInOrderMemcpy from Device to Device, size 1024 - 423.457
memory_benchmark_sycl QueueInOrderMemcpy from Host to Device, size 1024 - 253.906
memory_benchmark_sycl QueueMemcpy from Device to Device, size 1024 - 9.179
memory_benchmark_sycl StreamMemory, placement Device, type Triad, size 10240 - 1.854
api_overhead_benchmark_sycl ExecImmediateCopyQueue out of order from Device to Device, size 1024 - 4.506
api_overhead_benchmark_sycl ExecImmediateCopyQueue in order from Device to Host, size 1024 - 3.613
miscellaneous_benchmark_sycl VectorSum - 863.651
Velocity-Bench Bitcracker - 35.8407
Velocity-Bench CudaSift - 283.294
Velocity-Bench Easywave - 457.0
Velocity-Bench QuickSilver - 115.63
Velocity-Bench Sobel Filter - 934.963

Charts

Velocity-Bench Hashtable
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Velocity-Bench Hashtable
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Velocity-Bench<br>Hashtable

        This PR (265.78607 M keys/sec)   : crit, 0, 265

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

    section hashtable

        baseline (178.291413 M keys/sec)   :  0, 178

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
api_overhead_benchmark_sycl SubmitKernel out of order
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title api_overhead_benchmark_sycl SubmitKernel out of order
    todayMarker off
    dateFormat  X
    axisFormat %s

    section SubmitKernel(api=sycl<br>Profiling=0<br>Ioq=0<br>DiscardEvents=0<br>NumKernels=10<br>KernelExecTime=1<br>MeasureCompletion=0)

        baseline (50.631 μs)   :  0, 50

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
api_overhead_benchmark_sycl SubmitKernel in order
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title api_overhead_benchmark_sycl SubmitKernel in order
    todayMarker off
    dateFormat  X
    axisFormat %s

    section SubmitKernel(api=sycl<br>Profiling=0<br>Ioq=1<br>DiscardEvents=0<br>NumKernels=10<br>KernelExecTime=1<br>MeasureCompletion=0)

        baseline (49.385 μs)   :  0, 49

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
api_overhead_benchmark_ur SubmitKernel out of order
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title api_overhead_benchmark_ur SubmitKernel out of order
    todayMarker off
    dateFormat  X
    axisFormat %s

    section SubmitKernel(api=ur<br>Profiling=0<br>Ioq=0<br>DiscardEvents=0<br>NumKernels=10<br>KernelExecTime=1<br>MeasureCompletion=0)

        baseline (31.93 μs)   :  0, 31

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
api_overhead_benchmark_ur SubmitKernel in order
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title api_overhead_benchmark_ur SubmitKernel in order
    todayMarker off
    dateFormat  X
    axisFormat %s

    section SubmitKernel(api=ur<br>Profiling=0<br>Ioq=1<br>DiscardEvents=0<br>NumKernels=10<br>KernelExecTime=1<br>MeasureCompletion=0)

        baseline (28.586 μs)   :  0, 28

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
memory_benchmark_sycl QueueInOrderMemcpy from Device to Device, size 1024
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title memory_benchmark_sycl QueueInOrderMemcpy from Device to Device, size 1024
    todayMarker off
    dateFormat  X
    axisFormat %s

    section QueueInOrderMemcpy(api=sycl<br>IsCopyOnly=0<br>sourcePlacement=Device<br>destinationPlacement=Device<br>size=1KB<br>count=100)

        baseline (423.457 μs)   :  0, 423

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
memory_benchmark_sycl QueueInOrderMemcpy from Host to Device, size 1024
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title memory_benchmark_sycl QueueInOrderMemcpy from Host to Device, size 1024
    todayMarker off
    dateFormat  X
    axisFormat %s

    section QueueInOrderMemcpy(api=sycl<br>IsCopyOnly=0<br>sourcePlacement=Host<br>destinationPlacement=Device<br>size=1KB<br>count=100)

        baseline (253.906 μs)   :  0, 253

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
memory_benchmark_sycl QueueMemcpy from Device to Device, size 1024
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title memory_benchmark_sycl QueueMemcpy from Device to Device, size 1024
    todayMarker off
    dateFormat  X
    axisFormat %s

    section QueueMemcpy(api=sycl<br>sourcePlacement=Device<br>destinationPlacement=Device<br>size=1KB)

        baseline (9.179 μs)   :  0, 9

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
memory_benchmark_sycl StreamMemory, placement Device, type Triad, size 10240
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title memory_benchmark_sycl StreamMemory, placement Device, type Triad, size 10240
    todayMarker off
    dateFormat  X
    axisFormat %s

    section StreamMemory(api=sycl<br>type=Triad<br>size=10KB<br>useEvents=0<br>contents=Zeros<br>memoryPlacement=Device)

        baseline (1.854 μs)   :  0, 1

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
api_overhead_benchmark_sycl ExecImmediateCopyQueue out of order from Device to Device, size 1024
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title api_overhead_benchmark_sycl ExecImmediateCopyQueue out of order from Device to Device, size 1024
    todayMarker off
    dateFormat  X
    axisFormat %s

    section ExecImmediateCopyQueue(api=sycl<br>IsCopyOnly=1<br>MeasureCompletionTime=0<br>src=Device<br>dst=Device<br>size=1KB<br>ioq=0)

        baseline (4.506 μs)   :  0, 4

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
api_overhead_benchmark_sycl ExecImmediateCopyQueue in order from Device to Host, size 1024
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title api_overhead_benchmark_sycl ExecImmediateCopyQueue in order from Device to Host, size 1024
    todayMarker off
    dateFormat  X
    axisFormat %s

    section ExecImmediateCopyQueue(api=sycl<br>IsCopyOnly=1<br>MeasureCompletionTime=0<br>src=Host<br>dst=Host<br>size=1KB<br>ioq=1)

        baseline (3.613 μs)   :  0, 3

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
miscellaneous_benchmark_sycl VectorSum
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title miscellaneous_benchmark_sycl VectorSum
    todayMarker off
    dateFormat  X
    axisFormat %s

    section VectorSum(api=sycl<br>numberOfElementsX=512<br>numberOfElementsY=256<br>numberOfElementsZ=256)

        baseline (863.651 μs)   :  0, 863

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Velocity-Bench Bitcracker
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Velocity-Bench Bitcracker
    todayMarker off
    dateFormat  X
    axisFormat %s

    section bitcracker

        baseline (35.8407 s)   :  0, 35

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Velocity-Bench CudaSift
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Velocity-Bench CudaSift
    todayMarker off
    dateFormat  X
    axisFormat %s

    section cudaSift

        baseline (283.294 ms)   :  0, 283

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Velocity-Bench Easywave
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Velocity-Bench Easywave
    todayMarker off
    dateFormat  X
    axisFormat %s

    section easywave

        baseline (457.0 ms)   :  0, 457

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Velocity-Bench QuickSilver
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Velocity-Bench QuickSilver
    todayMarker off
    dateFormat  X
    axisFormat %s

    section QuickSilver

        baseline (115.63 MMS/CTT)   :  0, 115

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Velocity-Bench Sobel Filter
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Velocity-Bench Sobel Filter
    todayMarker off
    dateFormat  X
    axisFormat %s

    section sobel_filter

        baseline (934.963 ms)   :  0, 934

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading

Details

Velocity-Bench Hashtable

Environment Variables:

Command:

/home/test-user/bench_workdir/hashtable/hashtable_sycl --no-verify

Output:

hashtable - total time for whole calculation: 0.504984 s
265.786070 million keys/second

Copy link

Compute Benchmarks level_zero run ():
https://github.com/oneapi-src/unified-runtime/actions/runs/10903371915
Job status: failure. Test status: success.

Summary

result is better

Benchmark This PR baseline
Velocity-Bench Hashtable 211.882006 178.291413
Runtime_BlockedTransform_iter_64_blocksize_256 0.356 -
Runtime_BlockedTransform_iter_256_blocksize_256 0.08600000000000001 -
Runtime_BlockedTransform_iter_128_blocksize_256 0.152 -
Runtime_BlockedTransform_iter_512_blocksize_256 0.08399999999999999 -
Polybench_2DConvolution 0.229 -
Polybench_2mm 1.239 -
api_overhead_benchmark_sycl SubmitKernel out of order - 50.631
api_overhead_benchmark_sycl SubmitKernel in order - 49.385
api_overhead_benchmark_ur SubmitKernel out of order - 31.93
api_overhead_benchmark_ur SubmitKernel in order - 28.586
memory_benchmark_sycl QueueInOrderMemcpy from Device to Device, size 1024 - 423.457
memory_benchmark_sycl QueueInOrderMemcpy from Host to Device, size 1024 - 253.906
memory_benchmark_sycl QueueMemcpy from Device to Device, size 1024 - 9.179
memory_benchmark_sycl StreamMemory, placement Device, type Triad, size 10240 - 1.854
api_overhead_benchmark_sycl ExecImmediateCopyQueue out of order from Device to Device, size 1024 - 4.506
api_overhead_benchmark_sycl ExecImmediateCopyQueue in order from Device to Host, size 1024 - 3.613
miscellaneous_benchmark_sycl VectorSum - 863.651
Velocity-Bench Bitcracker - 35.8407
Velocity-Bench CudaSift - 283.294
Velocity-Bench Easywave - 457.0
Velocity-Bench QuickSilver - 115.63
Velocity-Bench Sobel Filter - 934.963
Mean performance: 118.8% of baseline (higher is better)

Charts

Velocity-Bench Hashtable
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Velocity-Bench Hashtable
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Velocity-Bench<br>Hashtable

        This PR (211.882006 M keys/sec)   : crit, 0, 211

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

    section hashtable

        baseline (178.291413 M keys/sec)   :  0, 178

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Runtime_BlockedTransform_iter_64_blocksize_256
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Runtime_BlockedTransform_iter_64_blocksize_256
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Runtime_BlockedTransform_iter_64_blocksize_256

        This PR (0.356 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Runtime_BlockedTransform_iter_256_blocksize_256
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Runtime_BlockedTransform_iter_256_blocksize_256
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Runtime_BlockedTransform_iter_256_blocksize_256

        This PR (0.08600000000000001 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Runtime_BlockedTransform_iter_128_blocksize_256
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Runtime_BlockedTransform_iter_128_blocksize_256
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Runtime_BlockedTransform_iter_128_blocksize_256

        This PR (0.152 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Runtime_BlockedTransform_iter_512_blocksize_256
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Runtime_BlockedTransform_iter_512_blocksize_256
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Runtime_BlockedTransform_iter_512_blocksize_256

        This PR (0.08399999999999999 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Polybench_2DConvolution
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Polybench_2DConvolution
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Polybench_2DConvolution

        This PR (0.229 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Polybench_2mm
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Polybench_2mm
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Polybench_2mm

        This PR (1.239 ms)   : crit, 0, 1

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
api_overhead_benchmark_sycl SubmitKernel out of order
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title api_overhead_benchmark_sycl SubmitKernel out of order
    todayMarker off
    dateFormat  X
    axisFormat %s

    section SubmitKernel(api=sycl<br>Profiling=0<br>Ioq=0<br>DiscardEvents=0<br>NumKernels=10<br>KernelExecTime=1<br>MeasureCompletion=0)

        baseline (50.631 μs)   :  0, 50

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
api_overhead_benchmark_sycl SubmitKernel in order
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title api_overhead_benchmark_sycl SubmitKernel in order
    todayMarker off
    dateFormat  X
    axisFormat %s

    section SubmitKernel(api=sycl<br>Profiling=0<br>Ioq=1<br>DiscardEvents=0<br>NumKernels=10<br>KernelExecTime=1<br>MeasureCompletion=0)

        baseline (49.385 μs)   :  0, 49

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
api_overhead_benchmark_ur SubmitKernel out of order
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title api_overhead_benchmark_ur SubmitKernel out of order
    todayMarker off
    dateFormat  X
    axisFormat %s

    section SubmitKernel(api=ur<br>Profiling=0<br>Ioq=0<br>DiscardEvents=0<br>NumKernels=10<br>KernelExecTime=1<br>MeasureCompletion=0)

        baseline (31.93 μs)   :  0, 31

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
api_overhead_benchmark_ur SubmitKernel in order
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title api_overhead_benchmark_ur SubmitKernel in order
    todayMarker off
    dateFormat  X
    axisFormat %s

    section SubmitKernel(api=ur<br>Profiling=0<br>Ioq=1<br>DiscardEvents=0<br>NumKernels=10<br>KernelExecTime=1<br>MeasureCompletion=0)

        baseline (28.586 μs)   :  0, 28

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
memory_benchmark_sycl QueueInOrderMemcpy from Device to Device, size 1024
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title memory_benchmark_sycl QueueInOrderMemcpy from Device to Device, size 1024
    todayMarker off
    dateFormat  X
    axisFormat %s

    section QueueInOrderMemcpy(api=sycl<br>IsCopyOnly=0<br>sourcePlacement=Device<br>destinationPlacement=Device<br>size=1KB<br>count=100)

        baseline (423.457 μs)   :  0, 423

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
memory_benchmark_sycl QueueInOrderMemcpy from Host to Device, size 1024
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title memory_benchmark_sycl QueueInOrderMemcpy from Host to Device, size 1024
    todayMarker off
    dateFormat  X
    axisFormat %s

    section QueueInOrderMemcpy(api=sycl<br>IsCopyOnly=0<br>sourcePlacement=Host<br>destinationPlacement=Device<br>size=1KB<br>count=100)

        baseline (253.906 μs)   :  0, 253

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
memory_benchmark_sycl QueueMemcpy from Device to Device, size 1024
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title memory_benchmark_sycl QueueMemcpy from Device to Device, size 1024
    todayMarker off
    dateFormat  X
    axisFormat %s

    section QueueMemcpy(api=sycl<br>sourcePlacement=Device<br>destinationPlacement=Device<br>size=1KB)

        baseline (9.179 μs)   :  0, 9

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
memory_benchmark_sycl StreamMemory, placement Device, type Triad, size 10240
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title memory_benchmark_sycl StreamMemory, placement Device, type Triad, size 10240
    todayMarker off
    dateFormat  X
    axisFormat %s

    section StreamMemory(api=sycl<br>type=Triad<br>size=10KB<br>useEvents=0<br>contents=Zeros<br>memoryPlacement=Device)

        baseline (1.854 μs)   :  0, 1

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
api_overhead_benchmark_sycl ExecImmediateCopyQueue out of order from Device to Device, size 1024
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title api_overhead_benchmark_sycl ExecImmediateCopyQueue out of order from Device to Device, size 1024
    todayMarker off
    dateFormat  X
    axisFormat %s

    section ExecImmediateCopyQueue(api=sycl<br>IsCopyOnly=1<br>MeasureCompletionTime=0<br>src=Device<br>dst=Device<br>size=1KB<br>ioq=0)

        baseline (4.506 μs)   :  0, 4

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
api_overhead_benchmark_sycl ExecImmediateCopyQueue in order from Device to Host, size 1024
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title api_overhead_benchmark_sycl ExecImmediateCopyQueue in order from Device to Host, size 1024
    todayMarker off
    dateFormat  X
    axisFormat %s

    section ExecImmediateCopyQueue(api=sycl<br>IsCopyOnly=1<br>MeasureCompletionTime=0<br>src=Host<br>dst=Host<br>size=1KB<br>ioq=1)

        baseline (3.613 μs)   :  0, 3

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
miscellaneous_benchmark_sycl VectorSum
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title miscellaneous_benchmark_sycl VectorSum
    todayMarker off
    dateFormat  X
    axisFormat %s

    section VectorSum(api=sycl<br>numberOfElementsX=512<br>numberOfElementsY=256<br>numberOfElementsZ=256)

        baseline (863.651 μs)   :  0, 863

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Velocity-Bench Bitcracker
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Velocity-Bench Bitcracker
    todayMarker off
    dateFormat  X
    axisFormat %s

    section bitcracker

        baseline (35.8407 s)   :  0, 35

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Velocity-Bench CudaSift
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Velocity-Bench CudaSift
    todayMarker off
    dateFormat  X
    axisFormat %s

    section cudaSift

        baseline (283.294 ms)   :  0, 283

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Velocity-Bench Easywave
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Velocity-Bench Easywave
    todayMarker off
    dateFormat  X
    axisFormat %s

    section easywave

        baseline (457.0 ms)   :  0, 457

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Velocity-Bench QuickSilver
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Velocity-Bench QuickSilver
    todayMarker off
    dateFormat  X
    axisFormat %s

    section QuickSilver

        baseline (115.63 MMS/CTT)   :  0, 115

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Velocity-Bench Sobel Filter
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Velocity-Bench Sobel Filter
    todayMarker off
    dateFormat  X
    axisFormat %s

    section sobel_filter

        baseline (934.963 ms)   :  0, 934

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading

Details

Velocity-Bench Hashtable

Environment Variables:

Command:

/home/test-user/bench_workdir/hashtable/hashtable_sycl --no-verify

Output:

hashtable - total time for whole calculation: 0.633455 s
211.882006 million keys/second

Runtime_BlockedTransform_iter_64_blocksize_256

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/blocked_transform --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/BlockedTransform_multi.csv --size=512

Output:

Runtime_BlockedTransform_iter_256_blocksize_256

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/blocked_transform --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/BlockedTransform_multi.csv --size=512

Output:

Runtime_BlockedTransform_iter_128_blocksize_256

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/blocked_transform --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/BlockedTransform_multi.csv --size=512

Output:

Runtime_BlockedTransform_iter_512_blocksize_256

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/blocked_transform --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/BlockedTransform_multi.csv --size=512

Output:

Polybench_2DConvolution

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/2DConvolution --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/2DConvolution.csv

Output:

Polybench_2mm

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/2mm --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/2mm.csv --size=512

Output:

Copy link

Compute Benchmarks level_zero run (with params: ):
https://github.com/oneapi-src/unified-runtime/actions/runs/10904512313

Copy link

Compute Benchmarks level_zero run ():
https://github.com/oneapi-src/unified-runtime/actions/runs/10904512313
Job status: success. Test status: success.

Summary

result is better

Benchmark This PR baseline
Velocity-Bench Hashtable 205.335339 178.291413
Runtime_BlockedTransform_iter_128_blocksize_256 0.152 -
Runtime_BlockedTransform_iter_256_blocksize_256 0.085 -
Runtime_BlockedTransform_iter_512_blocksize_256 0.08399999999999999 -
Runtime_BlockedTransform_iter_64_blocksize_256 0.356 -
Polybench_2DConvolution 0.23 -
Polybench_2mm 1.239 -
api_overhead_benchmark_sycl SubmitKernel out of order - 50.631
api_overhead_benchmark_sycl SubmitKernel in order - 49.385
api_overhead_benchmark_ur SubmitKernel out of order - 31.93
api_overhead_benchmark_ur SubmitKernel in order - 28.586
memory_benchmark_sycl QueueInOrderMemcpy from Device to Device, size 1024 - 423.457
memory_benchmark_sycl QueueInOrderMemcpy from Host to Device, size 1024 - 253.906
memory_benchmark_sycl QueueMemcpy from Device to Device, size 1024 - 9.179
memory_benchmark_sycl StreamMemory, placement Device, type Triad, size 10240 - 1.854
api_overhead_benchmark_sycl ExecImmediateCopyQueue out of order from Device to Device, size 1024 - 4.506
api_overhead_benchmark_sycl ExecImmediateCopyQueue in order from Device to Host, size 1024 - 3.613
miscellaneous_benchmark_sycl VectorSum - 863.651
Velocity-Bench Bitcracker - 35.8407
Velocity-Bench CudaSift - 283.294
Velocity-Bench Easywave - 457.0
Velocity-Bench QuickSilver - 115.63
Velocity-Bench Sobel Filter - 934.963
Mean performance: 115.2% of baseline (higher is better)

Charts

Velocity-Bench Hashtable
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Velocity-Bench Hashtable
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Velocity-Bench<br>Hashtable

        This PR (205.335339 M keys/sec)   : crit, 0, 205

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

    section hashtable

        baseline (178.291413 M keys/sec)   :  0, 178

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Runtime_BlockedTransform_iter_128_blocksize_256
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Runtime_BlockedTransform_iter_128_blocksize_256
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Runtime_BlockedTransform_iter_128_blocksize_256

        This PR (0.152 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Runtime_BlockedTransform_iter_256_blocksize_256
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Runtime_BlockedTransform_iter_256_blocksize_256
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Runtime_BlockedTransform_iter_256_blocksize_256

        This PR (0.085 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Runtime_BlockedTransform_iter_512_blocksize_256
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Runtime_BlockedTransform_iter_512_blocksize_256
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Runtime_BlockedTransform_iter_512_blocksize_256

        This PR (0.08399999999999999 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Runtime_BlockedTransform_iter_64_blocksize_256
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Runtime_BlockedTransform_iter_64_blocksize_256
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Runtime_BlockedTransform_iter_64_blocksize_256

        This PR (0.356 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Polybench_2DConvolution
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Polybench_2DConvolution
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Polybench_2DConvolution

        This PR (0.23 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Polybench_2mm
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Polybench_2mm
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Polybench_2mm

        This PR (1.239 ms)   : crit, 0, 1

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
api_overhead_benchmark_sycl SubmitKernel out of order
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title api_overhead_benchmark_sycl SubmitKernel out of order
    todayMarker off
    dateFormat  X
    axisFormat %s

    section SubmitKernel(api=sycl<br>Profiling=0<br>Ioq=0<br>DiscardEvents=0<br>NumKernels=10<br>KernelExecTime=1<br>MeasureCompletion=0)

        baseline (50.631 μs)   :  0, 50

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
api_overhead_benchmark_sycl SubmitKernel in order
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title api_overhead_benchmark_sycl SubmitKernel in order
    todayMarker off
    dateFormat  X
    axisFormat %s

    section SubmitKernel(api=sycl<br>Profiling=0<br>Ioq=1<br>DiscardEvents=0<br>NumKernels=10<br>KernelExecTime=1<br>MeasureCompletion=0)

        baseline (49.385 μs)   :  0, 49

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
api_overhead_benchmark_ur SubmitKernel out of order
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title api_overhead_benchmark_ur SubmitKernel out of order
    todayMarker off
    dateFormat  X
    axisFormat %s

    section SubmitKernel(api=ur<br>Profiling=0<br>Ioq=0<br>DiscardEvents=0<br>NumKernels=10<br>KernelExecTime=1<br>MeasureCompletion=0)

        baseline (31.93 μs)   :  0, 31

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
api_overhead_benchmark_ur SubmitKernel in order
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title api_overhead_benchmark_ur SubmitKernel in order
    todayMarker off
    dateFormat  X
    axisFormat %s

    section SubmitKernel(api=ur<br>Profiling=0<br>Ioq=1<br>DiscardEvents=0<br>NumKernels=10<br>KernelExecTime=1<br>MeasureCompletion=0)

        baseline (28.586 μs)   :  0, 28

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
memory_benchmark_sycl QueueInOrderMemcpy from Device to Device, size 1024
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title memory_benchmark_sycl QueueInOrderMemcpy from Device to Device, size 1024
    todayMarker off
    dateFormat  X
    axisFormat %s

    section QueueInOrderMemcpy(api=sycl<br>IsCopyOnly=0<br>sourcePlacement=Device<br>destinationPlacement=Device<br>size=1KB<br>count=100)

        baseline (423.457 μs)   :  0, 423

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
memory_benchmark_sycl QueueInOrderMemcpy from Host to Device, size 1024
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title memory_benchmark_sycl QueueInOrderMemcpy from Host to Device, size 1024
    todayMarker off
    dateFormat  X
    axisFormat %s

    section QueueInOrderMemcpy(api=sycl<br>IsCopyOnly=0<br>sourcePlacement=Host<br>destinationPlacement=Device<br>size=1KB<br>count=100)

        baseline (253.906 μs)   :  0, 253

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
memory_benchmark_sycl QueueMemcpy from Device to Device, size 1024
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title memory_benchmark_sycl QueueMemcpy from Device to Device, size 1024
    todayMarker off
    dateFormat  X
    axisFormat %s

    section QueueMemcpy(api=sycl<br>sourcePlacement=Device<br>destinationPlacement=Device<br>size=1KB)

        baseline (9.179 μs)   :  0, 9

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
memory_benchmark_sycl StreamMemory, placement Device, type Triad, size 10240
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title memory_benchmark_sycl StreamMemory, placement Device, type Triad, size 10240
    todayMarker off
    dateFormat  X
    axisFormat %s

    section StreamMemory(api=sycl<br>type=Triad<br>size=10KB<br>useEvents=0<br>contents=Zeros<br>memoryPlacement=Device)

        baseline (1.854 μs)   :  0, 1

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
api_overhead_benchmark_sycl ExecImmediateCopyQueue out of order from Device to Device, size 1024
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title api_overhead_benchmark_sycl ExecImmediateCopyQueue out of order from Device to Device, size 1024
    todayMarker off
    dateFormat  X
    axisFormat %s

    section ExecImmediateCopyQueue(api=sycl<br>IsCopyOnly=1<br>MeasureCompletionTime=0<br>src=Device<br>dst=Device<br>size=1KB<br>ioq=0)

        baseline (4.506 μs)   :  0, 4

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
api_overhead_benchmark_sycl ExecImmediateCopyQueue in order from Device to Host, size 1024
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title api_overhead_benchmark_sycl ExecImmediateCopyQueue in order from Device to Host, size 1024
    todayMarker off
    dateFormat  X
    axisFormat %s

    section ExecImmediateCopyQueue(api=sycl<br>IsCopyOnly=1<br>MeasureCompletionTime=0<br>src=Host<br>dst=Host<br>size=1KB<br>ioq=1)

        baseline (3.613 μs)   :  0, 3

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
miscellaneous_benchmark_sycl VectorSum
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title miscellaneous_benchmark_sycl VectorSum
    todayMarker off
    dateFormat  X
    axisFormat %s

    section VectorSum(api=sycl<br>numberOfElementsX=512<br>numberOfElementsY=256<br>numberOfElementsZ=256)

        baseline (863.651 μs)   :  0, 863

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Velocity-Bench Bitcracker
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Velocity-Bench Bitcracker
    todayMarker off
    dateFormat  X
    axisFormat %s

    section bitcracker

        baseline (35.8407 s)   :  0, 35

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Velocity-Bench CudaSift
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Velocity-Bench CudaSift
    todayMarker off
    dateFormat  X
    axisFormat %s

    section cudaSift

        baseline (283.294 ms)   :  0, 283

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Velocity-Bench Easywave
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Velocity-Bench Easywave
    todayMarker off
    dateFormat  X
    axisFormat %s

    section easywave

        baseline (457.0 ms)   :  0, 457

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Velocity-Bench QuickSilver
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Velocity-Bench QuickSilver
    todayMarker off
    dateFormat  X
    axisFormat %s

    section QuickSilver

        baseline (115.63 MMS/CTT)   :  0, 115

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Velocity-Bench Sobel Filter
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Velocity-Bench Sobel Filter
    todayMarker off
    dateFormat  X
    axisFormat %s

    section sobel_filter

        baseline (934.963 ms)   :  0, 934

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading

Details

Velocity-Bench Hashtable

Environment Variables:

Command:

/home/test-user/bench_workdir/hashtable/hashtable_sycl --no-verify

Output:

hashtable - total time for whole calculation: 0.653651 s
205.335339 million keys/second

Runtime_BlockedTransform_iter_128_blocksize_256

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/blocked_transform --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/BlockedTransform_multi.csv --size=512

Output:

Runtime_BlockedTransform_iter_256_blocksize_256

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/blocked_transform --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/BlockedTransform_multi.csv --size=512

Output:

Runtime_BlockedTransform_iter_512_blocksize_256

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/blocked_transform --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/BlockedTransform_multi.csv --size=512

Output:

Runtime_BlockedTransform_iter_64_blocksize_256

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/blocked_transform --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/BlockedTransform_multi.csv --size=512

Output:

Polybench_2DConvolution

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/2DConvolution --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/2DConvolution.csv

Output:

Polybench_2mm

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/2mm --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/2mm.csv --size=512

Output:

Copy link

Compute Benchmarks level_zero run (with params: ):
https://github.com/oneapi-src/unified-runtime/actions/runs/10907515957

Copy link

Compute Benchmarks level_zero run ():
https://github.com/oneapi-src/unified-runtime/actions/runs/10907515957
Job status: cancelled. Test status: skipped.

Copy link

Compute Benchmarks level_zero run (with params: ):
https://github.com/oneapi-src/unified-runtime/actions/runs/10940796548

Copy link

Compute Benchmarks level_zero run ():
https://github.com/oneapi-src/unified-runtime/actions/runs/10940796548
Job status: failure. Test status: failure.

Copy link

Compute Benchmarks level_zero run (with params: ):
https://github.com/oneapi-src/unified-runtime/actions/runs/10940914306

Copy link

Compute Benchmarks level_zero run ():
https://github.com/oneapi-src/unified-runtime/actions/runs/10940914306
Job status: success. Test status: success.

Summary

result is better

Benchmark This PR baseline
Velocity-Bench Hashtable 205.134803 178.291413
Velocity-Bench Easywave 445 457.0
Velocity-Bench CudaSift 278.061 283.294
Velocity-Bench Bitcracker 35.5238 35.8407
Velocity-Bench Sobel Filter 990.716 934.963
Velocity-Bench QuickSilver 89.64 115.63
Runtime_BlockedTransform_iter_256_blocksize_256 0.085 -
Runtime_BlockedTransform_iter_128_blocksize_256 0.152 -
Runtime_BlockedTransform_iter_64_blocksize_256 0.36200000000000004 -
Runtime_BlockedTransform_iter_512_blocksize_256 0.08399999999999999 -
Runtime_IndependentDAGTaskThroughput_BasicParallelFor 5.8 -
Runtime_IndependentDAGTaskThroughput_NDRangeParallelFor 5.61 -
Runtime_IndependentDAGTaskThroughput_HierarchicalParallelFor 5.602 -
Runtime_IndependentDAGTaskThroughput_SingleTask 6.5729999999999995 -
Runtime_DAGTaskThroughput_SingleTask 6.674 -
Runtime_DAGTaskThroughput_NDRangeParallelFor 4.976999999999999 -
Runtime_DAGTaskThroughput_HierarchicalParallelFor 5.3709999999999996 -
Runtime_DAGTaskThroughput_BasicParallelFor 6.074000000000001 -
MicroBench_LocalMem_fp32_4096 0.2 -
MicroBench_LocalMem_int32_4096 0.229 -
MicroBench_L2_fp32_4 0.026 -
MicroBench_L2_fp32_16 0.026 -
MicroBench_L2_fp32_1 0.026 -
MicroBench_L2_fp32_8 0.026 -
MicroBench_L2_int32_2 0.027 -
MicroBench_L2_int32_16 0.026 -
MicroBench_L2_int32_4 0.026 -
MicroBench_L2_int32_8 0.026 -
MicroBench_L2_int32_1 0.034 -
MicroBench_L2_fp32_2 0.029 -
Pattern_Reduction_Hierarchical_int32 0.052 -
Pattern_Reduction_NDRange_int64 0.052 -
Pattern_Reduction_NDRange_fp32 0.026 -
Pattern_Reduction_NDRange_int32 0.073 -
Pattern_Reduction_Hierarchical_int64 0.052 -
Pattern_Reduction_Hierarchical_fp32 0.052 -
ScalarProduct_Hierarchical_int64 0.063 -
ScalarProduct_NDRange_fp32 0.04 -
ScalarProduct_Hierarchical_fp32 0.059 -
ScalarProduct_NDRange_int32 0.15100000000000002 -
ScalarProduct_NDRange_int64 0.098 -
ScalarProduct_Hierarchical_int32 0.062 -
Pattern_SegmentedReduction_NDRange_int64 0.018000000000000002 -
Pattern_SegmentedReduction_Hierarchical_int16 0.030000000000000002 -
Pattern_SegmentedReduction_NDRange_int32 0.027 -
Pattern_SegmentedReduction_Hierarchical_int32 0.028 -
Pattern_SegmentedReduction_Hierarchical_int64 0.029 -
Pattern_SegmentedReduction_NDRange_fp32 0.014 -
Pattern_SegmentedReduction_NDRange_int16 0.056 -
Pattern_SegmentedReduction_Hierarchical_fp32 0.030000000000000002 -
USM_Latency_fp32_out_of_order__ 47.133 -
SYCL2020_Accessors_Latency_fp32_out_of_order__ 71.05 -
SYCL2020_Accessors_Latency_fp32_in_order__ 69.305 -
USM_Latency_fp32_in_order__ 33.798 -
USM_Allocation_latency_fp32_device 0.009000000000000001 -
USM_Allocation_latency_fp32_host 0.002 -
USM_Allocation_latency_fp32_shared 0.11699999999999999 -
USM_Instr_Mix_fp32_shared_1:1mix_with_init_with_prefetch 15.211 -
USM_Instr_Mix_fp32_host_1:1mix_no_init_no_prefetch 1.849 -
USM_Instr_Mix_fp32_shared_1:1mix_with_init_no_prefetch 14.097999999999999 -
USM_Instr_Mix_fp32_shared_1:1mix_no_init_with_prefetch 15.334999999999999 -
USM_Instr_Mix_fp32_device_1:1mix_no_init_no_prefetch 3.09 -
USM_Instr_Mix_fp32_shared_1:1mix_no_init_no_prefetch 13.643 -
USM_Instr_Mix_fp32_device_1:1mix_with_init_no_prefetch 3.2169999999999996 -
USM_Instr_Mix_fp32_host_1:1mix_with_init_no_prefetch 1.718 -
USM_Pinned_Overhead_fp32_HostDevice_Pinned_Init_1 0.012 -
USM_Pinned_Overhead_fp32_DeviceHost_Pinned_Init_1 0.019 -
USM_Pinned_Overhead_fp32_DeviceHost_NonPinned_Init_1 0.42700000000000005 -
USM_Pinned_Overhead_fp32_HostDevice_NonPinned_Init_1 0.016 -
VectorAddition_fp32 0.032 -
VectorAddition_int64 0.04 -
VectorAddition_int32 0.037 -
Polybench_2DConvolution 0.23 -
Polybench_2mm 1.239 -
Polybench_3mm 1.747 -
MicroBench_Arith_int32_512 0.073 -
MicroBench_Arith_fp32_512 0.032 -
Polybench_Atax 6.904000000000001 -
ReductionAtomic_int32 0.041999999999999996 -
ReductionAtomic_int64 0.041 -
ReductionAtomic_fp64 0.044 -
ReductionAtomic_fp32 0.041 -
Polybench_Bicg 5.123 -
Polybench_Correlation 94.999 -
Polybench_Covariance 94.518 -
Polybench_Gemm 3.973 -
Polybench_Gesummv 7.3229999999999995 -
Polybench_Gramschmidt 285.055 -
Kmeans_fp32 1.7930000000000001 -
LinearRegressionCoeff_fp32 1.3339999999999999 -
LinearRegression_fp32 0.358 -
MatmulChain 11.028 -
MolecularDynamics 0.066 -
Polybench_Mvt 3.641 -
MicroBench_sf_fp32_16 0.026 -
Polybench_Syr2k 6.2989999999999995 -
Polybench_Syrk 3.211 -
api_overhead_benchmark_sycl SubmitKernel out of order - 50.631
api_overhead_benchmark_sycl SubmitKernel in order - 49.385
api_overhead_benchmark_ur SubmitKernel out of order - 31.93
api_overhead_benchmark_ur SubmitKernel in order - 28.586
memory_benchmark_sycl QueueInOrderMemcpy from Device to Device, size 1024 - 423.457
memory_benchmark_sycl QueueInOrderMemcpy from Host to Device, size 1024 - 253.906
memory_benchmark_sycl QueueMemcpy from Device to Device, size 1024 - 9.179
memory_benchmark_sycl StreamMemory, placement Device, type Triad, size 10240 - 1.854
api_overhead_benchmark_sycl ExecImmediateCopyQueue out of order from Device to Device, size 1024 - 4.506
api_overhead_benchmark_sycl ExecImmediateCopyQueue in order from Device to Host, size 1024 - 3.613
miscellaneous_benchmark_sycl VectorSum - 863.651

Charts

Velocity-Bench Hashtable
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Velocity-Bench Hashtable
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Velocity-Bench<br>Hashtable

        This PR (205.134803 M keys/sec)   : crit, 0, 205

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

    section hashtable

        baseline (178.291413 M keys/sec)   :  0, 178

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Velocity-Bench Bitcracker
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Velocity-Bench Bitcracker
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Velocity-Bench<br>Bitcracker

        This PR (35.5238 s)   : crit, 0, 35

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

    section bitcracker

        baseline (35.8407 s)   :  0, 35

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Velocity-Bench CudaSift
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Velocity-Bench CudaSift
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Velocity-Bench<br>CudaSift

        This PR (278.061 ms)   : crit, 0, 278

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

    section cudaSift

        baseline (283.294 ms)   :  0, 283

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Velocity-Bench Easywave
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Velocity-Bench Easywave
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Velocity-Bench<br>Easywave

        This PR (445 ms)   : crit, 0, 445

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

    section easywave

        baseline (457.0 ms)   :  0, 457

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Velocity-Bench QuickSilver
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Velocity-Bench QuickSilver
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Velocity-Bench<br>QuickSilver

        This PR (89.64 MMS/CTT)   : crit, 0, 89

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

    section QuickSilver

        baseline (115.63 MMS/CTT)   :  0, 115

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Velocity-Bench Sobel Filter
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Velocity-Bench Sobel Filter
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Velocity-Bench<br>Sobel<br>Filter

        This PR (990.716 ms)   : crit, 0, 990

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

    section sobel_filter

        baseline (934.963 ms)   :  0, 934

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Runtime_BlockedTransform_iter_256_blocksize_256
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Runtime_BlockedTransform_iter_256_blocksize_256
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Runtime_BlockedTransform_iter_256_blocksize_256

        This PR (0.085 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Runtime_BlockedTransform_iter_128_blocksize_256
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Runtime_BlockedTransform_iter_128_blocksize_256
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Runtime_BlockedTransform_iter_128_blocksize_256

        This PR (0.152 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Runtime_BlockedTransform_iter_64_blocksize_256
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Runtime_BlockedTransform_iter_64_blocksize_256
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Runtime_BlockedTransform_iter_64_blocksize_256

        This PR (0.36200000000000004 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Runtime_BlockedTransform_iter_512_blocksize_256
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Runtime_BlockedTransform_iter_512_blocksize_256
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Runtime_BlockedTransform_iter_512_blocksize_256

        This PR (0.08399999999999999 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Runtime_IndependentDAGTaskThroughput_BasicParallelFor
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Runtime_IndependentDAGTaskThroughput_BasicParallelFor
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Runtime_IndependentDAGTaskThroughput_BasicParallelFor

        This PR (5.8 ms)   : crit, 0, 5

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Runtime_IndependentDAGTaskThroughput_NDRangeParallelFor
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Runtime_IndependentDAGTaskThroughput_NDRangeParallelFor
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Runtime_IndependentDAGTaskThroughput_NDRangeParallelFor

        This PR (5.61 ms)   : crit, 0, 5

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Runtime_IndependentDAGTaskThroughput_HierarchicalParallelFor
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Runtime_IndependentDAGTaskThroughput_HierarchicalParallelFor
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Runtime_IndependentDAGTaskThroughput_HierarchicalParallelFor

        This PR (5.602 ms)   : crit, 0, 5

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Runtime_IndependentDAGTaskThroughput_SingleTask
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Runtime_IndependentDAGTaskThroughput_SingleTask
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Runtime_IndependentDAGTaskThroughput_SingleTask

        This PR (6.5729999999999995 ms)   : crit, 0, 6

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Runtime_DAGTaskThroughput_SingleTask
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Runtime_DAGTaskThroughput_SingleTask
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Runtime_DAGTaskThroughput_SingleTask

        This PR (6.674 ms)   : crit, 0, 6

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Runtime_DAGTaskThroughput_NDRangeParallelFor
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Runtime_DAGTaskThroughput_NDRangeParallelFor
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Runtime_DAGTaskThroughput_NDRangeParallelFor

        This PR (4.976999999999999 ms)   : crit, 0, 4

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Runtime_DAGTaskThroughput_HierarchicalParallelFor
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Runtime_DAGTaskThroughput_HierarchicalParallelFor
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Runtime_DAGTaskThroughput_HierarchicalParallelFor

        This PR (5.3709999999999996 ms)   : crit, 0, 5

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Runtime_DAGTaskThroughput_BasicParallelFor
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Runtime_DAGTaskThroughput_BasicParallelFor
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Runtime_DAGTaskThroughput_BasicParallelFor

        This PR (6.074000000000001 ms)   : crit, 0, 6

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
MicroBench_LocalMem_fp32_4096
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title MicroBench_LocalMem_fp32_4096
    todayMarker off
    dateFormat  X
    axisFormat %s

    section MicroBench_LocalMem_fp32_4096

        This PR (0.2 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
MicroBench_LocalMem_int32_4096
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title MicroBench_LocalMem_int32_4096
    todayMarker off
    dateFormat  X
    axisFormat %s

    section MicroBench_LocalMem_int32_4096

        This PR (0.229 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
MicroBench_L2_fp32_4
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title MicroBench_L2_fp32_4
    todayMarker off
    dateFormat  X
    axisFormat %s

    section MicroBench_L2_fp32_4

        This PR (0.026 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
MicroBench_L2_fp32_16
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title MicroBench_L2_fp32_16
    todayMarker off
    dateFormat  X
    axisFormat %s

    section MicroBench_L2_fp32_16

        This PR (0.026 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
MicroBench_L2_fp32_1
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title MicroBench_L2_fp32_1
    todayMarker off
    dateFormat  X
    axisFormat %s

    section MicroBench_L2_fp32_1

        This PR (0.026 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
MicroBench_L2_fp32_8
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title MicroBench_L2_fp32_8
    todayMarker off
    dateFormat  X
    axisFormat %s

    section MicroBench_L2_fp32_8

        This PR (0.026 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
MicroBench_L2_int32_2
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title MicroBench_L2_int32_2
    todayMarker off
    dateFormat  X
    axisFormat %s

    section MicroBench_L2_int32_2

        This PR (0.027 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
MicroBench_L2_int32_16
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title MicroBench_L2_int32_16
    todayMarker off
    dateFormat  X
    axisFormat %s

    section MicroBench_L2_int32_16

        This PR (0.026 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
MicroBench_L2_int32_4
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title MicroBench_L2_int32_4
    todayMarker off
    dateFormat  X
    axisFormat %s

    section MicroBench_L2_int32_4

        This PR (0.026 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
MicroBench_L2_int32_8
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title MicroBench_L2_int32_8
    todayMarker off
    dateFormat  X
    axisFormat %s

    section MicroBench_L2_int32_8

        This PR (0.026 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
MicroBench_L2_int32_1
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title MicroBench_L2_int32_1
    todayMarker off
    dateFormat  X
    axisFormat %s

    section MicroBench_L2_int32_1

        This PR (0.034 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
MicroBench_L2_fp32_2
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title MicroBench_L2_fp32_2
    todayMarker off
    dateFormat  X
    axisFormat %s

    section MicroBench_L2_fp32_2

        This PR (0.029 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Pattern_Reduction_Hierarchical_int32
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Pattern_Reduction_Hierarchical_int32
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Pattern_Reduction_Hierarchical_int32

        This PR (0.052 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Pattern_Reduction_NDRange_int64
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Pattern_Reduction_NDRange_int64
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Pattern_Reduction_NDRange_int64

        This PR (0.052 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Pattern_Reduction_NDRange_fp32
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Pattern_Reduction_NDRange_fp32
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Pattern_Reduction_NDRange_fp32

        This PR (0.026 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Pattern_Reduction_NDRange_int32
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Pattern_Reduction_NDRange_int32
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Pattern_Reduction_NDRange_int32

        This PR (0.073 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Pattern_Reduction_Hierarchical_int64
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Pattern_Reduction_Hierarchical_int64
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Pattern_Reduction_Hierarchical_int64

        This PR (0.052 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Pattern_Reduction_Hierarchical_fp32
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Pattern_Reduction_Hierarchical_fp32
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Pattern_Reduction_Hierarchical_fp32

        This PR (0.052 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
ScalarProduct_Hierarchical_int64
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title ScalarProduct_Hierarchical_int64
    todayMarker off
    dateFormat  X
    axisFormat %s

    section ScalarProduct_Hierarchical_int64

        This PR (0.063 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
ScalarProduct_NDRange_fp32
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title ScalarProduct_NDRange_fp32
    todayMarker off
    dateFormat  X
    axisFormat %s

    section ScalarProduct_NDRange_fp32

        This PR (0.04 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
ScalarProduct_Hierarchical_fp32
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title ScalarProduct_Hierarchical_fp32
    todayMarker off
    dateFormat  X
    axisFormat %s

    section ScalarProduct_Hierarchical_fp32

        This PR (0.059 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
ScalarProduct_NDRange_int32
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title ScalarProduct_NDRange_int32
    todayMarker off
    dateFormat  X
    axisFormat %s

    section ScalarProduct_NDRange_int32

        This PR (0.15100000000000002 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
ScalarProduct_NDRange_int64
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title ScalarProduct_NDRange_int64
    todayMarker off
    dateFormat  X
    axisFormat %s

    section ScalarProduct_NDRange_int64

        This PR (0.098 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
ScalarProduct_Hierarchical_int32
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title ScalarProduct_Hierarchical_int32
    todayMarker off
    dateFormat  X
    axisFormat %s

    section ScalarProduct_Hierarchical_int32

        This PR (0.062 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Pattern_SegmentedReduction_NDRange_int64
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Pattern_SegmentedReduction_NDRange_int64
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Pattern_SegmentedReduction_NDRange_int64

        This PR (0.018000000000000002 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Pattern_SegmentedReduction_Hierarchical_int16
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Pattern_SegmentedReduction_Hierarchical_int16
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Pattern_SegmentedReduction_Hierarchical_int16

        This PR (0.030000000000000002 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Pattern_SegmentedReduction_NDRange_int32
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Pattern_SegmentedReduction_NDRange_int32
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Pattern_SegmentedReduction_NDRange_int32

        This PR (0.027 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Pattern_SegmentedReduction_Hierarchical_int32
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Pattern_SegmentedReduction_Hierarchical_int32
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Pattern_SegmentedReduction_Hierarchical_int32

        This PR (0.028 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Pattern_SegmentedReduction_Hierarchical_int64
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Pattern_SegmentedReduction_Hierarchical_int64
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Pattern_SegmentedReduction_Hierarchical_int64

        This PR (0.029 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Pattern_SegmentedReduction_NDRange_fp32
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Pattern_SegmentedReduction_NDRange_fp32
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Pattern_SegmentedReduction_NDRange_fp32

        This PR (0.014 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Pattern_SegmentedReduction_NDRange_int16
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Pattern_SegmentedReduction_NDRange_int16
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Pattern_SegmentedReduction_NDRange_int16

        This PR (0.056 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Pattern_SegmentedReduction_Hierarchical_fp32
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Pattern_SegmentedReduction_Hierarchical_fp32
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Pattern_SegmentedReduction_Hierarchical_fp32

        This PR (0.030000000000000002 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
USM_Latency_fp32_out_of_order__
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title USM_Latency_fp32_out_of_order__
    todayMarker off
    dateFormat  X
    axisFormat %s

    section USM_Latency_fp32_out_of_order__

        This PR (47.133 ms)   : crit, 0, 47

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
SYCL2020_Accessors_Latency_fp32_out_of_order__
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title SYCL2020_Accessors_Latency_fp32_out_of_order__
    todayMarker off
    dateFormat  X
    axisFormat %s

    section SYCL2020_Accessors_Latency_fp32_out_of_order__

        This PR (71.05 ms)   : crit, 0, 71

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
SYCL2020_Accessors_Latency_fp32_in_order__
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title SYCL2020_Accessors_Latency_fp32_in_order__
    todayMarker off
    dateFormat  X
    axisFormat %s

    section SYCL2020_Accessors_Latency_fp32_in_order__

        This PR (69.305 ms)   : crit, 0, 69

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
USM_Latency_fp32_in_order__
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title USM_Latency_fp32_in_order__
    todayMarker off
    dateFormat  X
    axisFormat %s

    section USM_Latency_fp32_in_order__

        This PR (33.798 ms)   : crit, 0, 33

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
USM_Allocation_latency_fp32_device
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title USM_Allocation_latency_fp32_device
    todayMarker off
    dateFormat  X
    axisFormat %s

    section USM_Allocation_latency_fp32_device

        This PR (0.009000000000000001 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
USM_Allocation_latency_fp32_host
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title USM_Allocation_latency_fp32_host
    todayMarker off
    dateFormat  X
    axisFormat %s

    section USM_Allocation_latency_fp32_host

        This PR (0.002 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
USM_Allocation_latency_fp32_shared
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title USM_Allocation_latency_fp32_shared
    todayMarker off
    dateFormat  X
    axisFormat %s

    section USM_Allocation_latency_fp32_shared

        This PR (0.11699999999999999 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
USM_Instr_Mix_fp32_shared_1:1mix_with_init_with_prefetch
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title USM_Instr_Mix_fp32_shared_1:1mix_with_init_with_prefetch
    todayMarker off
    dateFormat  X
    axisFormat %s

    section USM_Instr_Mix_fp32_shared_1:1mix_with_init_with_prefetch

        This PR (15.211 ms)   : crit, 0, 15

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
USM_Instr_Mix_fp32_host_1:1mix_no_init_no_prefetch
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title USM_Instr_Mix_fp32_host_1:1mix_no_init_no_prefetch
    todayMarker off
    dateFormat  X
    axisFormat %s

    section USM_Instr_Mix_fp32_host_1:1mix_no_init_no_prefetch

        This PR (1.849 ms)   : crit, 0, 1

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
USM_Instr_Mix_fp32_shared_1:1mix_with_init_no_prefetch
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title USM_Instr_Mix_fp32_shared_1:1mix_with_init_no_prefetch
    todayMarker off
    dateFormat  X
    axisFormat %s

    section USM_Instr_Mix_fp32_shared_1:1mix_with_init_no_prefetch

        This PR (14.097999999999999 ms)   : crit, 0, 14

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
USM_Instr_Mix_fp32_shared_1:1mix_no_init_with_prefetch
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title USM_Instr_Mix_fp32_shared_1:1mix_no_init_with_prefetch
    todayMarker off
    dateFormat  X
    axisFormat %s

    section USM_Instr_Mix_fp32_shared_1:1mix_no_init_with_prefetch

        This PR (15.334999999999999 ms)   : crit, 0, 15

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
USM_Instr_Mix_fp32_device_1:1mix_no_init_no_prefetch
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title USM_Instr_Mix_fp32_device_1:1mix_no_init_no_prefetch
    todayMarker off
    dateFormat  X
    axisFormat %s

    section USM_Instr_Mix_fp32_device_1:1mix_no_init_no_prefetch

        This PR (3.09 ms)   : crit, 0, 3

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
USM_Instr_Mix_fp32_shared_1:1mix_no_init_no_prefetch
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title USM_Instr_Mix_fp32_shared_1:1mix_no_init_no_prefetch
    todayMarker off
    dateFormat  X
    axisFormat %s

    section USM_Instr_Mix_fp32_shared_1:1mix_no_init_no_prefetch

        This PR (13.643 ms)   : crit, 0, 13

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
USM_Instr_Mix_fp32_device_1:1mix_with_init_no_prefetch
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title USM_Instr_Mix_fp32_device_1:1mix_with_init_no_prefetch
    todayMarker off
    dateFormat  X
    axisFormat %s

    section USM_Instr_Mix_fp32_device_1:1mix_with_init_no_prefetch

        This PR (3.2169999999999996 ms)   : crit, 0, 3

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
USM_Instr_Mix_fp32_host_1:1mix_with_init_no_prefetch
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title USM_Instr_Mix_fp32_host_1:1mix_with_init_no_prefetch
    todayMarker off
    dateFormat  X
    axisFormat %s

    section USM_Instr_Mix_fp32_host_1:1mix_with_init_no_prefetch

        This PR (1.718 ms)   : crit, 0, 1

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
USM_Pinned_Overhead_fp32_HostDevice_Pinned_Init_1
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title USM_Pinned_Overhead_fp32_HostDevice_Pinned_Init_1
    todayMarker off
    dateFormat  X
    axisFormat %s

    section USM_Pinned_Overhead_fp32_HostDevice_Pinned_Init_1

        This PR (0.012 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
USM_Pinned_Overhead_fp32_DeviceHost_Pinned_Init_1
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title USM_Pinned_Overhead_fp32_DeviceHost_Pinned_Init_1
    todayMarker off
    dateFormat  X
    axisFormat %s

    section USM_Pinned_Overhead_fp32_DeviceHost_Pinned_Init_1

        This PR (0.019 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
USM_Pinned_Overhead_fp32_DeviceHost_NonPinned_Init_1
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title USM_Pinned_Overhead_fp32_DeviceHost_NonPinned_Init_1
    todayMarker off
    dateFormat  X
    axisFormat %s

    section USM_Pinned_Overhead_fp32_DeviceHost_NonPinned_Init_1

        This PR (0.42700000000000005 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
USM_Pinned_Overhead_fp32_HostDevice_NonPinned_Init_1
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title USM_Pinned_Overhead_fp32_HostDevice_NonPinned_Init_1
    todayMarker off
    dateFormat  X
    axisFormat %s

    section USM_Pinned_Overhead_fp32_HostDevice_NonPinned_Init_1

        This PR (0.016 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
VectorAddition_fp32
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title VectorAddition_fp32
    todayMarker off
    dateFormat  X
    axisFormat %s

    section VectorAddition_fp32

        This PR (0.032 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
VectorAddition_int64
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title VectorAddition_int64
    todayMarker off
    dateFormat  X
    axisFormat %s

    section VectorAddition_int64

        This PR (0.04 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
VectorAddition_int32
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title VectorAddition_int32
    todayMarker off
    dateFormat  X
    axisFormat %s

    section VectorAddition_int32

        This PR (0.037 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Polybench_2DConvolution
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Polybench_2DConvolution
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Polybench_2DConvolution

        This PR (0.23 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Polybench_2mm
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Polybench_2mm
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Polybench_2mm

        This PR (1.239 ms)   : crit, 0, 1

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Polybench_3mm
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Polybench_3mm
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Polybench_3mm

        This PR (1.747 ms)   : crit, 0, 1

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
MicroBench_Arith_int32_512
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title MicroBench_Arith_int32_512
    todayMarker off
    dateFormat  X
    axisFormat %s

    section MicroBench_Arith_int32_512

        This PR (0.073 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
MicroBench_Arith_fp32_512
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title MicroBench_Arith_fp32_512
    todayMarker off
    dateFormat  X
    axisFormat %s

    section MicroBench_Arith_fp32_512

        This PR (0.032 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Polybench_Atax
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Polybench_Atax
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Polybench_Atax

        This PR (6.904000000000001 ms)   : crit, 0, 6

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
ReductionAtomic_int32
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title ReductionAtomic_int32
    todayMarker off
    dateFormat  X
    axisFormat %s

    section ReductionAtomic_int32

        This PR (0.041999999999999996 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
ReductionAtomic_int64
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title ReductionAtomic_int64
    todayMarker off
    dateFormat  X
    axisFormat %s

    section ReductionAtomic_int64

        This PR (0.041 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
ReductionAtomic_fp64
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title ReductionAtomic_fp64
    todayMarker off
    dateFormat  X
    axisFormat %s

    section ReductionAtomic_fp64

        This PR (0.044 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
ReductionAtomic_fp32
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title ReductionAtomic_fp32
    todayMarker off
    dateFormat  X
    axisFormat %s

    section ReductionAtomic_fp32

        This PR (0.041 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Polybench_Bicg
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Polybench_Bicg
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Polybench_Bicg

        This PR (5.123 ms)   : crit, 0, 5

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Polybench_Correlation
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Polybench_Correlation
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Polybench_Correlation

        This PR (94.999 ms)   : crit, 0, 94

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Polybench_Covariance
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Polybench_Covariance
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Polybench_Covariance

        This PR (94.518 ms)   : crit, 0, 94

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Polybench_Gemm
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Polybench_Gemm
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Polybench_Gemm

        This PR (3.973 ms)   : crit, 0, 3

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Polybench_Gesummv
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Polybench_Gesummv
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Polybench_Gesummv

        This PR (7.3229999999999995 ms)   : crit, 0, 7

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Polybench_Gramschmidt
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Polybench_Gramschmidt
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Polybench_Gramschmidt

        This PR (285.055 ms)   : crit, 0, 285

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Kmeans_fp32
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Kmeans_fp32
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Kmeans_fp32

        This PR (1.7930000000000001 ms)   : crit, 0, 1

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
LinearRegressionCoeff_fp32
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title LinearRegressionCoeff_fp32
    todayMarker off
    dateFormat  X
    axisFormat %s

    section LinearRegressionCoeff_fp32

        This PR (1.3339999999999999 ms)   : crit, 0, 1

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
LinearRegression_fp32
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title LinearRegression_fp32
    todayMarker off
    dateFormat  X
    axisFormat %s

    section LinearRegression_fp32

        This PR (0.358 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
MatmulChain
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title MatmulChain
    todayMarker off
    dateFormat  X
    axisFormat %s

    section MatmulChain

        This PR (11.028 ms)   : crit, 0, 11

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
MolecularDynamics
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title MolecularDynamics
    todayMarker off
    dateFormat  X
    axisFormat %s

    section MolecularDynamics

        This PR (0.066 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Polybench_Mvt
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Polybench_Mvt
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Polybench_Mvt

        This PR (3.641 ms)   : crit, 0, 3

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
MicroBench_sf_fp32_16
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title MicroBench_sf_fp32_16
    todayMarker off
    dateFormat  X
    axisFormat %s

    section MicroBench_sf_fp32_16

        This PR (0.026 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Polybench_Syr2k
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Polybench_Syr2k
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Polybench_Syr2k

        This PR (6.2989999999999995 ms)   : crit, 0, 6

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Polybench_Syrk
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Polybench_Syrk
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Polybench_Syrk

        This PR (3.211 ms)   : crit, 0, 3

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
api_overhead_benchmark_sycl SubmitKernel out of order
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title api_overhead_benchmark_sycl SubmitKernel out of order
    todayMarker off
    dateFormat  X
    axisFormat %s

    section SubmitKernel(api=sycl<br>Profiling=0<br>Ioq=0<br>DiscardEvents=0<br>NumKernels=10<br>KernelExecTime=1<br>MeasureCompletion=0)

        baseline (50.631 μs)   :  0, 50

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
api_overhead_benchmark_sycl SubmitKernel in order
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title api_overhead_benchmark_sycl SubmitKernel in order
    todayMarker off
    dateFormat  X
    axisFormat %s

    section SubmitKernel(api=sycl<br>Profiling=0<br>Ioq=1<br>DiscardEvents=0<br>NumKernels=10<br>KernelExecTime=1<br>MeasureCompletion=0)

        baseline (49.385 μs)   :  0, 49

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
api_overhead_benchmark_ur SubmitKernel out of order
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title api_overhead_benchmark_ur SubmitKernel out of order
    todayMarker off
    dateFormat  X
    axisFormat %s

    section SubmitKernel(api=ur<br>Profiling=0<br>Ioq=0<br>DiscardEvents=0<br>NumKernels=10<br>KernelExecTime=1<br>MeasureCompletion=0)

        baseline (31.93 μs)   :  0, 31

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
api_overhead_benchmark_ur SubmitKernel in order
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title api_overhead_benchmark_ur SubmitKernel in order
    todayMarker off
    dateFormat  X
    axisFormat %s

    section SubmitKernel(api=ur<br>Profiling=0<br>Ioq=1<br>DiscardEvents=0<br>NumKernels=10<br>KernelExecTime=1<br>MeasureCompletion=0)

        baseline (28.586 μs)   :  0, 28

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
memory_benchmark_sycl QueueInOrderMemcpy from Device to Device, size 1024
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title memory_benchmark_sycl QueueInOrderMemcpy from Device to Device, size 1024
    todayMarker off
    dateFormat  X
    axisFormat %s

    section QueueInOrderMemcpy(api=sycl<br>IsCopyOnly=0<br>sourcePlacement=Device<br>destinationPlacement=Device<br>size=1KB<br>count=100)

        baseline (423.457 μs)   :  0, 423

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
memory_benchmark_sycl QueueInOrderMemcpy from Host to Device, size 1024
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title memory_benchmark_sycl QueueInOrderMemcpy from Host to Device, size 1024
    todayMarker off
    dateFormat  X
    axisFormat %s

    section QueueInOrderMemcpy(api=sycl<br>IsCopyOnly=0<br>sourcePlacement=Host<br>destinationPlacement=Device<br>size=1KB<br>count=100)

        baseline (253.906 μs)   :  0, 253

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
memory_benchmark_sycl QueueMemcpy from Device to Device, size 1024
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title memory_benchmark_sycl QueueMemcpy from Device to Device, size 1024
    todayMarker off
    dateFormat  X
    axisFormat %s

    section QueueMemcpy(api=sycl<br>sourcePlacement=Device<br>destinationPlacement=Device<br>size=1KB)

        baseline (9.179 μs)   :  0, 9

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
memory_benchmark_sycl StreamMemory, placement Device, type Triad, size 10240
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title memory_benchmark_sycl StreamMemory, placement Device, type Triad, size 10240
    todayMarker off
    dateFormat  X
    axisFormat %s

    section StreamMemory(api=sycl<br>type=Triad<br>size=10KB<br>useEvents=0<br>contents=Zeros<br>memoryPlacement=Device)

        baseline (1.854 μs)   :  0, 1

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
api_overhead_benchmark_sycl ExecImmediateCopyQueue out of order from Device to Device, size 1024
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title api_overhead_benchmark_sycl ExecImmediateCopyQueue out of order from Device to Device, size 1024
    todayMarker off
    dateFormat  X
    axisFormat %s

    section ExecImmediateCopyQueue(api=sycl<br>IsCopyOnly=1<br>MeasureCompletionTime=0<br>src=Device<br>dst=Device<br>size=1KB<br>ioq=0)

        baseline (4.506 μs)   :  0, 4

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
api_overhead_benchmark_sycl ExecImmediateCopyQueue in order from Device to Host, size 1024
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title api_overhead_benchmark_sycl ExecImmediateCopyQueue in order from Device to Host, size 1024
    todayMarker off
    dateFormat  X
    axisFormat %s

    section ExecImmediateCopyQueue(api=sycl<br>IsCopyOnly=1<br>MeasureCompletionTime=0<br>src=Host<br>dst=Host<br>size=1KB<br>ioq=1)

        baseline (3.613 μs)   :  0, 3

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
miscellaneous_benchmark_sycl VectorSum
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title miscellaneous_benchmark_sycl VectorSum
    todayMarker off
    dateFormat  X
    axisFormat %s

    section VectorSum(api=sycl<br>numberOfElementsX=512<br>numberOfElementsY=256<br>numberOfElementsZ=256)

        baseline (863.651 μs)   :  0, 863

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading

Details

Velocity-Bench Hashtable

Environment Variables:

Command:

/home/test-user/bench_workdir/hashtable/hashtable_sycl --no-verify

Output:

hashtable - total time for whole calculation: 0.65429 s
205.134803 million keys/second

Velocity-Bench Bitcracker

Environment Variables:

Command:

/home/test-user/bench_workdir/bitcracker/bitcracker -f /home/test-user/bench_workdir/velocity-bench-repo/bitcracker/hash_pass/img_win8_user_hash.txt -d /home/test-user/bench_workdir/velocity-bench-repo/bitcracker/hash_pass/user_passwords_60000.txt -b 60000

Output:

---------> BitCracker: BitLocker password cracking tool <---------

==================================
Retrieving Info

Reading hash file "/home/test-user/bench_workdir/velocity-bench-repo/bitcracker/hash_pass/img_win8_user_hash.txt"

              Attack

================================================
Type of attack: User Password
Psw per thread: 1
max_num_pswd_per_read: 60000
Dictionary: /home/test-user/bench_workdir/velocity-bench-repo/bitcracker/hash_pass/user_passwords_60000.txt
MAC Comparison (-m): Yes

Iter: 1, num passwords read: 60000
Kernel execution:
Effective passwords: 60000
Passwords Range:
npknpByH7N2m3OnLNH1X9DJxLrzIFWk
.....
dL_7uuf3QCz-c6K3xDu0

================================================
Bitcracker attack completed
Total passwords evaluated: 60000
Password not found!

time to subtract from total: 0.0100313 s
bitcracker - total time for whole calculation: 35.5238 s

Velocity-Bench CudaSift

Environment Variables:

Command:

/home/test-user/bench_workdir/cudaSift/cudaSift

Output:

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1228 1267 33.3424% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1238 1274 33.6139% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1122 1277 30.4643% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1234 1265 33.5053% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1223 1258 33.2066% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1232 1266 33.451% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1230 1265 33.3967% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1087 1265 29.514% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1245 1278 33.804% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1225 1260 33.2609% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1231 1265 33.4238% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1222 1257 33.1795% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1240 1275 33.6682% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1211 1259 32.8808% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1227 1265 33.3152% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1087 1265 29.514% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1235 1267 33.5324% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1224 1258 33.2338% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1227 1262 33.3152% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1219 1266 33.098% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1220 1261 33.1252% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1223 1256 33.2066% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1093 1279 29.6769% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1234 1268 33.5053% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1103 1269 29.9484% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1161 1278 31.5232% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1220 1256 33.1252% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1231 1266 33.4238% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1241 1275 33.6954% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1179 1264 32.0119% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1231 1265 33.4238% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1231 1263 33.4238% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1237 1271 33.5868% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1135 1275 30.8173% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1155 1261 31.3603% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1233 1272 33.4781% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1229 1263 33.3695% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1220 1258 33.1252% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1175 1252 31.9033% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1245 1279 33.804% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1104 1264 29.9756% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1227 1260 33.3152% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1231 1267 33.4238% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1115 1264 30.2742% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1233 1266 33.4781% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1129 1279 30.6544% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1139 1267 30.9259% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1217 1253 33.0437% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1227 1261 33.3152% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1222 1256 33.1795% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Avg workload time = 278.061 ms

Velocity-Bench Easywave

Environment Variables:

Command:

/home/test-user/bench_workdir/easywave/easyWave_sycl -grid /home/test-user/bench_workdir/data/easywave/examples/e2Asean.grd -source /home/test-user/bench_workdir/data/easywave/examples/BengkuluSept2007.flt -time 120

Output:

MAIN: Starting SYCL main program
MAIN: Attempting to clean up previous eWave tsunami files
MAIN: Clean up completed
SYCL: SYCL Queue initialization successful
SYCL: Using SYCL device : Intel(R) Data Center GPU Max 1100 (Driver version 1.3.29735+27)
SYCL: Platform : Intel(R) oneAPI Unified Runtime over Level-Zero
MAIN: Program successfully completed

Velocity-Bench QuickSilver

Environment Variables:

QS_DEVICE=GPU

Command:

/home/test-user/bench_workdir/QuickSilver/qs -i /home/test-user/bench_workdir/velocity-bench-repo/QuickSilver/Examples/AllScattering/scatteringOnly.inp

Output:

Copyright (c) 2016
Lawrence Livermore National Security, LLC
All Rights Reserved
Quicksilver Version :
Quicksilver Git Hash :
MPI Version : 3.0
Number of MPI ranks : 1
Number of OpenMP Threads: 1
Number of OpenMP CPUs : 1

Loading params
Finished loading params
Simulation:
dt: 1e-08
fMax: 0.1
inputFile: /home/test-user/bench_workdir/velocity-bench-repo/QuickSilver/Examples/AllScattering/scatteringOnly.inp
energySpectrum:
boundaryCondition: octant
loadBalance: 1
cycleTimers: 0
debugThreads: 0
lx: 100
ly: 100
lz: 100
nParticles: 10000000
batchSize: 0
nBatches: 10
nSteps: 10
nx: 10
ny: 10
nz: 10
seed: 1029384756
xDom: 0
yDom: 0
zDom: 0
eMax: 20
eMin: 1e-09
nGroups: 230
lowWeightCutoff: 0.001
bTally: 1
fTally: 1
cTally: 1
coralBenchmark: 0
crossSectionsOut:

Geometry:
material: sourceMaterial
shape: brick
xMax: 100
xMin: 0
yMax: 100
yMin: 0
zMax: 100
zMin: 0

Material:
name: sourceMaterial
mass: 1000
nIsotopes: 10
nReactions: 9
sourceRate: 1e+10
totalCrossSection: 0.1
absorptionCrossSection: flat
fissionCrossSection: flat
scatteringCrossSection: flat
absorptionCrossSectionRatio: 0
fissionCrossSectionRatio: 0
scatteringCrossSectionRatio: 1

CrossSection:
name: flat
A: 0
B: 0
C: 0
D: 0
E: 1
nuBar: 2.4
setting GPU
setting parameters
Building partition 0
Building partition 1
Building partition 2
Building partition 3
Building MC_Domain 0
Building MC_Domain 1
Building MC_Domain 2
Building MC_Domain 3
Starting Consistency Check
Finished Consistency Check
Finished initMesh
Started copyMaterialDatabase_device
Finished copyMaterialDatabase_device
Finished copyNuclearData_device
Finished copyDomainDevice
cycle start source rr split absorb scatter fission produce collisn escape census num_seg scalar_flux cycleInit cycleTracking cycleFinalize
0 0 1000000 0 9000000 0 18533189 0 0 18533189 1151780 8848220 55527935 1.854923e+09 6.333450e-01 8.452690e-01 1.000000e-06
1 8848220 1000000 0 151478 0 34281997 0 0 34281997 1664159 8335539 94633679 5.047651e+09 5.860130e-01 9.932250e-01 0.000000e+00
2 8335539 1000000 0 663717 0 34354432 0 0 34354432 1366771 8632485 95010375 7.705930e+09 5.634620e-01 9.985810e-01 0.000000e+00
3 8632485 1000000 0 367978 0 34302727 0 0 34302727 1242216 8758247 94953591 9.992076e+09 5.991400e-01 1.108448e+00 1.000000e-06
4 8758247 1000000 0 242076 0 34141236 0 0 34141236 1168452 8831871 94599337 1.199834e+10 5.303310e-01 1.042307e+00 0.000000e+00
5 8831871 1000000 0 168070 0 33948724 0 0 33948724 1121156 8878785 94148236 1.377636e+10 5.245170e-01 9.982700e-01 0.000000e+00
6 8878785 1000000 0 120572 0 33760567 0 0 33760567 1089103 8910254 93689264 1.535668e+10 5.328590e-01 1.001232e+00 0.000000e+00
7 8910254 1000000 0 89810 0 33552179 0 0 33552179 1065203 8934861 93216931 1.676993e+10 5.242420e-01 1.036436e+00 0.000000e+00
8 8934861 1000000 0 65491 0 33384605 0 0 33384605 1047720 8952632 92768273 1.804559e+10 5.213920e-01 1.035630e+00 0.000000e+00
9 8952632 1000000 0 47165 0 33198494 0 0 33198494 1033968 8965829 92324678 1.920208e+10 5.240450e-01 9.907130e-01 0.000000e+00

Timer Cumulative Cumulative Cumulative Cumulative Cumulative Cumulative
Name number microSecs microSecs microSecs microSecs Efficiency
of calls min avg max stddev Rating
main 1 1.559e+07 1.559e+07 1.559e+07 0.000e+00 100.00
cycleInit 10 5.539e+06 5.539e+06 5.539e+06 0.000e+00 100.00
cycleTracking 10 1.005e+07 1.005e+07 1.005e+07 0.000e+00 100.00
cycleTracking_Kernel 104 4.951e+06 4.951e+06 4.951e+06 0.000e+00 100.00
cycleTracking_MPI 117 2.931e+05 2.931e+05 2.931e+05 0.000e+00 100.00
cycleTracking_Test_Done 0 0.000e+00 0.000e+00 0.000e+00 0.000e+00 0.00
cycleFinalize 20 8.260e+02 8.260e+02 8.260e+02 0.000e+00 100.00
Figure Of Merit 89.64 [Num Mega Segments / Cycle Tracking Time]

Velocity-Bench Sobel Filter

Environment Variables:

OPENCV_IO_MAX_IMAGE_PIXELS=1677721600

Command:

/home/test-user/bench_workdir/sobel_filter/sobel_filter -i /home/test-user/bench_workdir/data/sobel_filter/sobel_filter_data/silverfalls_32Kx32K.png -n 5

Output:

SYMN: Welcome to the SYCL version of Sobel filter workload.
SYMN: Input image file: /home/test-user/bench_workdir/data/sobel_filter/sobel_filter_data/silverfalls_32Kx32K.png
SYMN: Launching SYCL kernel with # of iterations: 5
time to subtract from total: 14.086 s
sobelfilter - total time for whole calculation: 0.990716 s

Runtime_BlockedTransform_iter_256_blocksize_256

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/blocked_transform --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/BlockedTransform_multi.csv --size=512

Output:

Runtime_BlockedTransform_iter_128_blocksize_256

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/blocked_transform --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/BlockedTransform_multi.csv --size=512

Output:

Runtime_BlockedTransform_iter_64_blocksize_256

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/blocked_transform --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/BlockedTransform_multi.csv --size=512

Output:

Runtime_BlockedTransform_iter_512_blocksize_256

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/blocked_transform --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/BlockedTransform_multi.csv --size=512

Output:

Runtime_IndependentDAGTaskThroughput_BasicParallelFor

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/dag_task_throughput_independent --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/IndependentDAGTaskThroughput_multi.csv --size=512

Output:

Runtime_IndependentDAGTaskThroughput_NDRangeParallelFor

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/dag_task_throughput_independent --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/IndependentDAGTaskThroughput_multi.csv --size=512

Output:

Runtime_IndependentDAGTaskThroughput_HierarchicalParallelFor

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/dag_task_throughput_independent --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/IndependentDAGTaskThroughput_multi.csv --size=512

Output:

Runtime_IndependentDAGTaskThroughput_SingleTask

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/dag_task_throughput_independent --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/IndependentDAGTaskThroughput_multi.csv --size=512

Output:

Runtime_DAGTaskThroughput_SingleTask

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/dag_task_throughput_sequential --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/DAGTaskThroughput_multi.csv --size=512

Output:

Runtime_DAGTaskThroughput_NDRangeParallelFor

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/dag_task_throughput_sequential --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/DAGTaskThroughput_multi.csv --size=512

Output:

Runtime_DAGTaskThroughput_HierarchicalParallelFor

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/dag_task_throughput_sequential --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/DAGTaskThroughput_multi.csv --size=512

Output:

Runtime_DAGTaskThroughput_BasicParallelFor

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/dag_task_throughput_sequential --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/DAGTaskThroughput_multi.csv --size=512

Output:

MicroBench_LocalMem_fp32_4096

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/local_mem --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/LocalMem_multi.csv --size=512

Output:

MicroBench_LocalMem_int32_4096

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/local_mem --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/LocalMem_multi.csv --size=512

Output:

MicroBench_L2_fp32_4

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/pattern_L2 --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/L2_multi.csv

Output:

MicroBench_L2_fp32_16

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/pattern_L2 --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/L2_multi.csv

Output:

MicroBench_L2_fp32_1

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/pattern_L2 --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/L2_multi.csv

Output:

MicroBench_L2_fp32_8

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/pattern_L2 --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/L2_multi.csv

Output:

MicroBench_L2_int32_2

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/pattern_L2 --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/L2_multi.csv

Output:

MicroBench_L2_int32_16

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/pattern_L2 --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/L2_multi.csv

Output:

MicroBench_L2_int32_4

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/pattern_L2 --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/L2_multi.csv

Output:

MicroBench_L2_int32_8

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/pattern_L2 --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/L2_multi.csv

Output:

MicroBench_L2_int32_1

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/pattern_L2 --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/L2_multi.csv

Output:

MicroBench_L2_fp32_2

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/pattern_L2 --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/L2_multi.csv

Output:

Pattern_Reduction_Hierarchical_int32

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/reduction --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Pattern_Reduction_multi.csv

Output:

Pattern_Reduction_NDRange_int64

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/reduction --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Pattern_Reduction_multi.csv

Output:

Pattern_Reduction_NDRange_fp32

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/reduction --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Pattern_Reduction_multi.csv

Output:

Pattern_Reduction_NDRange_int32

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/reduction --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Pattern_Reduction_multi.csv

Output:

Pattern_Reduction_Hierarchical_int64

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/reduction --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Pattern_Reduction_multi.csv

Output:

Pattern_Reduction_Hierarchical_fp32

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/reduction --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Pattern_Reduction_multi.csv

Output:

ScalarProduct_Hierarchical_int64

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/scalar_prod --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/ScalarProduct_multi.csv

Output:

ScalarProduct_NDRange_fp32

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/scalar_prod --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/ScalarProduct_multi.csv

Output:

ScalarProduct_Hierarchical_fp32

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/scalar_prod --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/ScalarProduct_multi.csv

Output:

ScalarProduct_NDRange_int32

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/scalar_prod --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/ScalarProduct_multi.csv

Output:

ScalarProduct_NDRange_int64

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/scalar_prod --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/ScalarProduct_multi.csv

Output:

ScalarProduct_Hierarchical_int32

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/scalar_prod --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/ScalarProduct_multi.csv

Output:

Pattern_SegmentedReduction_NDRange_int64

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/segmentedreduction --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Pattern_SegmentedReduction_multi.csv

Output:

Pattern_SegmentedReduction_Hierarchical_int16

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/segmentedreduction --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Pattern_SegmentedReduction_multi.csv

Output:

Pattern_SegmentedReduction_NDRange_int32

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/segmentedreduction --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Pattern_SegmentedReduction_multi.csv

Output:

Pattern_SegmentedReduction_Hierarchical_int32

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/segmentedreduction --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Pattern_SegmentedReduction_multi.csv

Output:

Pattern_SegmentedReduction_Hierarchical_int64

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/segmentedreduction --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Pattern_SegmentedReduction_multi.csv

Output:

Pattern_SegmentedReduction_NDRange_fp32

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/segmentedreduction --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Pattern_SegmentedReduction_multi.csv

Output:

Pattern_SegmentedReduction_NDRange_int16

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/segmentedreduction --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Pattern_SegmentedReduction_multi.csv

Output:

Pattern_SegmentedReduction_Hierarchical_fp32

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/segmentedreduction --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Pattern_SegmentedReduction_multi.csv

Output:

USM_Latency_fp32_out_of_order__

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/usm_accessors_latency --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/USM_Latency_multi.csv

Output:

SYCL2020_Accessors_Latency_fp32_out_of_order__

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/usm_accessors_latency --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/USM_Latency_multi.csv

Output:

SYCL2020_Accessors_Latency_fp32_in_order__

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/usm_accessors_latency --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/USM_Latency_multi.csv

Output:

USM_Latency_fp32_in_order__

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/usm_accessors_latency --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/USM_Latency_multi.csv

Output:

USM_Allocation_latency_fp32_device

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/usm_allocation_latency --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/USM_Allocation_latency_multi.csv

Output:

USM_Allocation_latency_fp32_host

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/usm_allocation_latency --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/USM_Allocation_latency_multi.csv

Output:

USM_Allocation_latency_fp32_shared

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/usm_allocation_latency --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/USM_Allocation_latency_multi.csv

Output:

USM_Instr_Mix_fp32_shared_1:1mix_with_init_with_prefetch

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/usm_instr_mix --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/USM_Instr_Mix_multi.csv

Output:

USM_Instr_Mix_fp32_host_1:1mix_no_init_no_prefetch

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/usm_instr_mix --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/USM_Instr_Mix_multi.csv

Output:

USM_Instr_Mix_fp32_shared_1:1mix_with_init_no_prefetch

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/usm_instr_mix --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/USM_Instr_Mix_multi.csv

Output:

USM_Instr_Mix_fp32_shared_1:1mix_no_init_with_prefetch

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/usm_instr_mix --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/USM_Instr_Mix_multi.csv

Output:

USM_Instr_Mix_fp32_device_1:1mix_no_init_no_prefetch

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/usm_instr_mix --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/USM_Instr_Mix_multi.csv

Output:

USM_Instr_Mix_fp32_shared_1:1mix_no_init_no_prefetch

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/usm_instr_mix --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/USM_Instr_Mix_multi.csv

Output:

USM_Instr_Mix_fp32_device_1:1mix_with_init_no_prefetch

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/usm_instr_mix --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/USM_Instr_Mix_multi.csv

Output:

USM_Instr_Mix_fp32_host_1:1mix_with_init_no_prefetch

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/usm_instr_mix --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/USM_Instr_Mix_multi.csv

Output:

USM_Pinned_Overhead_fp32_HostDevice_Pinned_Init_1

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/usm_pinned_overhead --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/USM_Pinned_Overhead_multi.csv

Output:

USM_Pinned_Overhead_fp32_DeviceHost_Pinned_Init_1

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/usm_pinned_overhead --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/USM_Pinned_Overhead_multi.csv

Output:

USM_Pinned_Overhead_fp32_DeviceHost_NonPinned_Init_1

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/usm_pinned_overhead --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/USM_Pinned_Overhead_multi.csv

Output:

USM_Pinned_Overhead_fp32_HostDevice_NonPinned_Init_1

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/usm_pinned_overhead --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/USM_Pinned_Overhead_multi.csv

Output:

VectorAddition_fp32

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/vec_add --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/VectorAddition_multi.csv

Output:

VectorAddition_int64

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/vec_add --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/VectorAddition_multi.csv

Output:

VectorAddition_int32

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/vec_add --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/VectorAddition_multi.csv

Output:

Polybench_2DConvolution

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/2DConvolution --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/2DConvolution.csv

Output:

Polybench_2mm

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/2mm --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/2mm.csv --size=512

Output:

Polybench_3mm

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/3mm --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/3mm.csv --size=512

Output:

MicroBench_Arith_int32_512

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/arith --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Arith_int32_512.csv --size=16384

Output:

MicroBench_Arith_fp32_512

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/arith --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Arith_int32_512.csv --size=16384

Output:

Polybench_Atax

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/atax --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Atax.csv --size=8192

Output:

ReductionAtomic_int32

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/atomic_reduction --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/ReductionAtomic_fp64.csv

Output:

ReductionAtomic_int64

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/atomic_reduction --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/ReductionAtomic_fp64.csv

Output:

ReductionAtomic_fp64

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/atomic_reduction --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/ReductionAtomic_fp64.csv

Output:

ReductionAtomic_fp32

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/atomic_reduction --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/ReductionAtomic_fp64.csv

Output:

Polybench_Bicg

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/bicg --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Bicg.csv --size=8192

Output:

Polybench_Correlation

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/correlation --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Correlation.csv --size=512

Output:

Polybench_Covariance

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/covariance --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Covariance.csv --size=512

Output:

Polybench_Gemm

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/gemm --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Gemm.csv --size=1024

Output:

Polybench_Gesummv

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/gesummv --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Gesummv.csv --size=8192

Output:

Polybench_Gramschmidt

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/gramschmidt --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Gramschmidt.csv --size=512

Output:

Kmeans_fp32

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/kmeans --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Kmeans.csv --size=67108864

Output:

LinearRegressionCoeff_fp32

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/lin_reg_coeff --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/LinearRegressionCoeff.csv

Output:

LinearRegression_fp32

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/lin_reg_error --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/LinearRegression.csv

Output:

MatmulChain

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/matmulchain --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/MatmulChain.csv --size=1024

Output:

MolecularDynamics

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/mol_dyn --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/MolecularDynamics.csv

Output:

Polybench_Mvt

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/mvt --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Mvt.csv --size=16384

Output:

MicroBench_sf_fp32_16

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/sf --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/sf_16.csv --size=--size=100000000

Output:

Polybench_Syr2k

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/syr2k --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Syr2k.csv --size=1024

Output:

Polybench_Syrk

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/syrk --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Syrk.csv --size=1024

Output:

Copy link

Compute Benchmarks level_zero run (with params: ):
https://github.com/oneapi-src/unified-runtime/actions/runs/10956727303

Copy link

Compute Benchmarks level_zero run ():
https://github.com/oneapi-src/unified-runtime/actions/runs/10956727303
Job status: failure. Test status: skipped.

Copy link

Compute Benchmarks level_zero run (with params: ):
https://github.com/oneapi-src/unified-runtime/actions/runs/10956727303

Copy link

Compute Benchmarks level_zero run ():
https://github.com/oneapi-src/unified-runtime/actions/runs/10956727303
Job status: success. Test status: success.

Summary

result is better

Benchmark This PR baseline Relative perf Change -
Velocity-Bench Easywave 364 457.0 diff:125.55% Perf change: 25.5% 0
Velocity-Bench Hashtable 212.463702 178.291413 diff:119.17% Perf change: 19.2% 0
Velocity-Bench CudaSift 280.223 283.294 diff:101.10% Perf change: 1.1%
Velocity-Bench Bitcracker 35.6787 35.8407 diff:100.45% Perf change: 0.5%
Velocity-Bench Sobel Filter 963.407 934.963 diff:97.05% Perf change: -3.0% -
Velocity-Bench QuickSilver 91.56 115.63 diff:79.18% Perf change: -20.8% --------
Runtime_BlockedTransform_iter_128_blocksize_256 0.152 -
Runtime_BlockedTransform_iter_64_blocksize_256 0.36200000000000004 -
Runtime_BlockedTransform_iter_256_blocksize_256 0.08399999999999999 -
Runtime_BlockedTransform_iter_512_blocksize_256 0.083 -
Runtime_IndependentDAGTaskThroughput_SingleTask 6.563000000000001 -
Runtime_IndependentDAGTaskThroughput_BasicParallelFor 5.792000000000001 -
Runtime_IndependentDAGTaskThroughput_HierarchicalParallelFor 5.596 -
Runtime_IndependentDAGTaskThroughput_NDRangeParallelFor 5.588 -
Runtime_DAGTaskThroughput_BasicParallelFor 6.054 -
Runtime_DAGTaskThroughput_SingleTask 6.548 -
Runtime_DAGTaskThroughput_HierarchicalParallelFor 5.37 -
Runtime_DAGTaskThroughput_NDRangeParallelFor 4.957 -
MicroBench_LocalMem_fp32_4096 0.2 -
MicroBench_LocalMem_int32_4096 0.229 -
MicroBench_L2_int32_4 0.026 -
MicroBench_L2_fp32_4 0.026 -
MicroBench_L2_int32_8 0.026 -
MicroBench_L2_int32_16 0.026 -
MicroBench_L2_int32_2 0.027 -
MicroBench_L2_fp32_1 0.026 -
MicroBench_L2_fp32_8 0.026 -
MicroBench_L2_fp32_16 0.026 -
MicroBench_L2_int32_1 0.034 -
MicroBench_L2_fp32_2 0.029 -
Pattern_Reduction_Hierarchical_int32 0.052 -
Pattern_Reduction_Hierarchical_fp32 0.052 -
Pattern_Reduction_NDRange_int32 0.074 -
Pattern_Reduction_NDRange_int64 0.053 -
Pattern_Reduction_Hierarchical_int64 0.051 -
Pattern_Reduction_NDRange_fp32 0.026 -
ScalarProduct_Hierarchical_int32 0.062 -
ScalarProduct_NDRange_fp32 0.04 -
ScalarProduct_NDRange_int32 0.15100000000000002 -
ScalarProduct_NDRange_int64 0.098 -
ScalarProduct_Hierarchical_int64 0.063 -
ScalarProduct_Hierarchical_fp32 0.059 -
Pattern_SegmentedReduction_Hierarchical_int16 0.030000000000000002 -
Pattern_SegmentedReduction_Hierarchical_int32 0.028 -
Pattern_SegmentedReduction_Hierarchical_int64 0.029 -
Pattern_SegmentedReduction_Hierarchical_fp32 0.030000000000000002 -
Pattern_SegmentedReduction_NDRange_int64 0.018000000000000002 -
Pattern_SegmentedReduction_NDRange_int16 0.056 -
Pattern_SegmentedReduction_NDRange_int32 0.027 -
Pattern_SegmentedReduction_NDRange_fp32 0.014 -
USM_Latency_fp32_out_of_order__ 46.964 -
SYCL2020_Accessors_Latency_fp32_out_of_order__ 71.006 -
USM_Latency_fp32_in_order__ 33.724 -
SYCL2020_Accessors_Latency_fp32_in_order__ 69.3 -
USM_Allocation_latency_fp32_host 0.002 -
USM_Allocation_latency_fp32_shared 0.11699999999999999 -
USM_Allocation_latency_fp32_device 0.009000000000000001 -
USM_Instr_Mix_fp32_shared_1:1mix_no_init_with_prefetch 15.334999999999999 -
USM_Instr_Mix_fp32_shared_1:1mix_with_init_with_prefetch 15.211 -
USM_Instr_Mix_fp32_device_1:1mix_no_init_no_prefetch 3.09 -
USM_Instr_Mix_fp32_device_1:1mix_with_init_no_prefetch 3.2169999999999996 -
USM_Instr_Mix_fp32_shared_1:1mix_with_init_no_prefetch 14.097999999999999 -
USM_Instr_Mix_fp32_host_1:1mix_no_init_no_prefetch 1.849 -
USM_Instr_Mix_fp32_shared_1:1mix_no_init_no_prefetch 13.639999999999999 -
USM_Instr_Mix_fp32_host_1:1mix_with_init_no_prefetch 1.718 -
USM_Pinned_Overhead_fp32_HostDevice_Pinned_Init_1 0.011 -
USM_Pinned_Overhead_fp32_DeviceHost_Pinned_Init_1 0.019 -
USM_Pinned_Overhead_fp32_HostDevice_NonPinned_Init_1 0.015000000000000001 -
USM_Pinned_Overhead_fp32_DeviceHost_NonPinned_Init_1 0.42700000000000005 -
VectorAddition_fp32 0.032 -
VectorAddition_int32 0.037 -
VectorAddition_int64 0.04 -
Polybench_2DConvolution 0.23 -
Polybench_2mm 1.239 -
Polybench_3mm 1.747 -
MicroBench_Arith_int32_512 0.073 -
MicroBench_Arith_fp32_512 0.032 -
Polybench_Atax 6.902 -
ReductionAtomic_int32 0.041 -
ReductionAtomic_fp32 0.041 -
ReductionAtomic_fp64 0.043000000000000003 -
ReductionAtomic_int64 0.041 -
Polybench_Bicg 5.122 -
Polybench_Correlation 94.61 -
Polybench_Covariance 94.47 -
Polybench_Gemm 3.965 -
Polybench_Gesummv 7.316999999999999 -
Polybench_Gramschmidt 285.055 -
Kmeans_fp32 1.7930000000000001 -
LinearRegressionCoeff_fp32 1.3339999999999999 -
LinearRegression_fp32 0.357 -
MatmulChain 11.028 -
MolecularDynamics 0.066 -
Polybench_Mvt 3.629 -
MicroBench_sf_fp32_16 0.025 -
Polybench_Syr2k 6.292 -
Polybench_Syrk 3.206 -
api_overhead_benchmark_sycl SubmitKernel out of order - 50.631
api_overhead_benchmark_sycl SubmitKernel in order - 49.385
api_overhead_benchmark_ur SubmitKernel out of order - 31.93
api_overhead_benchmark_ur SubmitKernel in order - 28.586
memory_benchmark_sycl QueueInOrderMemcpy from Device to Device, size 1024 - 423.457
memory_benchmark_sycl QueueInOrderMemcpy from Host to Device, size 1024 - 253.906
memory_benchmark_sycl QueueMemcpy from Device to Device, size 1024 - 9.179
memory_benchmark_sycl StreamMemory, placement Device, type Triad, size 10240 - 1.854
api_overhead_benchmark_sycl ExecImmediateCopyQueue out of order from Device to Device, size 1024 - 4.506
api_overhead_benchmark_sycl ExecImmediateCopyQueue in order from Device to Host, size 1024 - 3.613
miscellaneous_benchmark_sycl VectorSum - 863.651

Charts

Velocity-Bench Hashtable
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Velocity-Bench Hashtable
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Velocity-Bench<br>Hashtable

        This PR (212.463702 M keys/sec)   : crit, 0, 212

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

    section hashtable

        baseline (178.291413 M keys/sec)   :  0, 178

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Velocity-Bench Bitcracker
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Velocity-Bench Bitcracker
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Velocity-Bench<br>Bitcracker

        This PR (35.6787 s)   : crit, 0, 35

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

    section bitcracker

        baseline (35.8407 s)   :  0, 35

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Velocity-Bench CudaSift
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Velocity-Bench CudaSift
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Velocity-Bench<br>CudaSift

        This PR (280.223 ms)   : crit, 0, 280

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

    section cudaSift

        baseline (283.294 ms)   :  0, 283

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Velocity-Bench Easywave
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Velocity-Bench Easywave
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Velocity-Bench<br>Easywave

        This PR (364 ms)   : crit, 0, 364

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

    section easywave

        baseline (457.0 ms)   :  0, 457

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Velocity-Bench QuickSilver
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Velocity-Bench QuickSilver
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Velocity-Bench<br>QuickSilver

        This PR (91.56 MMS/CTT)   : crit, 0, 91

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

    section QuickSilver

        baseline (115.63 MMS/CTT)   :  0, 115

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Velocity-Bench Sobel Filter
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Velocity-Bench Sobel Filter
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Velocity-Bench<br>Sobel<br>Filter

        This PR (963.407 ms)   : crit, 0, 963

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

    section sobel_filter

        baseline (934.963 ms)   :  0, 934

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Runtime_BlockedTransform_iter_128_blocksize_256
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Runtime_BlockedTransform_iter_128_blocksize_256
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Runtime_BlockedTransform_iter_128_blocksize_256

        This PR (0.152 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Runtime_BlockedTransform_iter_64_blocksize_256
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Runtime_BlockedTransform_iter_64_blocksize_256
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Runtime_BlockedTransform_iter_64_blocksize_256

        This PR (0.36200000000000004 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Runtime_BlockedTransform_iter_256_blocksize_256
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Runtime_BlockedTransform_iter_256_blocksize_256
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Runtime_BlockedTransform_iter_256_blocksize_256

        This PR (0.08399999999999999 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Runtime_BlockedTransform_iter_512_blocksize_256
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Runtime_BlockedTransform_iter_512_blocksize_256
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Runtime_BlockedTransform_iter_512_blocksize_256

        This PR (0.083 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Runtime_IndependentDAGTaskThroughput_SingleTask
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Runtime_IndependentDAGTaskThroughput_SingleTask
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Runtime_IndependentDAGTaskThroughput_SingleTask

        This PR (6.563000000000001 ms)   : crit, 0, 6

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Runtime_IndependentDAGTaskThroughput_BasicParallelFor
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Runtime_IndependentDAGTaskThroughput_BasicParallelFor
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Runtime_IndependentDAGTaskThroughput_BasicParallelFor

        This PR (5.792000000000001 ms)   : crit, 0, 5

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Runtime_IndependentDAGTaskThroughput_HierarchicalParallelFor
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Runtime_IndependentDAGTaskThroughput_HierarchicalParallelFor
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Runtime_IndependentDAGTaskThroughput_HierarchicalParallelFor

        This PR (5.596 ms)   : crit, 0, 5

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Runtime_IndependentDAGTaskThroughput_NDRangeParallelFor
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Runtime_IndependentDAGTaskThroughput_NDRangeParallelFor
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Runtime_IndependentDAGTaskThroughput_NDRangeParallelFor

        This PR (5.588 ms)   : crit, 0, 5

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Runtime_DAGTaskThroughput_BasicParallelFor
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Runtime_DAGTaskThroughput_BasicParallelFor
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Runtime_DAGTaskThroughput_BasicParallelFor

        This PR (6.054 ms)   : crit, 0, 6

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Runtime_DAGTaskThroughput_SingleTask
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Runtime_DAGTaskThroughput_SingleTask
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Runtime_DAGTaskThroughput_SingleTask

        This PR (6.548 ms)   : crit, 0, 6

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Runtime_DAGTaskThroughput_HierarchicalParallelFor
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Runtime_DAGTaskThroughput_HierarchicalParallelFor
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Runtime_DAGTaskThroughput_HierarchicalParallelFor

        This PR (5.37 ms)   : crit, 0, 5

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Runtime_DAGTaskThroughput_NDRangeParallelFor
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Runtime_DAGTaskThroughput_NDRangeParallelFor
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Runtime_DAGTaskThroughput_NDRangeParallelFor

        This PR (4.957 ms)   : crit, 0, 4

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
MicroBench_LocalMem_fp32_4096
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title MicroBench_LocalMem_fp32_4096
    todayMarker off
    dateFormat  X
    axisFormat %s

    section MicroBench_LocalMem_fp32_4096

        This PR (0.2 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
MicroBench_LocalMem_int32_4096
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title MicroBench_LocalMem_int32_4096
    todayMarker off
    dateFormat  X
    axisFormat %s

    section MicroBench_LocalMem_int32_4096

        This PR (0.229 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
MicroBench_L2_int32_4
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title MicroBench_L2_int32_4
    todayMarker off
    dateFormat  X
    axisFormat %s

    section MicroBench_L2_int32_4

        This PR (0.026 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
MicroBench_L2_fp32_4
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title MicroBench_L2_fp32_4
    todayMarker off
    dateFormat  X
    axisFormat %s

    section MicroBench_L2_fp32_4

        This PR (0.026 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
MicroBench_L2_int32_8
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title MicroBench_L2_int32_8
    todayMarker off
    dateFormat  X
    axisFormat %s

    section MicroBench_L2_int32_8

        This PR (0.026 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
MicroBench_L2_int32_16
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title MicroBench_L2_int32_16
    todayMarker off
    dateFormat  X
    axisFormat %s

    section MicroBench_L2_int32_16

        This PR (0.026 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
MicroBench_L2_int32_2
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title MicroBench_L2_int32_2
    todayMarker off
    dateFormat  X
    axisFormat %s

    section MicroBench_L2_int32_2

        This PR (0.027 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
MicroBench_L2_fp32_1
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title MicroBench_L2_fp32_1
    todayMarker off
    dateFormat  X
    axisFormat %s

    section MicroBench_L2_fp32_1

        This PR (0.026 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
MicroBench_L2_fp32_8
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title MicroBench_L2_fp32_8
    todayMarker off
    dateFormat  X
    axisFormat %s

    section MicroBench_L2_fp32_8

        This PR (0.026 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
MicroBench_L2_fp32_16
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title MicroBench_L2_fp32_16
    todayMarker off
    dateFormat  X
    axisFormat %s

    section MicroBench_L2_fp32_16

        This PR (0.026 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
MicroBench_L2_int32_1
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title MicroBench_L2_int32_1
    todayMarker off
    dateFormat  X
    axisFormat %s

    section MicroBench_L2_int32_1

        This PR (0.034 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
MicroBench_L2_fp32_2
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title MicroBench_L2_fp32_2
    todayMarker off
    dateFormat  X
    axisFormat %s

    section MicroBench_L2_fp32_2

        This PR (0.029 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Pattern_Reduction_Hierarchical_int32
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Pattern_Reduction_Hierarchical_int32
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Pattern_Reduction_Hierarchical_int32

        This PR (0.052 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Pattern_Reduction_Hierarchical_fp32
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Pattern_Reduction_Hierarchical_fp32
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Pattern_Reduction_Hierarchical_fp32

        This PR (0.052 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Pattern_Reduction_NDRange_int32
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Pattern_Reduction_NDRange_int32
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Pattern_Reduction_NDRange_int32

        This PR (0.074 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Pattern_Reduction_NDRange_int64
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Pattern_Reduction_NDRange_int64
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Pattern_Reduction_NDRange_int64

        This PR (0.053 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Pattern_Reduction_Hierarchical_int64
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Pattern_Reduction_Hierarchical_int64
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Pattern_Reduction_Hierarchical_int64

        This PR (0.051 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Pattern_Reduction_NDRange_fp32
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Pattern_Reduction_NDRange_fp32
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Pattern_Reduction_NDRange_fp32

        This PR (0.026 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
ScalarProduct_Hierarchical_int32
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title ScalarProduct_Hierarchical_int32
    todayMarker off
    dateFormat  X
    axisFormat %s

    section ScalarProduct_Hierarchical_int32

        This PR (0.062 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
ScalarProduct_NDRange_fp32
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title ScalarProduct_NDRange_fp32
    todayMarker off
    dateFormat  X
    axisFormat %s

    section ScalarProduct_NDRange_fp32

        This PR (0.04 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
ScalarProduct_NDRange_int32
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title ScalarProduct_NDRange_int32
    todayMarker off
    dateFormat  X
    axisFormat %s

    section ScalarProduct_NDRange_int32

        This PR (0.15100000000000002 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
ScalarProduct_NDRange_int64
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title ScalarProduct_NDRange_int64
    todayMarker off
    dateFormat  X
    axisFormat %s

    section ScalarProduct_NDRange_int64

        This PR (0.098 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
ScalarProduct_Hierarchical_int64
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title ScalarProduct_Hierarchical_int64
    todayMarker off
    dateFormat  X
    axisFormat %s

    section ScalarProduct_Hierarchical_int64

        This PR (0.063 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
ScalarProduct_Hierarchical_fp32
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title ScalarProduct_Hierarchical_fp32
    todayMarker off
    dateFormat  X
    axisFormat %s

    section ScalarProduct_Hierarchical_fp32

        This PR (0.059 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Pattern_SegmentedReduction_Hierarchical_int16
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Pattern_SegmentedReduction_Hierarchical_int16
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Pattern_SegmentedReduction_Hierarchical_int16

        This PR (0.030000000000000002 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Pattern_SegmentedReduction_Hierarchical_int32
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Pattern_SegmentedReduction_Hierarchical_int32
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Pattern_SegmentedReduction_Hierarchical_int32

        This PR (0.028 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Pattern_SegmentedReduction_Hierarchical_int64
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Pattern_SegmentedReduction_Hierarchical_int64
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Pattern_SegmentedReduction_Hierarchical_int64

        This PR (0.029 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Pattern_SegmentedReduction_Hierarchical_fp32
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Pattern_SegmentedReduction_Hierarchical_fp32
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Pattern_SegmentedReduction_Hierarchical_fp32

        This PR (0.030000000000000002 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Pattern_SegmentedReduction_NDRange_int64
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Pattern_SegmentedReduction_NDRange_int64
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Pattern_SegmentedReduction_NDRange_int64

        This PR (0.018000000000000002 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Pattern_SegmentedReduction_NDRange_int16
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Pattern_SegmentedReduction_NDRange_int16
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Pattern_SegmentedReduction_NDRange_int16

        This PR (0.056 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Pattern_SegmentedReduction_NDRange_int32
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Pattern_SegmentedReduction_NDRange_int32
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Pattern_SegmentedReduction_NDRange_int32

        This PR (0.027 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Pattern_SegmentedReduction_NDRange_fp32
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Pattern_SegmentedReduction_NDRange_fp32
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Pattern_SegmentedReduction_NDRange_fp32

        This PR (0.014 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
USM_Latency_fp32_out_of_order__
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title USM_Latency_fp32_out_of_order__
    todayMarker off
    dateFormat  X
    axisFormat %s

    section USM_Latency_fp32_out_of_order__

        This PR (46.964 ms)   : crit, 0, 46

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
SYCL2020_Accessors_Latency_fp32_out_of_order__
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title SYCL2020_Accessors_Latency_fp32_out_of_order__
    todayMarker off
    dateFormat  X
    axisFormat %s

    section SYCL2020_Accessors_Latency_fp32_out_of_order__

        This PR (71.006 ms)   : crit, 0, 71

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
USM_Latency_fp32_in_order__
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title USM_Latency_fp32_in_order__
    todayMarker off
    dateFormat  X
    axisFormat %s

    section USM_Latency_fp32_in_order__

        This PR (33.724 ms)   : crit, 0, 33

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
SYCL2020_Accessors_Latency_fp32_in_order__
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title SYCL2020_Accessors_Latency_fp32_in_order__
    todayMarker off
    dateFormat  X
    axisFormat %s

    section SYCL2020_Accessors_Latency_fp32_in_order__

        This PR (69.3 ms)   : crit, 0, 69

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
USM_Allocation_latency_fp32_host
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title USM_Allocation_latency_fp32_host
    todayMarker off
    dateFormat  X
    axisFormat %s

    section USM_Allocation_latency_fp32_host

        This PR (0.002 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
USM_Allocation_latency_fp32_shared
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title USM_Allocation_latency_fp32_shared
    todayMarker off
    dateFormat  X
    axisFormat %s

    section USM_Allocation_latency_fp32_shared

        This PR (0.11699999999999999 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
USM_Allocation_latency_fp32_device
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title USM_Allocation_latency_fp32_device
    todayMarker off
    dateFormat  X
    axisFormat %s

    section USM_Allocation_latency_fp32_device

        This PR (0.009000000000000001 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
USM_Instr_Mix_fp32_shared_1:1mix_no_init_with_prefetch
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title USM_Instr_Mix_fp32_shared_1:1mix_no_init_with_prefetch
    todayMarker off
    dateFormat  X
    axisFormat %s

    section USM_Instr_Mix_fp32_shared_1:1mix_no_init_with_prefetch

        This PR (15.334999999999999 ms)   : crit, 0, 15

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
USM_Instr_Mix_fp32_shared_1:1mix_with_init_with_prefetch
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title USM_Instr_Mix_fp32_shared_1:1mix_with_init_with_prefetch
    todayMarker off
    dateFormat  X
    axisFormat %s

    section USM_Instr_Mix_fp32_shared_1:1mix_with_init_with_prefetch

        This PR (15.211 ms)   : crit, 0, 15

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
USM_Instr_Mix_fp32_device_1:1mix_no_init_no_prefetch
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title USM_Instr_Mix_fp32_device_1:1mix_no_init_no_prefetch
    todayMarker off
    dateFormat  X
    axisFormat %s

    section USM_Instr_Mix_fp32_device_1:1mix_no_init_no_prefetch

        This PR (3.09 ms)   : crit, 0, 3

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
USM_Instr_Mix_fp32_device_1:1mix_with_init_no_prefetch
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title USM_Instr_Mix_fp32_device_1:1mix_with_init_no_prefetch
    todayMarker off
    dateFormat  X
    axisFormat %s

    section USM_Instr_Mix_fp32_device_1:1mix_with_init_no_prefetch

        This PR (3.2169999999999996 ms)   : crit, 0, 3

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
USM_Instr_Mix_fp32_shared_1:1mix_with_init_no_prefetch
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title USM_Instr_Mix_fp32_shared_1:1mix_with_init_no_prefetch
    todayMarker off
    dateFormat  X
    axisFormat %s

    section USM_Instr_Mix_fp32_shared_1:1mix_with_init_no_prefetch

        This PR (14.097999999999999 ms)   : crit, 0, 14

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
USM_Instr_Mix_fp32_host_1:1mix_no_init_no_prefetch
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title USM_Instr_Mix_fp32_host_1:1mix_no_init_no_prefetch
    todayMarker off
    dateFormat  X
    axisFormat %s

    section USM_Instr_Mix_fp32_host_1:1mix_no_init_no_prefetch

        This PR (1.849 ms)   : crit, 0, 1

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
USM_Instr_Mix_fp32_shared_1:1mix_no_init_no_prefetch
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title USM_Instr_Mix_fp32_shared_1:1mix_no_init_no_prefetch
    todayMarker off
    dateFormat  X
    axisFormat %s

    section USM_Instr_Mix_fp32_shared_1:1mix_no_init_no_prefetch

        This PR (13.639999999999999 ms)   : crit, 0, 13

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
USM_Instr_Mix_fp32_host_1:1mix_with_init_no_prefetch
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title USM_Instr_Mix_fp32_host_1:1mix_with_init_no_prefetch
    todayMarker off
    dateFormat  X
    axisFormat %s

    section USM_Instr_Mix_fp32_host_1:1mix_with_init_no_prefetch

        This PR (1.718 ms)   : crit, 0, 1

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
USM_Pinned_Overhead_fp32_HostDevice_Pinned_Init_1
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title USM_Pinned_Overhead_fp32_HostDevice_Pinned_Init_1
    todayMarker off
    dateFormat  X
    axisFormat %s

    section USM_Pinned_Overhead_fp32_HostDevice_Pinned_Init_1

        This PR (0.011 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
USM_Pinned_Overhead_fp32_DeviceHost_Pinned_Init_1
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title USM_Pinned_Overhead_fp32_DeviceHost_Pinned_Init_1
    todayMarker off
    dateFormat  X
    axisFormat %s

    section USM_Pinned_Overhead_fp32_DeviceHost_Pinned_Init_1

        This PR (0.019 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
USM_Pinned_Overhead_fp32_HostDevice_NonPinned_Init_1
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title USM_Pinned_Overhead_fp32_HostDevice_NonPinned_Init_1
    todayMarker off
    dateFormat  X
    axisFormat %s

    section USM_Pinned_Overhead_fp32_HostDevice_NonPinned_Init_1

        This PR (0.015000000000000001 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
USM_Pinned_Overhead_fp32_DeviceHost_NonPinned_Init_1
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title USM_Pinned_Overhead_fp32_DeviceHost_NonPinned_Init_1
    todayMarker off
    dateFormat  X
    axisFormat %s

    section USM_Pinned_Overhead_fp32_DeviceHost_NonPinned_Init_1

        This PR (0.42700000000000005 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
VectorAddition_fp32
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title VectorAddition_fp32
    todayMarker off
    dateFormat  X
    axisFormat %s

    section VectorAddition_fp32

        This PR (0.032 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
VectorAddition_int32
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title VectorAddition_int32
    todayMarker off
    dateFormat  X
    axisFormat %s

    section VectorAddition_int32

        This PR (0.037 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
VectorAddition_int64
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title VectorAddition_int64
    todayMarker off
    dateFormat  X
    axisFormat %s

    section VectorAddition_int64

        This PR (0.04 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Polybench_2DConvolution
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Polybench_2DConvolution
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Polybench_2DConvolution

        This PR (0.23 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Polybench_2mm
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Polybench_2mm
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Polybench_2mm

        This PR (1.239 ms)   : crit, 0, 1

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Polybench_3mm
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Polybench_3mm
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Polybench_3mm

        This PR (1.747 ms)   : crit, 0, 1

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
MicroBench_Arith_int32_512
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title MicroBench_Arith_int32_512
    todayMarker off
    dateFormat  X
    axisFormat %s

    section MicroBench_Arith_int32_512

        This PR (0.073 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
MicroBench_Arith_fp32_512
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title MicroBench_Arith_fp32_512
    todayMarker off
    dateFormat  X
    axisFormat %s

    section MicroBench_Arith_fp32_512

        This PR (0.032 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Polybench_Atax
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Polybench_Atax
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Polybench_Atax

        This PR (6.902 ms)   : crit, 0, 6

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
ReductionAtomic_int32
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title ReductionAtomic_int32
    todayMarker off
    dateFormat  X
    axisFormat %s

    section ReductionAtomic_int32

        This PR (0.041 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
ReductionAtomic_fp32
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title ReductionAtomic_fp32
    todayMarker off
    dateFormat  X
    axisFormat %s

    section ReductionAtomic_fp32

        This PR (0.041 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
ReductionAtomic_fp64
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title ReductionAtomic_fp64
    todayMarker off
    dateFormat  X
    axisFormat %s

    section ReductionAtomic_fp64

        This PR (0.043000000000000003 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
ReductionAtomic_int64
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title ReductionAtomic_int64
    todayMarker off
    dateFormat  X
    axisFormat %s

    section ReductionAtomic_int64

        This PR (0.041 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Polybench_Bicg
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Polybench_Bicg
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Polybench_Bicg

        This PR (5.122 ms)   : crit, 0, 5

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Polybench_Correlation
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Polybench_Correlation
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Polybench_Correlation

        This PR (94.61 ms)   : crit, 0, 94

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Polybench_Covariance
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Polybench_Covariance
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Polybench_Covariance

        This PR (94.47 ms)   : crit, 0, 94

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Polybench_Gemm
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Polybench_Gemm
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Polybench_Gemm

        This PR (3.965 ms)   : crit, 0, 3

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Polybench_Gesummv
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Polybench_Gesummv
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Polybench_Gesummv

        This PR (7.316999999999999 ms)   : crit, 0, 7

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Polybench_Gramschmidt
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Polybench_Gramschmidt
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Polybench_Gramschmidt

        This PR (285.055 ms)   : crit, 0, 285

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Kmeans_fp32
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Kmeans_fp32
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Kmeans_fp32

        This PR (1.7930000000000001 ms)   : crit, 0, 1

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
LinearRegressionCoeff_fp32
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title LinearRegressionCoeff_fp32
    todayMarker off
    dateFormat  X
    axisFormat %s

    section LinearRegressionCoeff_fp32

        This PR (1.3339999999999999 ms)   : crit, 0, 1

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
LinearRegression_fp32
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title LinearRegression_fp32
    todayMarker off
    dateFormat  X
    axisFormat %s

    section LinearRegression_fp32

        This PR (0.357 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
MatmulChain
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title MatmulChain
    todayMarker off
    dateFormat  X
    axisFormat %s

    section MatmulChain

        This PR (11.028 ms)   : crit, 0, 11

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
MolecularDynamics
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title MolecularDynamics
    todayMarker off
    dateFormat  X
    axisFormat %s

    section MolecularDynamics

        This PR (0.066 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Polybench_Mvt
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Polybench_Mvt
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Polybench_Mvt

        This PR (3.629 ms)   : crit, 0, 3

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
MicroBench_sf_fp32_16
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title MicroBench_sf_fp32_16
    todayMarker off
    dateFormat  X
    axisFormat %s

    section MicroBench_sf_fp32_16

        This PR (0.025 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Polybench_Syr2k
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Polybench_Syr2k
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Polybench_Syr2k

        This PR (6.292 ms)   : crit, 0, 6

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Polybench_Syrk
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Polybench_Syrk
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Polybench_Syrk

        This PR (3.206 ms)   : crit, 0, 3

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
api_overhead_benchmark_sycl SubmitKernel out of order
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title api_overhead_benchmark_sycl SubmitKernel out of order
    todayMarker off
    dateFormat  X
    axisFormat %s

    section SubmitKernel(api=sycl<br>Profiling=0<br>Ioq=0<br>DiscardEvents=0<br>NumKernels=10<br>KernelExecTime=1<br>MeasureCompletion=0)

        baseline (50.631 μs)   :  0, 50

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
api_overhead_benchmark_sycl SubmitKernel in order
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title api_overhead_benchmark_sycl SubmitKernel in order
    todayMarker off
    dateFormat  X
    axisFormat %s

    section SubmitKernel(api=sycl<br>Profiling=0<br>Ioq=1<br>DiscardEvents=0<br>NumKernels=10<br>KernelExecTime=1<br>MeasureCompletion=0)

        baseline (49.385 μs)   :  0, 49

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
api_overhead_benchmark_ur SubmitKernel out of order
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title api_overhead_benchmark_ur SubmitKernel out of order
    todayMarker off
    dateFormat  X
    axisFormat %s

    section SubmitKernel(api=ur<br>Profiling=0<br>Ioq=0<br>DiscardEvents=0<br>NumKernels=10<br>KernelExecTime=1<br>MeasureCompletion=0)

        baseline (31.93 μs)   :  0, 31

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
api_overhead_benchmark_ur SubmitKernel in order
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title api_overhead_benchmark_ur SubmitKernel in order
    todayMarker off
    dateFormat  X
    axisFormat %s

    section SubmitKernel(api=ur<br>Profiling=0<br>Ioq=1<br>DiscardEvents=0<br>NumKernels=10<br>KernelExecTime=1<br>MeasureCompletion=0)

        baseline (28.586 μs)   :  0, 28

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
memory_benchmark_sycl QueueInOrderMemcpy from Device to Device, size 1024
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title memory_benchmark_sycl QueueInOrderMemcpy from Device to Device, size 1024
    todayMarker off
    dateFormat  X
    axisFormat %s

    section QueueInOrderMemcpy(api=sycl<br>IsCopyOnly=0<br>sourcePlacement=Device<br>destinationPlacement=Device<br>size=1KB<br>count=100)

        baseline (423.457 μs)   :  0, 423

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
memory_benchmark_sycl QueueInOrderMemcpy from Host to Device, size 1024
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title memory_benchmark_sycl QueueInOrderMemcpy from Host to Device, size 1024
    todayMarker off
    dateFormat  X
    axisFormat %s

    section QueueInOrderMemcpy(api=sycl<br>IsCopyOnly=0<br>sourcePlacement=Host<br>destinationPlacement=Device<br>size=1KB<br>count=100)

        baseline (253.906 μs)   :  0, 253

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
memory_benchmark_sycl QueueMemcpy from Device to Device, size 1024
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title memory_benchmark_sycl QueueMemcpy from Device to Device, size 1024
    todayMarker off
    dateFormat  X
    axisFormat %s

    section QueueMemcpy(api=sycl<br>sourcePlacement=Device<br>destinationPlacement=Device<br>size=1KB)

        baseline (9.179 μs)   :  0, 9

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
memory_benchmark_sycl StreamMemory, placement Device, type Triad, size 10240
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title memory_benchmark_sycl StreamMemory, placement Device, type Triad, size 10240
    todayMarker off
    dateFormat  X
    axisFormat %s

    section StreamMemory(api=sycl<br>type=Triad<br>size=10KB<br>useEvents=0<br>contents=Zeros<br>memoryPlacement=Device)

        baseline (1.854 μs)   :  0, 1

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
api_overhead_benchmark_sycl ExecImmediateCopyQueue out of order from Device to Device, size 1024
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title api_overhead_benchmark_sycl ExecImmediateCopyQueue out of order from Device to Device, size 1024
    todayMarker off
    dateFormat  X
    axisFormat %s

    section ExecImmediateCopyQueue(api=sycl<br>IsCopyOnly=1<br>MeasureCompletionTime=0<br>src=Device<br>dst=Device<br>size=1KB<br>ioq=0)

        baseline (4.506 μs)   :  0, 4

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
api_overhead_benchmark_sycl ExecImmediateCopyQueue in order from Device to Host, size 1024
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title api_overhead_benchmark_sycl ExecImmediateCopyQueue in order from Device to Host, size 1024
    todayMarker off
    dateFormat  X
    axisFormat %s

    section ExecImmediateCopyQueue(api=sycl<br>IsCopyOnly=1<br>MeasureCompletionTime=0<br>src=Host<br>dst=Host<br>size=1KB<br>ioq=1)

        baseline (3.613 μs)   :  0, 3

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
miscellaneous_benchmark_sycl VectorSum
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title miscellaneous_benchmark_sycl VectorSum
    todayMarker off
    dateFormat  X
    axisFormat %s

    section VectorSum(api=sycl<br>numberOfElementsX=512<br>numberOfElementsY=256<br>numberOfElementsZ=256)

        baseline (863.651 μs)   :  0, 863

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading

Details

Velocity-Bench Hashtable

Environment Variables:

Command:

/home/test-user/bench_workdir/hashtable/hashtable_sycl --no-verify

Output:

hashtable - total time for whole calculation: 0.631721 s
212.463702 million keys/second

Velocity-Bench Bitcracker

Environment Variables:

Command:

/home/test-user/bench_workdir/bitcracker/bitcracker -f /home/test-user/bench_workdir/velocity-bench-repo/bitcracker/hash_pass/img_win8_user_hash.txt -d /home/test-user/bench_workdir/velocity-bench-repo/bitcracker/hash_pass/user_passwords_60000.txt -b 60000

Output:

---------> BitCracker: BitLocker password cracking tool <---------

==================================
Retrieving Info

Reading hash file "/home/test-user/bench_workdir/velocity-bench-repo/bitcracker/hash_pass/img_win8_user_hash.txt"

              Attack

================================================
Type of attack: User Password
Psw per thread: 1
max_num_pswd_per_read: 60000
Dictionary: /home/test-user/bench_workdir/velocity-bench-repo/bitcracker/hash_pass/user_passwords_60000.txt
MAC Comparison (-m): Yes

Iter: 1, num passwords read: 60000
Kernel execution:
Effective passwords: 60000
Passwords Range:
npknpByH7N2m3OnLNH1X9DJxLrzIFWk
.....
dL_7uuf3QCz-c6K3xDu0

================================================
Bitcracker attack completed
Total passwords evaluated: 60000
Password not found!

time to subtract from total: 0.0158677 s
bitcracker - total time for whole calculation: 35.6787 s

Velocity-Bench CudaSift

Environment Variables:

Command:

/home/test-user/bench_workdir/cudaSift/cudaSift

Output:

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1217 1256 33.0437% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1117 1267 30.3285% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1239 1272 33.6411% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1222 1257 33.1795% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1235 1266 33.5324% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1214 1248 32.9623% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1141 1255 30.9802% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1240 1276 33.6682% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1228 1263 33.3424% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1223 1257 33.2066% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1244 1275 33.7768% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1106 1255 30.0299% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1226 1261 33.2881% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1232 1267 33.451% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1230 1264 33.3967% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1229 1262 33.3695% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1233 1267 33.4781% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1214 1248 32.9623% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1070 1264 29.0524% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1049 1269 28.4822% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1231 1264 33.4238% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1237 1272 33.5868% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1227 1263 33.3152% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1235 1266 33.5324% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1233 1267 33.4781% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1226 1259 33.2881% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1227 1261 33.3152% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1231 1270 33.4238% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1102 1263 29.9213% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1219 1255 33.098% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1217 1249 33.0437% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1241 1273 33.6954% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1091 1258 29.6226% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1234 1267 33.5053% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1184 1264 32.1477% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1222 1257 33.1795% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1039 1263 28.2107% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1112 1269 30.1928% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1103 1265 29.9484% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1230 1263 33.3967% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1098 1262 29.8127% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1091 1269 29.6226% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1138 1255 30.8987% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1238 1277 33.6139% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1067 1257 28.9709% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1227 1262 33.3152% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1222 1254 33.1795% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1059 1255 28.7537% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1225 1262 33.2609% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1228 1270 33.3424% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Avg workload time = 280.223 ms

Velocity-Bench Easywave

Environment Variables:

Command:

/home/test-user/bench_workdir/easywave/easyWave_sycl -grid /home/test-user/bench_workdir/data/easywave/examples/e2Asean.grd -source /home/test-user/bench_workdir/data/easywave/examples/BengkuluSept2007.flt -time 120

Output:

MAIN: Starting SYCL main program
SYCL: SYCL Queue initialization successful
SYCL: Using SYCL device : Intel(R) Data Center GPU Max 1100 (Driver version 1.3.29735+27)
SYCL: Platform : Intel(R) oneAPI Unified Runtime over Level-Zero
MAIN: Program successfully completed

Velocity-Bench QuickSilver

Environment Variables:

QS_DEVICE=GPU

Command:

/home/test-user/bench_workdir/QuickSilver/qs -i /home/test-user/bench_workdir/velocity-bench-repo/QuickSilver/Examples/AllScattering/scatteringOnly.inp

Output:

Copyright (c) 2016
Lawrence Livermore National Security, LLC
All Rights Reserved
Quicksilver Version :
Quicksilver Git Hash :
MPI Version : 3.0
Number of MPI ranks : 1
Number of OpenMP Threads: 1
Number of OpenMP CPUs : 1

Loading params
Finished loading params
Simulation:
dt: 1e-08
fMax: 0.1
inputFile: /home/test-user/bench_workdir/velocity-bench-repo/QuickSilver/Examples/AllScattering/scatteringOnly.inp
energySpectrum:
boundaryCondition: octant
loadBalance: 1
cycleTimers: 0
debugThreads: 0
lx: 100
ly: 100
lz: 100
nParticles: 10000000
batchSize: 0
nBatches: 10
nSteps: 10
nx: 10
ny: 10
nz: 10
seed: 1029384756
xDom: 0
yDom: 0
zDom: 0
eMax: 20
eMin: 1e-09
nGroups: 230
lowWeightCutoff: 0.001
bTally: 1
fTally: 1
cTally: 1
coralBenchmark: 0
crossSectionsOut:

Geometry:
material: sourceMaterial
shape: brick
xMax: 100
xMin: 0
yMax: 100
yMin: 0
zMax: 100
zMin: 0

Material:
name: sourceMaterial
mass: 1000
nIsotopes: 10
nReactions: 9
sourceRate: 1e+10
totalCrossSection: 0.1
absorptionCrossSection: flat
fissionCrossSection: flat
scatteringCrossSection: flat
absorptionCrossSectionRatio: 0
fissionCrossSectionRatio: 0
scatteringCrossSectionRatio: 1

CrossSection:
name: flat
A: 0
B: 0
C: 0
D: 0
E: 1
nuBar: 2.4
setting GPU
setting parameters
Building partition 0
Building partition 1
Building partition 2
Building partition 3
Building MC_Domain 0
Building MC_Domain 1
Building MC_Domain 2
Building MC_Domain 3
Starting Consistency Check
Finished Consistency Check
Finished initMesh
Started copyMaterialDatabase_device
Finished copyMaterialDatabase_device
Finished copyNuclearData_device
Finished copyDomainDevice
cycle start source rr split absorb scatter fission produce collisn escape census num_seg scalar_flux cycleInit cycleTracking cycleFinalize
0 0 1000000 0 9000000 0 18533189 0 0 18533189 1151780 8848220 55527935 1.854923e+09 4.351420e-01 8.258620e-01 0.000000e+00
1 8848220 1000000 0 151478 0 34281997 0 0 34281997 1664159 8335539 94633679 5.047651e+09 3.614050e-01 9.668870e-01 0.000000e+00
2 8335539 1000000 0 663717 0 34354432 0 0 34354432 1366771 8632485 95010375 7.705930e+09 3.586800e-01 9.833950e-01 0.000000e+00
3 8632485 1000000 0 367978 0 34302727 0 0 34302727 1242216 8758247 94953591 9.992076e+09 4.048920e-01 1.061983e+00 0.000000e+00
4 8758247 1000000 0 242076 0 34141236 0 0 34141236 1168452 8831871 94599337 1.199834e+10 3.399210e-01 1.028483e+00 0.000000e+00
5 8831871 1000000 0 168070 0 33948724 0 0 33948724 1121156 8878785 94148236 1.377636e+10 3.433380e-01 9.788310e-01 0.000000e+00
6 8878785 1000000 0 120572 0 33760567 0 0 33760567 1089103 8910254 93689264 1.535668e+10 3.306580e-01 9.791370e-01 0.000000e+00
7 8910254 1000000 0 89810 0 33552179 0 0 33552179 1065203 8934861 93216931 1.676993e+10 3.303060e-01 1.023620e+00 0.000000e+00
8 8934861 1000000 0 65491 0 33384605 0 0 33384605 1047720 8952632 92768273 1.804559e+10 3.280730e-01 1.018522e+00 0.000000e+00
9 8952632 1000000 0 47165 0 33198494 0 0 33198494 1033968 8965829 92324678 1.920208e+10 3.295950e-01 9.725800e-01 0.000000e+00

Timer Cumulative Cumulative Cumulative Cumulative Cumulative Cumulative
Name number microSecs microSecs microSecs microSecs Efficiency
of calls min avg max stddev Rating
main 1 1.340e+07 1.340e+07 1.340e+07 0.000e+00 100.00
cycleInit 10 3.562e+06 3.562e+06 3.562e+06 0.000e+00 100.00
cycleTracking 10 9.839e+06 9.839e+06 9.839e+06 0.000e+00 100.00
cycleTracking_Kernel 104 4.938e+06 4.938e+06 4.938e+06 0.000e+00 100.00
cycleTracking_MPI 117 2.091e+05 2.091e+05 2.091e+05 0.000e+00 100.00
cycleTracking_Test_Done 0 0.000e+00 0.000e+00 0.000e+00 0.000e+00 0.00
cycleFinalize 20 4.100e+02 4.100e+02 4.100e+02 0.000e+00 100.00
Figure Of Merit 91.56 [Num Mega Segments / Cycle Tracking Time]

Velocity-Bench Sobel Filter

Environment Variables:

OPENCV_IO_MAX_IMAGE_PIXELS=1677721600

Command:

/home/test-user/bench_workdir/sobel_filter/sobel_filter -i /home/test-user/bench_workdir/data/sobel_filter/sobel_filter_data/silverfalls_32Kx32K.png -n 5

Output:

SYMN: Welcome to the SYCL version of Sobel filter workload.
SYMN: Input image file: /home/test-user/bench_workdir/data/sobel_filter/sobel_filter_data/silverfalls_32Kx32K.png
SYMN: Launching SYCL kernel with # of iterations: 5
time to subtract from total: 7.65489 s
sobelfilter - total time for whole calculation: 0.963407 s

Runtime_BlockedTransform_iter_128_blocksize_256

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/blocked_transform --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/BlockedTransform_multi.csv --size=512

Output:

Runtime_BlockedTransform_iter_64_blocksize_256

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/blocked_transform --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/BlockedTransform_multi.csv --size=512

Output:

Runtime_BlockedTransform_iter_256_blocksize_256

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/blocked_transform --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/BlockedTransform_multi.csv --size=512

Output:

Runtime_BlockedTransform_iter_512_blocksize_256

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/blocked_transform --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/BlockedTransform_multi.csv --size=512

Output:

Runtime_IndependentDAGTaskThroughput_SingleTask

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/dag_task_throughput_independent --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/IndependentDAGTaskThroughput_multi.csv --size=512

Output:

Runtime_IndependentDAGTaskThroughput_BasicParallelFor

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/dag_task_throughput_independent --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/IndependentDAGTaskThroughput_multi.csv --size=512

Output:

Runtime_IndependentDAGTaskThroughput_HierarchicalParallelFor

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/dag_task_throughput_independent --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/IndependentDAGTaskThroughput_multi.csv --size=512

Output:

Runtime_IndependentDAGTaskThroughput_NDRangeParallelFor

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/dag_task_throughput_independent --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/IndependentDAGTaskThroughput_multi.csv --size=512

Output:

Runtime_DAGTaskThroughput_BasicParallelFor

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/dag_task_throughput_sequential --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/DAGTaskThroughput_multi.csv --size=512

Output:

Runtime_DAGTaskThroughput_SingleTask

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/dag_task_throughput_sequential --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/DAGTaskThroughput_multi.csv --size=512

Output:

Runtime_DAGTaskThroughput_HierarchicalParallelFor

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/dag_task_throughput_sequential --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/DAGTaskThroughput_multi.csv --size=512

Output:

Runtime_DAGTaskThroughput_NDRangeParallelFor

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/dag_task_throughput_sequential --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/DAGTaskThroughput_multi.csv --size=512

Output:

MicroBench_LocalMem_fp32_4096

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/local_mem --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/LocalMem_multi.csv --size=512

Output:

MicroBench_LocalMem_int32_4096

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/local_mem --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/LocalMem_multi.csv --size=512

Output:

MicroBench_L2_int32_4

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/pattern_L2 --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/L2_multi.csv

Output:

MicroBench_L2_fp32_4

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/pattern_L2 --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/L2_multi.csv

Output:

MicroBench_L2_int32_8

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/pattern_L2 --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/L2_multi.csv

Output:

MicroBench_L2_int32_16

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/pattern_L2 --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/L2_multi.csv

Output:

MicroBench_L2_int32_2

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/pattern_L2 --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/L2_multi.csv

Output:

MicroBench_L2_fp32_1

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/pattern_L2 --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/L2_multi.csv

Output:

MicroBench_L2_fp32_8

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/pattern_L2 --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/L2_multi.csv

Output:

MicroBench_L2_fp32_16

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/pattern_L2 --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/L2_multi.csv

Output:

MicroBench_L2_int32_1

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/pattern_L2 --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/L2_multi.csv

Output:

MicroBench_L2_fp32_2

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/pattern_L2 --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/L2_multi.csv

Output:

Pattern_Reduction_Hierarchical_int32

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/reduction --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Pattern_Reduction_multi.csv

Output:

Pattern_Reduction_Hierarchical_fp32

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/reduction --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Pattern_Reduction_multi.csv

Output:

Pattern_Reduction_NDRange_int32

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/reduction --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Pattern_Reduction_multi.csv

Output:

Pattern_Reduction_NDRange_int64

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/reduction --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Pattern_Reduction_multi.csv

Output:

Pattern_Reduction_Hierarchical_int64

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/reduction --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Pattern_Reduction_multi.csv

Output:

Pattern_Reduction_NDRange_fp32

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/reduction --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Pattern_Reduction_multi.csv

Output:

ScalarProduct_Hierarchical_int32

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/scalar_prod --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/ScalarProduct_multi.csv

Output:

ScalarProduct_NDRange_fp32

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/scalar_prod --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/ScalarProduct_multi.csv

Output:

ScalarProduct_NDRange_int32

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/scalar_prod --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/ScalarProduct_multi.csv

Output:

ScalarProduct_NDRange_int64

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/scalar_prod --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/ScalarProduct_multi.csv

Output:

ScalarProduct_Hierarchical_int64

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/scalar_prod --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/ScalarProduct_multi.csv

Output:

ScalarProduct_Hierarchical_fp32

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/scalar_prod --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/ScalarProduct_multi.csv

Output:

Pattern_SegmentedReduction_Hierarchical_int16

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/segmentedreduction --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Pattern_SegmentedReduction_multi.csv

Output:

Pattern_SegmentedReduction_Hierarchical_int32

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/segmentedreduction --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Pattern_SegmentedReduction_multi.csv

Output:

Pattern_SegmentedReduction_Hierarchical_int64

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/segmentedreduction --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Pattern_SegmentedReduction_multi.csv

Output:

Pattern_SegmentedReduction_Hierarchical_fp32

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/segmentedreduction --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Pattern_SegmentedReduction_multi.csv

Output:

Pattern_SegmentedReduction_NDRange_int64

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/segmentedreduction --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Pattern_SegmentedReduction_multi.csv

Output:

Pattern_SegmentedReduction_NDRange_int16

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/segmentedreduction --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Pattern_SegmentedReduction_multi.csv

Output:

Pattern_SegmentedReduction_NDRange_int32

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/segmentedreduction --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Pattern_SegmentedReduction_multi.csv

Output:

Pattern_SegmentedReduction_NDRange_fp32

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/segmentedreduction --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Pattern_SegmentedReduction_multi.csv

Output:

USM_Latency_fp32_out_of_order__

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/usm_accessors_latency --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/USM_Latency_multi.csv

Output:

SYCL2020_Accessors_Latency_fp32_out_of_order__

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/usm_accessors_latency --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/USM_Latency_multi.csv

Output:

USM_Latency_fp32_in_order__

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/usm_accessors_latency --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/USM_Latency_multi.csv

Output:

SYCL2020_Accessors_Latency_fp32_in_order__

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/usm_accessors_latency --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/USM_Latency_multi.csv

Output:

USM_Allocation_latency_fp32_host

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/usm_allocation_latency --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/USM_Allocation_latency_multi.csv

Output:

USM_Allocation_latency_fp32_shared

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/usm_allocation_latency --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/USM_Allocation_latency_multi.csv

Output:

USM_Allocation_latency_fp32_device

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/usm_allocation_latency --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/USM_Allocation_latency_multi.csv

Output:

USM_Instr_Mix_fp32_shared_1:1mix_no_init_with_prefetch

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/usm_instr_mix --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/USM_Instr_Mix_multi.csv

Output:

USM_Instr_Mix_fp32_shared_1:1mix_with_init_with_prefetch

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/usm_instr_mix --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/USM_Instr_Mix_multi.csv

Output:

USM_Instr_Mix_fp32_device_1:1mix_no_init_no_prefetch

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/usm_instr_mix --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/USM_Instr_Mix_multi.csv

Output:

USM_Instr_Mix_fp32_device_1:1mix_with_init_no_prefetch

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/usm_instr_mix --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/USM_Instr_Mix_multi.csv

Output:

USM_Instr_Mix_fp32_shared_1:1mix_with_init_no_prefetch

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/usm_instr_mix --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/USM_Instr_Mix_multi.csv

Output:

USM_Instr_Mix_fp32_host_1:1mix_no_init_no_prefetch

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/usm_instr_mix --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/USM_Instr_Mix_multi.csv

Output:

USM_Instr_Mix_fp32_shared_1:1mix_no_init_no_prefetch

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/usm_instr_mix --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/USM_Instr_Mix_multi.csv

Output:

USM_Instr_Mix_fp32_host_1:1mix_with_init_no_prefetch

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/usm_instr_mix --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/USM_Instr_Mix_multi.csv

Output:

USM_Pinned_Overhead_fp32_HostDevice_Pinned_Init_1

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/usm_pinned_overhead --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/USM_Pinned_Overhead_multi.csv

Output:

USM_Pinned_Overhead_fp32_DeviceHost_Pinned_Init_1

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/usm_pinned_overhead --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/USM_Pinned_Overhead_multi.csv

Output:

USM_Pinned_Overhead_fp32_HostDevice_NonPinned_Init_1

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/usm_pinned_overhead --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/USM_Pinned_Overhead_multi.csv

Output:

USM_Pinned_Overhead_fp32_DeviceHost_NonPinned_Init_1

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/usm_pinned_overhead --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/USM_Pinned_Overhead_multi.csv

Output:

VectorAddition_fp32

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/vec_add --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/VectorAddition_multi.csv

Output:

VectorAddition_int32

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/vec_add --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/VectorAddition_multi.csv

Output:

VectorAddition_int64

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/vec_add --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/VectorAddition_multi.csv

Output:

Polybench_2DConvolution

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/2DConvolution --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/2DConvolution.csv

Output:

Polybench_2mm

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/2mm --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/2mm.csv --size=512

Output:

Polybench_3mm

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/3mm --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/3mm.csv --size=512

Output:

MicroBench_Arith_int32_512

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/arith --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Arith_int32_512.csv --size=16384

Output:

MicroBench_Arith_fp32_512

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/arith --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Arith_int32_512.csv --size=16384

Output:

Polybench_Atax

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/atax --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Atax.csv --size=8192

Output:

ReductionAtomic_int32

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/atomic_reduction --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/ReductionAtomic_fp64.csv

Output:

ReductionAtomic_fp32

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/atomic_reduction --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/ReductionAtomic_fp64.csv

Output:

ReductionAtomic_fp64

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/atomic_reduction --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/ReductionAtomic_fp64.csv

Output:

ReductionAtomic_int64

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/atomic_reduction --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/ReductionAtomic_fp64.csv

Output:

Polybench_Bicg

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/bicg --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Bicg.csv --size=8192

Output:

Polybench_Correlation

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/correlation --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Correlation.csv --size=512

Output:

Polybench_Covariance

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/covariance --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Covariance.csv --size=512

Output:

Polybench_Gemm

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/gemm --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Gemm.csv --size=1024

Output:

Polybench_Gesummv

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/gesummv --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Gesummv.csv --size=8192

Output:

Polybench_Gramschmidt

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/gramschmidt --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Gramschmidt.csv --size=512

Output:

Kmeans_fp32

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/kmeans --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Kmeans.csv --size=67108864

Output:

LinearRegressionCoeff_fp32

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/lin_reg_coeff --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/LinearRegressionCoeff.csv

Output:

LinearRegression_fp32

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/lin_reg_error --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/LinearRegression.csv

Output:

MatmulChain

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/matmulchain --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/MatmulChain.csv --size=1024

Output:

MolecularDynamics

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/mol_dyn --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/MolecularDynamics.csv

Output:

Polybench_Mvt

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/mvt --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Mvt.csv --size=16384

Output:

MicroBench_sf_fp32_16

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/sf --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/sf_16.csv --size=--size=100000000

Output:

Polybench_Syr2k

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/syr2k --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Syr2k.csv --size=1024

Output:

Polybench_Syrk

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/syrk --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Syrk.csv --size=1024

Output:

Copy link

Compute Benchmarks level_zero run (with params: ):
https://github.com/oneapi-src/unified-runtime/actions/runs/10959153191

Copy link

Compute Benchmarks level_zero run ():
https://github.com/oneapi-src/unified-runtime/actions/runs/10959153191
Job status: success. Test status: success.

Summary

result is better

Performance change in benchmark groups

"Perf change in group Velocity: 1.0471070215994367"
Benchmark This PR baseline Relative perf Change -
Velocity-Bench Hashtable 240.254858 178.291413 134.8% 34.75% ++++++++++
Velocity-Bench Sobel Filter 955.85 934.963 97.8% -2.19% -
Velocity-Bench Bitcracker - 35.8407
Velocity-Bench CudaSift - 283.294
Velocity-Bench Easywave - 457.0
Velocity-Bench QuickSilver - 115.63
"Perf change in group Runtime: 1.0"
Benchmark This PR baseline Relative perf Change -
Runtime_BlockedTransform_iter_64_blocksize_256 0.356 -
Runtime_BlockedTransform_iter_256_blocksize_256 0.08399999999999999 -
Runtime_BlockedTransform_iter_512_blocksize_256 0.081 -
Runtime_BlockedTransform_iter_128_blocksize_256 0.152 -
Runtime_IndependentDAGTaskThroughput_BasicParallelFor 5.792000000000001 -
Runtime_IndependentDAGTaskThroughput_HierarchicalParallelFor 5.593 -
Runtime_IndependentDAGTaskThroughput_NDRangeParallelFor 5.588 -
Runtime_IndependentDAGTaskThroughput_SingleTask 6.563000000000001 -
Runtime_DAGTaskThroughput_SingleTask 6.548 -
Runtime_DAGTaskThroughput_NDRangeParallelFor 4.957 -
Runtime_DAGTaskThroughput_BasicParallelFor 6.054 -
Runtime_DAGTaskThroughput_HierarchicalParallelFor 5.37 -
"Perf change in group Polybench: 1.0"
Benchmark This PR baseline Relative perf Change -
Polybench_2DConvolution 0.23 -
Polybench_2mm 1.239 -
Polybench_3mm 1.7429999999999999 -
Polybench_Syr2k 6.2989999999999995 -
Polybench_Syrk 3.201 -
"Perf change in group MicroBench: 1.0"
Benchmark This PR baseline Relative perf Change -
MicroBench_Arith_int32_512 0.073 -
MicroBench_Arith_fp32_512 0.032 -
"Perf change in group api: 1.0"
Benchmark This PR baseline Relative perf Change -
api_overhead_benchmark_sycl SubmitKernel out of order - 50.631
api_overhead_benchmark_sycl SubmitKernel in order - 49.385
api_overhead_benchmark_ur SubmitKernel out of order - 31.93
api_overhead_benchmark_ur SubmitKernel in order - 28.586
api_overhead_benchmark_sycl ExecImmediateCopyQueue out of order from Device to Device, size 1024 - 4.506
api_overhead_benchmark_sycl ExecImmediateCopyQueue in order from Device to Host, size 1024 - 3.613
"Perf change in group memory: 1.0"
Benchmark This PR baseline Relative perf Change -
memory_benchmark_sycl QueueInOrderMemcpy from Device to Device, size 1024 - 423.457
memory_benchmark_sycl QueueInOrderMemcpy from Host to Device, size 1024 - 253.906
memory_benchmark_sycl QueueMemcpy from Device to Device, size 1024 - 9.179
memory_benchmark_sycl StreamMemory, placement Device, type Triad, size 10240 - 1.854
"Perf change in group miscellaneous: 1.0"
Benchmark This PR baseline Relative perf Change -
miscellaneous_benchmark_sycl VectorSum - 863.651

Charts

Velocity-Bench Hashtable
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Velocity-Bench Hashtable
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Velocity-Bench<br>Hashtable

        This PR (240.254858 M keys/sec)   : crit, 0, 240

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

    section hashtable

        baseline (178.291413 M keys/sec)   :  0, 178

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Velocity-Bench Sobel Filter
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Velocity-Bench Sobel Filter
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Velocity-Bench<br>Sobel<br>Filter

        This PR (955.85 ms)   : crit, 0, 955

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

    section sobel_filter

        baseline (934.963 ms)   :  0, 934

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Runtime_BlockedTransform_iter_64_blocksize_256
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Runtime_BlockedTransform_iter_64_blocksize_256
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Runtime_BlockedTransform_iter_64_blocksize_256

        This PR (0.356 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Runtime_BlockedTransform_iter_256_blocksize_256
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Runtime_BlockedTransform_iter_256_blocksize_256
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Runtime_BlockedTransform_iter_256_blocksize_256

        This PR (0.08399999999999999 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Runtime_BlockedTransform_iter_512_blocksize_256
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Runtime_BlockedTransform_iter_512_blocksize_256
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Runtime_BlockedTransform_iter_512_blocksize_256

        This PR (0.081 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Runtime_BlockedTransform_iter_128_blocksize_256
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Runtime_BlockedTransform_iter_128_blocksize_256
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Runtime_BlockedTransform_iter_128_blocksize_256

        This PR (0.152 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Runtime_IndependentDAGTaskThroughput_BasicParallelFor
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Runtime_IndependentDAGTaskThroughput_BasicParallelFor
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Runtime_IndependentDAGTaskThroughput_BasicParallelFor

        This PR (5.792000000000001 ms)   : crit, 0, 5

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Runtime_IndependentDAGTaskThroughput_HierarchicalParallelFor
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Runtime_IndependentDAGTaskThroughput_HierarchicalParallelFor
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Runtime_IndependentDAGTaskThroughput_HierarchicalParallelFor

        This PR (5.593 ms)   : crit, 0, 5

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Runtime_IndependentDAGTaskThroughput_NDRangeParallelFor
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Runtime_IndependentDAGTaskThroughput_NDRangeParallelFor
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Runtime_IndependentDAGTaskThroughput_NDRangeParallelFor

        This PR (5.588 ms)   : crit, 0, 5

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Runtime_IndependentDAGTaskThroughput_SingleTask
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Runtime_IndependentDAGTaskThroughput_SingleTask
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Runtime_IndependentDAGTaskThroughput_SingleTask

        This PR (6.563000000000001 ms)   : crit, 0, 6

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Runtime_DAGTaskThroughput_SingleTask
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Runtime_DAGTaskThroughput_SingleTask
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Runtime_DAGTaskThroughput_SingleTask

        This PR (6.548 ms)   : crit, 0, 6

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Runtime_DAGTaskThroughput_NDRangeParallelFor
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Runtime_DAGTaskThroughput_NDRangeParallelFor
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Runtime_DAGTaskThroughput_NDRangeParallelFor

        This PR (4.957 ms)   : crit, 0, 4

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Runtime_DAGTaskThroughput_BasicParallelFor
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Runtime_DAGTaskThroughput_BasicParallelFor
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Runtime_DAGTaskThroughput_BasicParallelFor

        This PR (6.054 ms)   : crit, 0, 6

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Runtime_DAGTaskThroughput_HierarchicalParallelFor
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Runtime_DAGTaskThroughput_HierarchicalParallelFor
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Runtime_DAGTaskThroughput_HierarchicalParallelFor

        This PR (5.37 ms)   : crit, 0, 5

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Polybench_2DConvolution
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Polybench_2DConvolution
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Polybench_2DConvolution

        This PR (0.23 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Polybench_2mm
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Polybench_2mm
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Polybench_2mm

        This PR (1.239 ms)   : crit, 0, 1

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Polybench_3mm
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Polybench_3mm
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Polybench_3mm

        This PR (1.7429999999999999 ms)   : crit, 0, 1

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
MicroBench_Arith_int32_512
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title MicroBench_Arith_int32_512
    todayMarker off
    dateFormat  X
    axisFormat %s

    section MicroBench_Arith_int32_512

        This PR (0.073 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
MicroBench_Arith_fp32_512
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title MicroBench_Arith_fp32_512
    todayMarker off
    dateFormat  X
    axisFormat %s

    section MicroBench_Arith_fp32_512

        This PR (0.032 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Polybench_Syr2k
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Polybench_Syr2k
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Polybench_Syr2k

        This PR (6.2989999999999995 ms)   : crit, 0, 6

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Polybench_Syrk
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Polybench_Syrk
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Polybench_Syrk

        This PR (3.201 ms)   : crit, 0, 3

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
api_overhead_benchmark_sycl SubmitKernel out of order
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title api_overhead_benchmark_sycl SubmitKernel out of order
    todayMarker off
    dateFormat  X
    axisFormat %s

    section SubmitKernel(api=sycl<br>Profiling=0<br>Ioq=0<br>DiscardEvents=0<br>NumKernels=10<br>KernelExecTime=1<br>MeasureCompletion=0)

        baseline (50.631 μs)   :  0, 50

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
api_overhead_benchmark_sycl SubmitKernel in order
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title api_overhead_benchmark_sycl SubmitKernel in order
    todayMarker off
    dateFormat  X
    axisFormat %s

    section SubmitKernel(api=sycl<br>Profiling=0<br>Ioq=1<br>DiscardEvents=0<br>NumKernels=10<br>KernelExecTime=1<br>MeasureCompletion=0)

        baseline (49.385 μs)   :  0, 49

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
api_overhead_benchmark_ur SubmitKernel out of order
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title api_overhead_benchmark_ur SubmitKernel out of order
    todayMarker off
    dateFormat  X
    axisFormat %s

    section SubmitKernel(api=ur<br>Profiling=0<br>Ioq=0<br>DiscardEvents=0<br>NumKernels=10<br>KernelExecTime=1<br>MeasureCompletion=0)

        baseline (31.93 μs)   :  0, 31

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
api_overhead_benchmark_ur SubmitKernel in order
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title api_overhead_benchmark_ur SubmitKernel in order
    todayMarker off
    dateFormat  X
    axisFormat %s

    section SubmitKernel(api=ur<br>Profiling=0<br>Ioq=1<br>DiscardEvents=0<br>NumKernels=10<br>KernelExecTime=1<br>MeasureCompletion=0)

        baseline (28.586 μs)   :  0, 28

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
memory_benchmark_sycl QueueInOrderMemcpy from Device to Device, size 1024
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title memory_benchmark_sycl QueueInOrderMemcpy from Device to Device, size 1024
    todayMarker off
    dateFormat  X
    axisFormat %s

    section QueueInOrderMemcpy(api=sycl<br>IsCopyOnly=0<br>sourcePlacement=Device<br>destinationPlacement=Device<br>size=1KB<br>count=100)

        baseline (423.457 μs)   :  0, 423

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
memory_benchmark_sycl QueueInOrderMemcpy from Host to Device, size 1024
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title memory_benchmark_sycl QueueInOrderMemcpy from Host to Device, size 1024
    todayMarker off
    dateFormat  X
    axisFormat %s

    section QueueInOrderMemcpy(api=sycl<br>IsCopyOnly=0<br>sourcePlacement=Host<br>destinationPlacement=Device<br>size=1KB<br>count=100)

        baseline (253.906 μs)   :  0, 253

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
memory_benchmark_sycl QueueMemcpy from Device to Device, size 1024
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title memory_benchmark_sycl QueueMemcpy from Device to Device, size 1024
    todayMarker off
    dateFormat  X
    axisFormat %s

    section QueueMemcpy(api=sycl<br>sourcePlacement=Device<br>destinationPlacement=Device<br>size=1KB)

        baseline (9.179 μs)   :  0, 9

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
memory_benchmark_sycl StreamMemory, placement Device, type Triad, size 10240
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title memory_benchmark_sycl StreamMemory, placement Device, type Triad, size 10240
    todayMarker off
    dateFormat  X
    axisFormat %s

    section StreamMemory(api=sycl<br>type=Triad<br>size=10KB<br>useEvents=0<br>contents=Zeros<br>memoryPlacement=Device)

        baseline (1.854 μs)   :  0, 1

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
api_overhead_benchmark_sycl ExecImmediateCopyQueue out of order from Device to Device, size 1024
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title api_overhead_benchmark_sycl ExecImmediateCopyQueue out of order from Device to Device, size 1024
    todayMarker off
    dateFormat  X
    axisFormat %s

    section ExecImmediateCopyQueue(api=sycl<br>IsCopyOnly=1<br>MeasureCompletionTime=0<br>src=Device<br>dst=Device<br>size=1KB<br>ioq=0)

        baseline (4.506 μs)   :  0, 4

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
api_overhead_benchmark_sycl ExecImmediateCopyQueue in order from Device to Host, size 1024
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title api_overhead_benchmark_sycl ExecImmediateCopyQueue in order from Device to Host, size 1024
    todayMarker off
    dateFormat  X
    axisFormat %s

    section ExecImmediateCopyQueue(api=sycl<br>IsCopyOnly=1<br>MeasureCompletionTime=0<br>src=Host<br>dst=Host<br>size=1KB<br>ioq=1)

        baseline (3.613 μs)   :  0, 3

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
miscellaneous_benchmark_sycl VectorSum
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title miscellaneous_benchmark_sycl VectorSum
    todayMarker off
    dateFormat  X
    axisFormat %s

    section VectorSum(api=sycl<br>numberOfElementsX=512<br>numberOfElementsY=256<br>numberOfElementsZ=256)

        baseline (863.651 μs)   :  0, 863

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Velocity-Bench Bitcracker
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Velocity-Bench Bitcracker
    todayMarker off
    dateFormat  X
    axisFormat %s

    section bitcracker

        baseline (35.8407 s)   :  0, 35

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Velocity-Bench CudaSift
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Velocity-Bench CudaSift
    todayMarker off
    dateFormat  X
    axisFormat %s

    section cudaSift

        baseline (283.294 ms)   :  0, 283

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Velocity-Bench Easywave
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Velocity-Bench Easywave
    todayMarker off
    dateFormat  X
    axisFormat %s

    section easywave

        baseline (457.0 ms)   :  0, 457

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Velocity-Bench QuickSilver
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Velocity-Bench QuickSilver
    todayMarker off
    dateFormat  X
    axisFormat %s

    section QuickSilver

        baseline (115.63 MMS/CTT)   :  0, 115

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading

Details

Velocity-Bench Hashtable

Environment Variables:

Command:

/home/test-user/bench_workdir/hashtable/hashtable_sycl --no-verify

Output:

hashtable - total time for whole calculation: 0.558647 s
240.254858 million keys/second

Velocity-Bench Sobel Filter

Environment Variables:

OPENCV_IO_MAX_IMAGE_PIXELS=1677721600

Command:

/home/test-user/bench_workdir/sobel_filter/sobel_filter -i /home/test-user/bench_workdir/data/sobel_filter/sobel_filter_data/silverfalls_32Kx32K.png -n 5

Output:

SYMN: Welcome to the SYCL version of Sobel filter workload.
SYMN: Input image file: /home/test-user/bench_workdir/data/sobel_filter/sobel_filter_data/silverfalls_32Kx32K.png
SYMN: Launching SYCL kernel with # of iterations: 5
time to subtract from total: 7.56778 s
sobelfilter - total time for whole calculation: 0.95585 s

Runtime_BlockedTransform_iter_64_blocksize_256

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/blocked_transform --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/BlockedTransform_multi.csv --size=512

Output:

Runtime_BlockedTransform_iter_256_blocksize_256

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/blocked_transform --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/BlockedTransform_multi.csv --size=512

Output:

Runtime_BlockedTransform_iter_512_blocksize_256

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/blocked_transform --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/BlockedTransform_multi.csv --size=512

Output:

Runtime_BlockedTransform_iter_128_blocksize_256

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/blocked_transform --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/BlockedTransform_multi.csv --size=512

Output:

Runtime_IndependentDAGTaskThroughput_BasicParallelFor

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/dag_task_throughput_independent --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/IndependentDAGTaskThroughput_multi.csv --size=512

Output:

Runtime_IndependentDAGTaskThroughput_HierarchicalParallelFor

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/dag_task_throughput_independent --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/IndependentDAGTaskThroughput_multi.csv --size=512

Output:

Runtime_IndependentDAGTaskThroughput_NDRangeParallelFor

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/dag_task_throughput_independent --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/IndependentDAGTaskThroughput_multi.csv --size=512

Output:

Runtime_IndependentDAGTaskThroughput_SingleTask

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/dag_task_throughput_independent --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/IndependentDAGTaskThroughput_multi.csv --size=512

Output:

Runtime_DAGTaskThroughput_SingleTask

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/dag_task_throughput_sequential --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/DAGTaskThroughput_multi.csv --size=512

Output:

Runtime_DAGTaskThroughput_NDRangeParallelFor

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/dag_task_throughput_sequential --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/DAGTaskThroughput_multi.csv --size=512

Output:

Runtime_DAGTaskThroughput_BasicParallelFor

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/dag_task_throughput_sequential --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/DAGTaskThroughput_multi.csv --size=512

Output:

Runtime_DAGTaskThroughput_HierarchicalParallelFor

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/dag_task_throughput_sequential --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/DAGTaskThroughput_multi.csv --size=512

Output:

Polybench_2DConvolution

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/2DConvolution --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/2DConvolution.csv

Output:

Polybench_2mm

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/2mm --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/2mm.csv --size=512

Output:

Polybench_3mm

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/3mm --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/3mm.csv --size=512

Output:

MicroBench_Arith_int32_512

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/arith --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Arith_int32_512.csv --size=16384

Output:

MicroBench_Arith_fp32_512

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/arith --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Arith_int32_512.csv --size=16384

Output:

Polybench_Syr2k

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/syr2k --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Syr2k.csv --size=1024

Output:

Polybench_Syrk

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/syrk --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Syrk.csv --size=1024

Output:

Copy link

Compute Benchmarks level_zero run (with params: ):
https://github.com/oneapi-src/unified-runtime/actions/runs/10959615616

Copy link

Compute Benchmarks level_zero run ():
https://github.com/oneapi-src/unified-runtime/actions/runs/10959615616
Job status: success. Test status: success.

Summary

result is better

Performance change in benchmark groups

"Relative perf in group Velocity-Bench: 1.0444796768814102"
Benchmark This PR baseline Relative perf Change -
Velocity-Bench Hashtable 206.158023 178.291413 115.6% 15.63% +++++
Velocity-Bench Bitcracker 35.6408 35.8407 100.6% 0.56% 0
Velocity-Bench CudaSift 240.071 283.294 118.0% 18.00% ++++++
Velocity-Bench Easywave 354 457.0 129.1% 29.10% ++++++++++
Velocity-Bench QuickSilver 89.64 115.63 77.5% -22.48% --------
Velocity-Bench Sobel Filter 988.861 934.963 94.5% -5.45% --
"Relative perf in group Runtime: 1.0"
Benchmark This PR baseline Relative perf Change -
Runtime_BlockedTransform_iter_256_blocksize_256 0.08399999999999999 -
Runtime_BlockedTransform_iter_64_blocksize_256 0.356 -
Runtime_BlockedTransform_iter_128_blocksize_256 0.153 -
Runtime_BlockedTransform_iter_512_blocksize_256 0.081 -
Runtime_IndependentDAGTaskThroughput_SingleTask 6.465 -
Runtime_IndependentDAGTaskThroughput_NDRangeParallelFor 5.587 -
Runtime_IndependentDAGTaskThroughput_HierarchicalParallelFor 5.5840000000000005 -
Runtime_IndependentDAGTaskThroughput_BasicParallelFor 5.792000000000001 -
Runtime_DAGTaskThroughput_SingleTask 6.548 -
Runtime_DAGTaskThroughput_HierarchicalParallelFor 5.329000000000001 -
Runtime_DAGTaskThroughput_BasicParallelFor 6.054 -
Runtime_DAGTaskThroughput_NDRangeParallelFor 4.927 -
"Relative perf in group MicroBench: 1.0"
Benchmark This PR baseline Relative perf Change -
MicroBench_LocalMem_int32_4096 0.229 -
MicroBench_LocalMem_fp32_4096 0.201 -
MicroBench_L2_fp32_4 0.026 -
MicroBench_L2_int32_8 0.027 -
MicroBench_L2_int32_4 0.027 -
MicroBench_L2_fp32_16 0.026 -
MicroBench_L2_int32_16 0.026 -
MicroBench_L2_fp32_1 0.026 -
MicroBench_L2_fp32_2 0.029 -
MicroBench_L2_int32_2 0.027 -
MicroBench_L2_int32_1 0.034 -
MicroBench_L2_fp32_8 0.026 -
MicroBench_Arith_int32_512 0.073 -
MicroBench_Arith_fp32_512 0.032 -
MicroBench_sf_fp32_16 0.025 -
"Relative perf in group Pattern: 1.0"
Benchmark This PR baseline Relative perf Change -
Pattern_Reduction_Hierarchical_int32 0.052 -
Pattern_Reduction_Hierarchical_fp32 0.052 -
Pattern_Reduction_NDRange_int32 0.074 -
Pattern_Reduction_NDRange_int64 0.053 -
Pattern_Reduction_Hierarchical_int64 0.051 -
Pattern_Reduction_NDRange_fp32 0.025 -
Pattern_SegmentedReduction_Hierarchical_int16 0.030000000000000002 -
Pattern_SegmentedReduction_NDRange_int16 0.049 -
Pattern_SegmentedReduction_NDRange_int32 0.027 -
Pattern_SegmentedReduction_NDRange_int64 0.018000000000000002 -
Pattern_SegmentedReduction_Hierarchical_int64 0.029 -
Pattern_SegmentedReduction_Hierarchical_int32 0.028 -
Pattern_SegmentedReduction_NDRange_fp32 0.014 -
Pattern_SegmentedReduction_Hierarchical_fp32 0.030000000000000002 -
"Relative perf in group ScalarProduct: 1.0"
Benchmark This PR baseline Relative perf Change -
ScalarProduct_Hierarchical_int32 0.062 -
ScalarProduct_NDRange_int32 0.152 -
ScalarProduct_Hierarchical_int64 0.063 -
ScalarProduct_Hierarchical_fp32 0.059 -
ScalarProduct_NDRange_int64 0.098 -
ScalarProduct_NDRange_fp32 0.04 -
"Relative perf in group USM: 1.0"
Benchmark This PR baseline Relative perf Change -
USM_Latency_fp32_in_order__ 33.709 -
USM_Latency_fp32_out_of_order__ 46.89 -
USM_Allocation_latency_fp32_device 0.009000000000000001 -
USM_Allocation_latency_fp32_shared 0.118 -
USM_Allocation_latency_fp32_host 0.002 -
USM_Instr_Mix_fp32_host_1:1mix_no_init_no_prefetch 1.849 -
USM_Instr_Mix_fp32_device_1:1mix_no_init_no_prefetch 3.09 -
USM_Instr_Mix_fp32_shared_1:1mix_with_init_no_prefetch 14.111 -
USM_Instr_Mix_fp32_shared_1:1mix_no_init_no_prefetch 13.643 -
USM_Instr_Mix_fp32_shared_1:1mix_with_init_with_prefetch 15.211 -
USM_Instr_Mix_fp32_device_1:1mix_with_init_no_prefetch 3.201 -
USM_Instr_Mix_fp32_host_1:1mix_with_init_no_prefetch 1.718 -
USM_Instr_Mix_fp32_shared_1:1mix_no_init_with_prefetch 15.339 -
USM_Pinned_Overhead_fp32_HostDevice_Pinned_Init_1 0.011 -
USM_Pinned_Overhead_fp32_HostDevice_NonPinned_Init_1 0.015000000000000001 -
USM_Pinned_Overhead_fp32_DeviceHost_NonPinned_Init_1 0.42700000000000005 -
USM_Pinned_Overhead_fp32_DeviceHost_Pinned_Init_1 0.019 -
"Relative perf in group SYCL2020: 1.0"
Benchmark This PR baseline Relative perf Change -
SYCL2020_Accessors_Latency_fp32_out_of_order__ 70.866 -
SYCL2020_Accessors_Latency_fp32_in_order__ 69.089 -
"Relative perf in group VectorAddition: 1.0"
Benchmark This PR baseline Relative perf Change -
VectorAddition_int64 0.04 -
VectorAddition_fp32 0.032 -
VectorAddition_int32 0.038 -
"Relative perf in group Polybench: 1.0"
Benchmark This PR baseline Relative perf Change -
Polybench_2DConvolution 0.23 -
Polybench_2mm 1.239 -
Polybench_3mm 1.7429999999999999 -
Polybench_Atax 6.901 -
Polybench_Bicg 5.122 -
Polybench_Correlation 94.61 -
Polybench_Covariance 94.47 -
Polybench_Gemm 3.965 -
Polybench_Gesummv 7.316999999999999 -
Polybench_Gramschmidt 285.066 -
Polybench_Mvt 3.626 -
Polybench_Syr2k 6.3 -
Polybench_Syrk 3.201 -
"Relative perf in group ReductionAtomic: 1.0"
Benchmark This PR baseline Relative perf Change -
ReductionAtomic_fp64 0.043000000000000003 -
ReductionAtomic_int32 0.041999999999999996 -
ReductionAtomic_fp32 0.041 -
ReductionAtomic_int64 0.041 -
"Relative perf in group Kmeans: 1.0"
Benchmark This PR baseline Relative perf Change -
Kmeans_fp32 1.792 -
"Relative perf in group LinearRegressionCoeff: 1.0"
Benchmark This PR baseline Relative perf Change -
LinearRegressionCoeff_fp32 1.3339999999999999 -
"Relative perf in group LinearRegression: 1.0"
Benchmark This PR baseline Relative perf Change -
LinearRegression_fp32 0.358 -
"Relative perf in group MatmulChain: 1.0"
Benchmark This PR baseline Relative perf Change -
MatmulChain 11.029 -
"Relative perf in group MolecularDynamics: 1.0"
Benchmark This PR baseline Relative perf Change -
MolecularDynamics 0.066 -
"Relative perf in group api: 1.0"
Benchmark This PR baseline Relative perf Change -
api_overhead_benchmark_sycl SubmitKernel out of order - 50.631
api_overhead_benchmark_sycl SubmitKernel in order - 49.385
api_overhead_benchmark_ur SubmitKernel out of order - 31.93
api_overhead_benchmark_ur SubmitKernel in order - 28.586
api_overhead_benchmark_sycl ExecImmediateCopyQueue out of order from Device to Device, size 1024 - 4.506
api_overhead_benchmark_sycl ExecImmediateCopyQueue in order from Device to Host, size 1024 - 3.613
"Relative perf in group memory: 1.0"
Benchmark This PR baseline Relative perf Change -
memory_benchmark_sycl QueueInOrderMemcpy from Device to Device, size 1024 - 423.457
memory_benchmark_sycl QueueInOrderMemcpy from Host to Device, size 1024 - 253.906
memory_benchmark_sycl QueueMemcpy from Device to Device, size 1024 - 9.179
memory_benchmark_sycl StreamMemory, placement Device, type Triad, size 10240 - 1.854
"Relative perf in group miscellaneous: 1.0"
Benchmark This PR baseline Relative perf Change -
miscellaneous_benchmark_sycl VectorSum - 863.651

Charts

Velocity-Bench Hashtable
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Velocity-Bench Hashtable
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Velocity-Bench<br>Hashtable

        This PR (206.158023 M keys/sec)   : crit, 0, 206

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

    section hashtable

        baseline (178.291413 M keys/sec)   :  0, 178

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Velocity-Bench Bitcracker
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Velocity-Bench Bitcracker
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Velocity-Bench<br>Bitcracker

        This PR (35.6408 s)   : crit, 0, 35

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

    section bitcracker

        baseline (35.8407 s)   :  0, 35

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Velocity-Bench CudaSift
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Velocity-Bench CudaSift
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Velocity-Bench<br>CudaSift

        This PR (240.071 ms)   : crit, 0, 240

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

    section cudaSift

        baseline (283.294 ms)   :  0, 283

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Velocity-Bench Easywave
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Velocity-Bench Easywave
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Velocity-Bench<br>Easywave

        This PR (354 ms)   : crit, 0, 354

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

    section easywave

        baseline (457.0 ms)   :  0, 457

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Velocity-Bench QuickSilver
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Velocity-Bench QuickSilver
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Velocity-Bench<br>QuickSilver

        This PR (89.64 MMS/CTT)   : crit, 0, 89

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

    section QuickSilver

        baseline (115.63 MMS/CTT)   :  0, 115

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Velocity-Bench Sobel Filter
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Velocity-Bench Sobel Filter
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Velocity-Bench<br>Sobel<br>Filter

        This PR (988.861 ms)   : crit, 0, 988

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

    section sobel_filter

        baseline (934.963 ms)   :  0, 934

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Runtime_BlockedTransform_iter_256_blocksize_256
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Runtime_BlockedTransform_iter_256_blocksize_256
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Runtime_BlockedTransform_iter_256_blocksize_256

        This PR (0.08399999999999999 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Runtime_BlockedTransform_iter_64_blocksize_256
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Runtime_BlockedTransform_iter_64_blocksize_256
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Runtime_BlockedTransform_iter_64_blocksize_256

        This PR (0.356 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Runtime_BlockedTransform_iter_128_blocksize_256
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Runtime_BlockedTransform_iter_128_blocksize_256
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Runtime_BlockedTransform_iter_128_blocksize_256

        This PR (0.153 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Runtime_BlockedTransform_iter_512_blocksize_256
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Runtime_BlockedTransform_iter_512_blocksize_256
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Runtime_BlockedTransform_iter_512_blocksize_256

        This PR (0.081 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Runtime_IndependentDAGTaskThroughput_SingleTask
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Runtime_IndependentDAGTaskThroughput_SingleTask
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Runtime_IndependentDAGTaskThroughput_SingleTask

        This PR (6.465 ms)   : crit, 0, 6

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Runtime_IndependentDAGTaskThroughput_NDRangeParallelFor
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Runtime_IndependentDAGTaskThroughput_NDRangeParallelFor
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Runtime_IndependentDAGTaskThroughput_NDRangeParallelFor

        This PR (5.587 ms)   : crit, 0, 5

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Runtime_IndependentDAGTaskThroughput_HierarchicalParallelFor
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Runtime_IndependentDAGTaskThroughput_HierarchicalParallelFor
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Runtime_IndependentDAGTaskThroughput_HierarchicalParallelFor

        This PR (5.5840000000000005 ms)   : crit, 0, 5

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Runtime_IndependentDAGTaskThroughput_BasicParallelFor
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Runtime_IndependentDAGTaskThroughput_BasicParallelFor
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Runtime_IndependentDAGTaskThroughput_BasicParallelFor

        This PR (5.792000000000001 ms)   : crit, 0, 5

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Runtime_DAGTaskThroughput_SingleTask
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Runtime_DAGTaskThroughput_SingleTask
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Runtime_DAGTaskThroughput_SingleTask

        This PR (6.548 ms)   : crit, 0, 6

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Runtime_DAGTaskThroughput_HierarchicalParallelFor
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Runtime_DAGTaskThroughput_HierarchicalParallelFor
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Runtime_DAGTaskThroughput_HierarchicalParallelFor

        This PR (5.329000000000001 ms)   : crit, 0, 5

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Runtime_DAGTaskThroughput_BasicParallelFor
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Runtime_DAGTaskThroughput_BasicParallelFor
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Runtime_DAGTaskThroughput_BasicParallelFor

        This PR (6.054 ms)   : crit, 0, 6

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Runtime_DAGTaskThroughput_NDRangeParallelFor
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Runtime_DAGTaskThroughput_NDRangeParallelFor
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Runtime_DAGTaskThroughput_NDRangeParallelFor

        This PR (4.927 ms)   : crit, 0, 4

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
MicroBench_LocalMem_int32_4096
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title MicroBench_LocalMem_int32_4096
    todayMarker off
    dateFormat  X
    axisFormat %s

    section MicroBench_LocalMem_int32_4096

        This PR (0.229 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
MicroBench_LocalMem_fp32_4096
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title MicroBench_LocalMem_fp32_4096
    todayMarker off
    dateFormat  X
    axisFormat %s

    section MicroBench_LocalMem_fp32_4096

        This PR (0.201 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
MicroBench_L2_fp32_4
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title MicroBench_L2_fp32_4
    todayMarker off
    dateFormat  X
    axisFormat %s

    section MicroBench_L2_fp32_4

        This PR (0.026 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
MicroBench_L2_int32_8
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title MicroBench_L2_int32_8
    todayMarker off
    dateFormat  X
    axisFormat %s

    section MicroBench_L2_int32_8

        This PR (0.027 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
MicroBench_L2_int32_4
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title MicroBench_L2_int32_4
    todayMarker off
    dateFormat  X
    axisFormat %s

    section MicroBench_L2_int32_4

        This PR (0.027 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
MicroBench_L2_fp32_16
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title MicroBench_L2_fp32_16
    todayMarker off
    dateFormat  X
    axisFormat %s

    section MicroBench_L2_fp32_16

        This PR (0.026 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
MicroBench_L2_int32_16
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title MicroBench_L2_int32_16
    todayMarker off
    dateFormat  X
    axisFormat %s

    section MicroBench_L2_int32_16

        This PR (0.026 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
MicroBench_L2_fp32_1
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title MicroBench_L2_fp32_1
    todayMarker off
    dateFormat  X
    axisFormat %s

    section MicroBench_L2_fp32_1

        This PR (0.026 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
MicroBench_L2_fp32_2
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title MicroBench_L2_fp32_2
    todayMarker off
    dateFormat  X
    axisFormat %s

    section MicroBench_L2_fp32_2

        This PR (0.029 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
MicroBench_L2_int32_2
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title MicroBench_L2_int32_2
    todayMarker off
    dateFormat  X
    axisFormat %s

    section MicroBench_L2_int32_2

        This PR (0.027 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
MicroBench_L2_int32_1
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title MicroBench_L2_int32_1
    todayMarker off
    dateFormat  X
    axisFormat %s

    section MicroBench_L2_int32_1

        This PR (0.034 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
MicroBench_L2_fp32_8
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title MicroBench_L2_fp32_8
    todayMarker off
    dateFormat  X
    axisFormat %s

    section MicroBench_L2_fp32_8

        This PR (0.026 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Pattern_Reduction_Hierarchical_int32
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Pattern_Reduction_Hierarchical_int32
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Pattern_Reduction_Hierarchical_int32

        This PR (0.052 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Pattern_Reduction_Hierarchical_fp32
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Pattern_Reduction_Hierarchical_fp32
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Pattern_Reduction_Hierarchical_fp32

        This PR (0.052 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Pattern_Reduction_NDRange_int32
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Pattern_Reduction_NDRange_int32
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Pattern_Reduction_NDRange_int32

        This PR (0.074 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Pattern_Reduction_NDRange_int64
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Pattern_Reduction_NDRange_int64
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Pattern_Reduction_NDRange_int64

        This PR (0.053 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Pattern_Reduction_Hierarchical_int64
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Pattern_Reduction_Hierarchical_int64
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Pattern_Reduction_Hierarchical_int64

        This PR (0.051 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Pattern_Reduction_NDRange_fp32
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Pattern_Reduction_NDRange_fp32
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Pattern_Reduction_NDRange_fp32

        This PR (0.025 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
ScalarProduct_Hierarchical_int32
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title ScalarProduct_Hierarchical_int32
    todayMarker off
    dateFormat  X
    axisFormat %s

    section ScalarProduct_Hierarchical_int32

        This PR (0.062 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
ScalarProduct_NDRange_int32
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title ScalarProduct_NDRange_int32
    todayMarker off
    dateFormat  X
    axisFormat %s

    section ScalarProduct_NDRange_int32

        This PR (0.152 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
ScalarProduct_Hierarchical_int64
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title ScalarProduct_Hierarchical_int64
    todayMarker off
    dateFormat  X
    axisFormat %s

    section ScalarProduct_Hierarchical_int64

        This PR (0.063 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
ScalarProduct_Hierarchical_fp32
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title ScalarProduct_Hierarchical_fp32
    todayMarker off
    dateFormat  X
    axisFormat %s

    section ScalarProduct_Hierarchical_fp32

        This PR (0.059 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
ScalarProduct_NDRange_int64
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title ScalarProduct_NDRange_int64
    todayMarker off
    dateFormat  X
    axisFormat %s

    section ScalarProduct_NDRange_int64

        This PR (0.098 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
ScalarProduct_NDRange_fp32
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title ScalarProduct_NDRange_fp32
    todayMarker off
    dateFormat  X
    axisFormat %s

    section ScalarProduct_NDRange_fp32

        This PR (0.04 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Pattern_SegmentedReduction_Hierarchical_int16
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Pattern_SegmentedReduction_Hierarchical_int16
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Pattern_SegmentedReduction_Hierarchical_int16

        This PR (0.030000000000000002 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Pattern_SegmentedReduction_NDRange_int16
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Pattern_SegmentedReduction_NDRange_int16
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Pattern_SegmentedReduction_NDRange_int16

        This PR (0.049 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Pattern_SegmentedReduction_NDRange_int32
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Pattern_SegmentedReduction_NDRange_int32
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Pattern_SegmentedReduction_NDRange_int32

        This PR (0.027 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Pattern_SegmentedReduction_NDRange_int64
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Pattern_SegmentedReduction_NDRange_int64
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Pattern_SegmentedReduction_NDRange_int64

        This PR (0.018000000000000002 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Pattern_SegmentedReduction_Hierarchical_int64
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Pattern_SegmentedReduction_Hierarchical_int64
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Pattern_SegmentedReduction_Hierarchical_int64

        This PR (0.029 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Pattern_SegmentedReduction_Hierarchical_int32
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Pattern_SegmentedReduction_Hierarchical_int32
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Pattern_SegmentedReduction_Hierarchical_int32

        This PR (0.028 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Pattern_SegmentedReduction_NDRange_fp32
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Pattern_SegmentedReduction_NDRange_fp32
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Pattern_SegmentedReduction_NDRange_fp32

        This PR (0.014 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Pattern_SegmentedReduction_Hierarchical_fp32
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Pattern_SegmentedReduction_Hierarchical_fp32
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Pattern_SegmentedReduction_Hierarchical_fp32

        This PR (0.030000000000000002 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
USM_Latency_fp32_in_order__
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title USM_Latency_fp32_in_order__
    todayMarker off
    dateFormat  X
    axisFormat %s

    section USM_Latency_fp32_in_order__

        This PR (33.709 ms)   : crit, 0, 33

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
SYCL2020_Accessors_Latency_fp32_out_of_order__
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title SYCL2020_Accessors_Latency_fp32_out_of_order__
    todayMarker off
    dateFormat  X
    axisFormat %s

    section SYCL2020_Accessors_Latency_fp32_out_of_order__

        This PR (70.866 ms)   : crit, 0, 70

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
USM_Latency_fp32_out_of_order__
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title USM_Latency_fp32_out_of_order__
    todayMarker off
    dateFormat  X
    axisFormat %s

    section USM_Latency_fp32_out_of_order__

        This PR (46.89 ms)   : crit, 0, 46

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
SYCL2020_Accessors_Latency_fp32_in_order__
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title SYCL2020_Accessors_Latency_fp32_in_order__
    todayMarker off
    dateFormat  X
    axisFormat %s

    section SYCL2020_Accessors_Latency_fp32_in_order__

        This PR (69.089 ms)   : crit, 0, 69

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
USM_Allocation_latency_fp32_device
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title USM_Allocation_latency_fp32_device
    todayMarker off
    dateFormat  X
    axisFormat %s

    section USM_Allocation_latency_fp32_device

        This PR (0.009000000000000001 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
USM_Allocation_latency_fp32_shared
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title USM_Allocation_latency_fp32_shared
    todayMarker off
    dateFormat  X
    axisFormat %s

    section USM_Allocation_latency_fp32_shared

        This PR (0.118 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
USM_Allocation_latency_fp32_host
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title USM_Allocation_latency_fp32_host
    todayMarker off
    dateFormat  X
    axisFormat %s

    section USM_Allocation_latency_fp32_host

        This PR (0.002 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
USM_Instr_Mix_fp32_host_1:1mix_no_init_no_prefetch
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title USM_Instr_Mix_fp32_host_1:1mix_no_init_no_prefetch
    todayMarker off
    dateFormat  X
    axisFormat %s

    section USM_Instr_Mix_fp32_host_1:1mix_no_init_no_prefetch

        This PR (1.849 ms)   : crit, 0, 1

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
USM_Instr_Mix_fp32_device_1:1mix_no_init_no_prefetch
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title USM_Instr_Mix_fp32_device_1:1mix_no_init_no_prefetch
    todayMarker off
    dateFormat  X
    axisFormat %s

    section USM_Instr_Mix_fp32_device_1:1mix_no_init_no_prefetch

        This PR (3.09 ms)   : crit, 0, 3

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
USM_Instr_Mix_fp32_shared_1:1mix_with_init_no_prefetch
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title USM_Instr_Mix_fp32_shared_1:1mix_with_init_no_prefetch
    todayMarker off
    dateFormat  X
    axisFormat %s

    section USM_Instr_Mix_fp32_shared_1:1mix_with_init_no_prefetch

        This PR (14.111 ms)   : crit, 0, 14

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
USM_Instr_Mix_fp32_shared_1:1mix_no_init_no_prefetch
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title USM_Instr_Mix_fp32_shared_1:1mix_no_init_no_prefetch
    todayMarker off
    dateFormat  X
    axisFormat %s

    section USM_Instr_Mix_fp32_shared_1:1mix_no_init_no_prefetch

        This PR (13.643 ms)   : crit, 0, 13

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
USM_Instr_Mix_fp32_shared_1:1mix_with_init_with_prefetch
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title USM_Instr_Mix_fp32_shared_1:1mix_with_init_with_prefetch
    todayMarker off
    dateFormat  X
    axisFormat %s

    section USM_Instr_Mix_fp32_shared_1:1mix_with_init_with_prefetch

        This PR (15.211 ms)   : crit, 0, 15

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
USM_Instr_Mix_fp32_device_1:1mix_with_init_no_prefetch
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title USM_Instr_Mix_fp32_device_1:1mix_with_init_no_prefetch
    todayMarker off
    dateFormat  X
    axisFormat %s

    section USM_Instr_Mix_fp32_device_1:1mix_with_init_no_prefetch

        This PR (3.201 ms)   : crit, 0, 3

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
USM_Instr_Mix_fp32_host_1:1mix_with_init_no_prefetch
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title USM_Instr_Mix_fp32_host_1:1mix_with_init_no_prefetch
    todayMarker off
    dateFormat  X
    axisFormat %s

    section USM_Instr_Mix_fp32_host_1:1mix_with_init_no_prefetch

        This PR (1.718 ms)   : crit, 0, 1

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
USM_Instr_Mix_fp32_shared_1:1mix_no_init_with_prefetch
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title USM_Instr_Mix_fp32_shared_1:1mix_no_init_with_prefetch
    todayMarker off
    dateFormat  X
    axisFormat %s

    section USM_Instr_Mix_fp32_shared_1:1mix_no_init_with_prefetch

        This PR (15.339 ms)   : crit, 0, 15

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
USM_Pinned_Overhead_fp32_HostDevice_Pinned_Init_1
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title USM_Pinned_Overhead_fp32_HostDevice_Pinned_Init_1
    todayMarker off
    dateFormat  X
    axisFormat %s

    section USM_Pinned_Overhead_fp32_HostDevice_Pinned_Init_1

        This PR (0.011 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
USM_Pinned_Overhead_fp32_HostDevice_NonPinned_Init_1
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title USM_Pinned_Overhead_fp32_HostDevice_NonPinned_Init_1
    todayMarker off
    dateFormat  X
    axisFormat %s

    section USM_Pinned_Overhead_fp32_HostDevice_NonPinned_Init_1

        This PR (0.015000000000000001 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
USM_Pinned_Overhead_fp32_DeviceHost_NonPinned_Init_1
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title USM_Pinned_Overhead_fp32_DeviceHost_NonPinned_Init_1
    todayMarker off
    dateFormat  X
    axisFormat %s

    section USM_Pinned_Overhead_fp32_DeviceHost_NonPinned_Init_1

        This PR (0.42700000000000005 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
USM_Pinned_Overhead_fp32_DeviceHost_Pinned_Init_1
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title USM_Pinned_Overhead_fp32_DeviceHost_Pinned_Init_1
    todayMarker off
    dateFormat  X
    axisFormat %s

    section USM_Pinned_Overhead_fp32_DeviceHost_Pinned_Init_1

        This PR (0.019 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
VectorAddition_int64
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title VectorAddition_int64
    todayMarker off
    dateFormat  X
    axisFormat %s

    section VectorAddition_int64

        This PR (0.04 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
VectorAddition_fp32
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title VectorAddition_fp32
    todayMarker off
    dateFormat  X
    axisFormat %s

    section VectorAddition_fp32

        This PR (0.032 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
VectorAddition_int32
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title VectorAddition_int32
    todayMarker off
    dateFormat  X
    axisFormat %s

    section VectorAddition_int32

        This PR (0.038 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Polybench_2DConvolution
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Polybench_2DConvolution
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Polybench_2DConvolution

        This PR (0.23 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Polybench_2mm
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Polybench_2mm
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Polybench_2mm

        This PR (1.239 ms)   : crit, 0, 1

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Polybench_3mm
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Polybench_3mm
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Polybench_3mm

        This PR (1.7429999999999999 ms)   : crit, 0, 1

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
MicroBench_Arith_int32_512
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title MicroBench_Arith_int32_512
    todayMarker off
    dateFormat  X
    axisFormat %s

    section MicroBench_Arith_int32_512

        This PR (0.073 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
MicroBench_Arith_fp32_512
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title MicroBench_Arith_fp32_512
    todayMarker off
    dateFormat  X
    axisFormat %s

    section MicroBench_Arith_fp32_512

        This PR (0.032 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Polybench_Atax
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Polybench_Atax
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Polybench_Atax

        This PR (6.901 ms)   : crit, 0, 6

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
ReductionAtomic_fp64
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title ReductionAtomic_fp64
    todayMarker off
    dateFormat  X
    axisFormat %s

    section ReductionAtomic_fp64

        This PR (0.043000000000000003 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
ReductionAtomic_int32
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title ReductionAtomic_int32
    todayMarker off
    dateFormat  X
    axisFormat %s

    section ReductionAtomic_int32

        This PR (0.041999999999999996 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
ReductionAtomic_fp32
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title ReductionAtomic_fp32
    todayMarker off
    dateFormat  X
    axisFormat %s

    section ReductionAtomic_fp32

        This PR (0.041 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
ReductionAtomic_int64
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title ReductionAtomic_int64
    todayMarker off
    dateFormat  X
    axisFormat %s

    section ReductionAtomic_int64

        This PR (0.041 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Polybench_Bicg
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Polybench_Bicg
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Polybench_Bicg

        This PR (5.122 ms)   : crit, 0, 5

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Polybench_Correlation
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Polybench_Correlation
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Polybench_Correlation

        This PR (94.61 ms)   : crit, 0, 94

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Polybench_Covariance
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Polybench_Covariance
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Polybench_Covariance

        This PR (94.47 ms)   : crit, 0, 94

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Polybench_Gemm
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Polybench_Gemm
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Polybench_Gemm

        This PR (3.965 ms)   : crit, 0, 3

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Polybench_Gesummv
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Polybench_Gesummv
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Polybench_Gesummv

        This PR (7.316999999999999 ms)   : crit, 0, 7

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Polybench_Gramschmidt
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Polybench_Gramschmidt
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Polybench_Gramschmidt

        This PR (285.066 ms)   : crit, 0, 285

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Kmeans_fp32
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Kmeans_fp32
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Kmeans_fp32

        This PR (1.792 ms)   : crit, 0, 1

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
LinearRegressionCoeff_fp32
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title LinearRegressionCoeff_fp32
    todayMarker off
    dateFormat  X
    axisFormat %s

    section LinearRegressionCoeff_fp32

        This PR (1.3339999999999999 ms)   : crit, 0, 1

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
LinearRegression_fp32
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title LinearRegression_fp32
    todayMarker off
    dateFormat  X
    axisFormat %s

    section LinearRegression_fp32

        This PR (0.358 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
MatmulChain
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title MatmulChain
    todayMarker off
    dateFormat  X
    axisFormat %s

    section MatmulChain

        This PR (11.029 ms)   : crit, 0, 11

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
MolecularDynamics
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title MolecularDynamics
    todayMarker off
    dateFormat  X
    axisFormat %s

    section MolecularDynamics

        This PR (0.066 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Polybench_Mvt
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Polybench_Mvt
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Polybench_Mvt

        This PR (3.626 ms)   : crit, 0, 3

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
MicroBench_sf_fp32_16
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title MicroBench_sf_fp32_16
    todayMarker off
    dateFormat  X
    axisFormat %s

    section MicroBench_sf_fp32_16

        This PR (0.025 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Polybench_Syr2k
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Polybench_Syr2k
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Polybench_Syr2k

        This PR (6.3 ms)   : crit, 0, 6

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Polybench_Syrk
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Polybench_Syrk
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Polybench_Syrk

        This PR (3.201 ms)   : crit, 0, 3

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
api_overhead_benchmark_sycl SubmitKernel out of order
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title api_overhead_benchmark_sycl SubmitKernel out of order
    todayMarker off
    dateFormat  X
    axisFormat %s

    section SubmitKernel(api=sycl<br>Profiling=0<br>Ioq=0<br>DiscardEvents=0<br>NumKernels=10<br>KernelExecTime=1<br>MeasureCompletion=0)

        baseline (50.631 μs)   :  0, 50

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
api_overhead_benchmark_sycl SubmitKernel in order
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title api_overhead_benchmark_sycl SubmitKernel in order
    todayMarker off
    dateFormat  X
    axisFormat %s

    section SubmitKernel(api=sycl<br>Profiling=0<br>Ioq=1<br>DiscardEvents=0<br>NumKernels=10<br>KernelExecTime=1<br>MeasureCompletion=0)

        baseline (49.385 μs)   :  0, 49

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
api_overhead_benchmark_ur SubmitKernel out of order
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title api_overhead_benchmark_ur SubmitKernel out of order
    todayMarker off
    dateFormat  X
    axisFormat %s

    section SubmitKernel(api=ur<br>Profiling=0<br>Ioq=0<br>DiscardEvents=0<br>NumKernels=10<br>KernelExecTime=1<br>MeasureCompletion=0)

        baseline (31.93 μs)   :  0, 31

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
api_overhead_benchmark_ur SubmitKernel in order
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title api_overhead_benchmark_ur SubmitKernel in order
    todayMarker off
    dateFormat  X
    axisFormat %s

    section SubmitKernel(api=ur<br>Profiling=0<br>Ioq=1<br>DiscardEvents=0<br>NumKernels=10<br>KernelExecTime=1<br>MeasureCompletion=0)

        baseline (28.586 μs)   :  0, 28

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
memory_benchmark_sycl QueueInOrderMemcpy from Device to Device, size 1024
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title memory_benchmark_sycl QueueInOrderMemcpy from Device to Device, size 1024
    todayMarker off
    dateFormat  X
    axisFormat %s

    section QueueInOrderMemcpy(api=sycl<br>IsCopyOnly=0<br>sourcePlacement=Device<br>destinationPlacement=Device<br>size=1KB<br>count=100)

        baseline (423.457 μs)   :  0, 423

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
memory_benchmark_sycl QueueInOrderMemcpy from Host to Device, size 1024
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title memory_benchmark_sycl QueueInOrderMemcpy from Host to Device, size 1024
    todayMarker off
    dateFormat  X
    axisFormat %s

    section QueueInOrderMemcpy(api=sycl<br>IsCopyOnly=0<br>sourcePlacement=Host<br>destinationPlacement=Device<br>size=1KB<br>count=100)

        baseline (253.906 μs)   :  0, 253

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
memory_benchmark_sycl QueueMemcpy from Device to Device, size 1024
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title memory_benchmark_sycl QueueMemcpy from Device to Device, size 1024
    todayMarker off
    dateFormat  X
    axisFormat %s

    section QueueMemcpy(api=sycl<br>sourcePlacement=Device<br>destinationPlacement=Device<br>size=1KB)

        baseline (9.179 μs)   :  0, 9

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
memory_benchmark_sycl StreamMemory, placement Device, type Triad, size 10240
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title memory_benchmark_sycl StreamMemory, placement Device, type Triad, size 10240
    todayMarker off
    dateFormat  X
    axisFormat %s

    section StreamMemory(api=sycl<br>type=Triad<br>size=10KB<br>useEvents=0<br>contents=Zeros<br>memoryPlacement=Device)

        baseline (1.854 μs)   :  0, 1

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
api_overhead_benchmark_sycl ExecImmediateCopyQueue out of order from Device to Device, size 1024
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title api_overhead_benchmark_sycl ExecImmediateCopyQueue out of order from Device to Device, size 1024
    todayMarker off
    dateFormat  X
    axisFormat %s

    section ExecImmediateCopyQueue(api=sycl<br>IsCopyOnly=1<br>MeasureCompletionTime=0<br>src=Device<br>dst=Device<br>size=1KB<br>ioq=0)

        baseline (4.506 μs)   :  0, 4

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
api_overhead_benchmark_sycl ExecImmediateCopyQueue in order from Device to Host, size 1024
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title api_overhead_benchmark_sycl ExecImmediateCopyQueue in order from Device to Host, size 1024
    todayMarker off
    dateFormat  X
    axisFormat %s

    section ExecImmediateCopyQueue(api=sycl<br>IsCopyOnly=1<br>MeasureCompletionTime=0<br>src=Host<br>dst=Host<br>size=1KB<br>ioq=1)

        baseline (3.613 μs)   :  0, 3

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
miscellaneous_benchmark_sycl VectorSum
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title miscellaneous_benchmark_sycl VectorSum
    todayMarker off
    dateFormat  X
    axisFormat %s

    section VectorSum(api=sycl<br>numberOfElementsX=512<br>numberOfElementsY=256<br>numberOfElementsZ=256)

        baseline (863.651 μs)   :  0, 863

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading

Details

Velocity-Bench Hashtable

Environment Variables:

Command:

/home/test-user/bench_workdir/hashtable/hashtable_sycl --no-verify

Output:

hashtable - total time for whole calculation: 0.651043 s
206.158023 million keys/second

Velocity-Bench Bitcracker

Environment Variables:

Command:

/home/test-user/bench_workdir/bitcracker/bitcracker -f /home/test-user/bench_workdir/velocity-bench-repo/bitcracker/hash_pass/img_win8_user_hash.txt -d /home/test-user/bench_workdir/velocity-bench-repo/bitcracker/hash_pass/user_passwords_60000.txt -b 60000

Output:

---------> BitCracker: BitLocker password cracking tool <---------

==================================
Retrieving Info

Reading hash file "/home/test-user/bench_workdir/velocity-bench-repo/bitcracker/hash_pass/img_win8_user_hash.txt"

              Attack

================================================
Type of attack: User Password
Psw per thread: 1
max_num_pswd_per_read: 60000
Dictionary: /home/test-user/bench_workdir/velocity-bench-repo/bitcracker/hash_pass/user_passwords_60000.txt
MAC Comparison (-m): Yes

Iter: 1, num passwords read: 60000
Kernel execution:
Effective passwords: 60000
Passwords Range:
npknpByH7N2m3OnLNH1X9DJxLrzIFWk
.....
dL_7uuf3QCz-c6K3xDu0

================================================
Bitcracker attack completed
Total passwords evaluated: 60000
Password not found!

time to subtract from total: 0.0153997 s
bitcracker - total time for whole calculation: 35.6408 s

Velocity-Bench CudaSift

Environment Variables:

Command:

/home/test-user/bench_workdir/cudaSift/cudaSift

Output:

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1098 1269 29.8127% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1126 1264 30.5729% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1114 1264 30.2471% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1231 1271 33.4238% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1223 1259 33.2066% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1220 1262 33.1252% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1103 1274 29.9484% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1236 1271 33.5596% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1226 1269 33.2881% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1224 1258 33.2338% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1225 1259 33.2609% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1230 1264 33.3967% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1191 1260 32.3378% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1222 1254 33.1795% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1123 1266 30.4914% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1220 1263 33.1252% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1244 1276 33.7768% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1224 1259 33.2338% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1138 1263 30.8987% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1219 1254 33.098% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1228 1262 33.3424% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1228 1261 33.3424% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1232 1265 33.451% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1238 1272 33.6139% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1111 1265 30.1656% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1236 1270 33.5596% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1237 1272 33.5868% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1230 1263 33.3967% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1123 1271 30.4914% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1227 1261 33.3152% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1102 1254 29.9213% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1233 1266 33.4781% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1228 1263 33.3424% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1229 1264 33.3695% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1237 1276 33.5868% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1228 1262 33.3424% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1227 1259 33.3152% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1237 1270 33.5868% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1087 1253 29.514% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1126 1260 30.5729% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1226 1259 33.2881% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1135 1269 30.8173% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1215 1254 32.9894% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1161 1253 31.5232% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1228 1265 33.3424% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1065 1260 28.9166% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1224 1262 33.2338% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1229 1263 33.3695% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1238 1273 33.6139% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1143 1273 31.0345% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Avg workload time = 240.071 ms

Velocity-Bench Easywave

Environment Variables:

Command:

/home/test-user/bench_workdir/easywave/easyWave_sycl -grid /home/test-user/bench_workdir/data/easywave/examples/e2Asean.grd -source /home/test-user/bench_workdir/data/easywave/examples/BengkuluSept2007.flt -time 120

Output:

MAIN: Starting SYCL main program
SYCL: SYCL Queue initialization successful
SYCL: Using SYCL device : Intel(R) Data Center GPU Max 1100 (Driver version 1.3.29735+27)
SYCL: Platform : Intel(R) oneAPI Unified Runtime over Level-Zero
MAIN: Program successfully completed

Velocity-Bench QuickSilver

Environment Variables:

QS_DEVICE=GPU

Command:

/home/test-user/bench_workdir/QuickSilver/qs -i /home/test-user/bench_workdir/velocity-bench-repo/QuickSilver/Examples/AllScattering/scatteringOnly.inp

Output:

Copyright (c) 2016
Lawrence Livermore National Security, LLC
All Rights Reserved
Quicksilver Version :
Quicksilver Git Hash :
MPI Version : 3.0
Number of MPI ranks : 1
Number of OpenMP Threads: 1
Number of OpenMP CPUs : 1

Loading params
Finished loading params
Simulation:
dt: 1e-08
fMax: 0.1
inputFile: /home/test-user/bench_workdir/velocity-bench-repo/QuickSilver/Examples/AllScattering/scatteringOnly.inp
energySpectrum:
boundaryCondition: octant
loadBalance: 1
cycleTimers: 0
debugThreads: 0
lx: 100
ly: 100
lz: 100
nParticles: 10000000
batchSize: 0
nBatches: 10
nSteps: 10
nx: 10
ny: 10
nz: 10
seed: 1029384756
xDom: 0
yDom: 0
zDom: 0
eMax: 20
eMin: 1e-09
nGroups: 230
lowWeightCutoff: 0.001
bTally: 1
fTally: 1
cTally: 1
coralBenchmark: 0
crossSectionsOut:

Geometry:
material: sourceMaterial
shape: brick
xMax: 100
xMin: 0
yMax: 100
yMin: 0
zMax: 100
zMin: 0

Material:
name: sourceMaterial
mass: 1000
nIsotopes: 10
nReactions: 9
sourceRate: 1e+10
totalCrossSection: 0.1
absorptionCrossSection: flat
fissionCrossSection: flat
scatteringCrossSection: flat
absorptionCrossSectionRatio: 0
fissionCrossSectionRatio: 0
scatteringCrossSectionRatio: 1

CrossSection:
name: flat
A: 0
B: 0
C: 0
D: 0
E: 1
nuBar: 2.4
setting GPU
setting parameters
Building partition 0
Building partition 1
Building partition 2
Building partition 3
Building MC_Domain 0
Building MC_Domain 1
Building MC_Domain 2
Building MC_Domain 3
Starting Consistency Check
Finished Consistency Check
Finished initMesh
Started copyMaterialDatabase_device
Finished copyMaterialDatabase_device
Finished copyNuclearData_device
Finished copyDomainDevice
cycle start source rr split absorb scatter fission produce collisn escape census num_seg scalar_flux cycleInit cycleTracking cycleFinalize
0 0 1000000 0 9000000 0 18533189 0 0 18533189 1151780 8848220 55527935 1.854923e+09 4.501810e-01 8.406180e-01 1.000000e-06
1 8848220 1000000 0 151478 0 34281997 0 0 34281997 1664159 8335539 94633679 5.047651e+09 5.790920e-01 9.947470e-01 0.000000e+00
2 8335539 1000000 0 663717 0 34354432 0 0 34354432 1366771 8632485 95010375 7.705930e+09 5.680660e-01 9.862850e-01 0.000000e+00
3 8632485 1000000 0 367978 0 34302727 0 0 34302727 1242216 8758247 94953591 9.992076e+09 5.725840e-01 1.108194e+00 1.000000e-06
4 8758247 1000000 0 242076 0 34141236 0 0 34141236 1168452 8831871 94599337 1.199834e+10 5.301690e-01 1.043107e+00 1.000000e-06
5 8831871 1000000 0 168070 0 33948724 0 0 33948724 1121156 8878785 94148236 1.377636e+10 4.863480e-01 9.827190e-01 0.000000e+00
6 8878785 1000000 0 120572 0 33760567 0 0 33760567 1089103 8910254 93689264 1.535668e+10 3.444020e-01 9.790370e-01 0.000000e+00
7 8910254 1000000 0 89810 0 33552179 0 0 33552179 1065203 8934861 93216931 1.676993e+10 3.565260e-01 1.038769e+00 1.000000e-06
8 8934861 1000000 0 65491 0 33384605 0 0 33384605 1047720 8952632 92768273 1.804559e+10 3.746330e-01 1.036995e+00 0.000000e+00
9 8952632 1000000 0 47165 0 33198494 0 0 33198494 1033968 8965829 92324678 1.920208e+10 5.254170e-01 1.039748e+00 0.000000e+00

Timer Cumulative Cumulative Cumulative Cumulative Cumulative Cumulative
Name number microSecs microSecs microSecs microSecs Efficiency
of calls min avg max stddev Rating
main 1 1.484e+07 1.484e+07 1.484e+07 0.000e+00 100.00
cycleInit 10 4.787e+06 4.787e+06 4.787e+06 0.000e+00 100.00
cycleTracking 10 1.005e+07 1.005e+07 1.005e+07 0.000e+00 100.00
cycleTracking_Kernel 104 4.945e+06 4.945e+06 4.945e+06 0.000e+00 100.00
cycleTracking_MPI 117 2.767e+05 2.767e+05 2.767e+05 0.000e+00 100.00
cycleTracking_Test_Done 0 0.000e+00 0.000e+00 0.000e+00 0.000e+00 0.00
cycleFinalize 20 7.680e+02 7.680e+02 7.680e+02 0.000e+00 100.00
Figure Of Merit 89.64 [Num Mega Segments / Cycle Tracking Time]

Velocity-Bench Sobel Filter

Environment Variables:

OPENCV_IO_MAX_IMAGE_PIXELS=1677721600

Command:

/home/test-user/bench_workdir/sobel_filter/sobel_filter -i /home/test-user/bench_workdir/data/sobel_filter/sobel_filter_data/silverfalls_32Kx32K.png -n 5

Output:

SYMN: Welcome to the SYCL version of Sobel filter workload.
SYMN: Input image file: /home/test-user/bench_workdir/data/sobel_filter/sobel_filter_data/silverfalls_32Kx32K.png
SYMN: Launching SYCL kernel with # of iterations: 5
time to subtract from total: 14.9581 s
sobelfilter - total time for whole calculation: 0.988861 s

Runtime_BlockedTransform_iter_256_blocksize_256

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/blocked_transform --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/BlockedTransform_multi.csv --size=512

Output:

Runtime_BlockedTransform_iter_64_blocksize_256

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/blocked_transform --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/BlockedTransform_multi.csv --size=512

Output:

Runtime_BlockedTransform_iter_128_blocksize_256

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/blocked_transform --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/BlockedTransform_multi.csv --size=512

Output:

Runtime_BlockedTransform_iter_512_blocksize_256

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/blocked_transform --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/BlockedTransform_multi.csv --size=512

Output:

Runtime_IndependentDAGTaskThroughput_SingleTask

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/dag_task_throughput_independent --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/IndependentDAGTaskThroughput_multi.csv --size=512

Output:

Runtime_IndependentDAGTaskThroughput_NDRangeParallelFor

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/dag_task_throughput_independent --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/IndependentDAGTaskThroughput_multi.csv --size=512

Output:

Runtime_IndependentDAGTaskThroughput_HierarchicalParallelFor

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/dag_task_throughput_independent --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/IndependentDAGTaskThroughput_multi.csv --size=512

Output:

Runtime_IndependentDAGTaskThroughput_BasicParallelFor

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/dag_task_throughput_independent --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/IndependentDAGTaskThroughput_multi.csv --size=512

Output:

Runtime_DAGTaskThroughput_SingleTask

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/dag_task_throughput_sequential --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/DAGTaskThroughput_multi.csv --size=512

Output:

Runtime_DAGTaskThroughput_HierarchicalParallelFor

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/dag_task_throughput_sequential --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/DAGTaskThroughput_multi.csv --size=512

Output:

Runtime_DAGTaskThroughput_BasicParallelFor

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/dag_task_throughput_sequential --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/DAGTaskThroughput_multi.csv --size=512

Output:

Runtime_DAGTaskThroughput_NDRangeParallelFor

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/dag_task_throughput_sequential --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/DAGTaskThroughput_multi.csv --size=512

Output:

MicroBench_LocalMem_int32_4096

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/local_mem --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/LocalMem_multi.csv --size=512

Output:

MicroBench_LocalMem_fp32_4096

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/local_mem --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/LocalMem_multi.csv --size=512

Output:

MicroBench_L2_fp32_4

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/pattern_L2 --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/L2_multi.csv

Output:

MicroBench_L2_int32_8

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/pattern_L2 --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/L2_multi.csv

Output:

MicroBench_L2_int32_4

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/pattern_L2 --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/L2_multi.csv

Output:

MicroBench_L2_fp32_16

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/pattern_L2 --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/L2_multi.csv

Output:

MicroBench_L2_int32_16

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/pattern_L2 --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/L2_multi.csv

Output:

MicroBench_L2_fp32_1

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/pattern_L2 --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/L2_multi.csv

Output:

MicroBench_L2_fp32_2

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/pattern_L2 --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/L2_multi.csv

Output:

MicroBench_L2_int32_2

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/pattern_L2 --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/L2_multi.csv

Output:

MicroBench_L2_int32_1

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/pattern_L2 --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/L2_multi.csv

Output:

MicroBench_L2_fp32_8

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/pattern_L2 --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/L2_multi.csv

Output:

Pattern_Reduction_Hierarchical_int32

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/reduction --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Pattern_Reduction_multi.csv

Output:

Pattern_Reduction_Hierarchical_fp32

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/reduction --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Pattern_Reduction_multi.csv

Output:

Pattern_Reduction_NDRange_int32

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/reduction --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Pattern_Reduction_multi.csv

Output:

Pattern_Reduction_NDRange_int64

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/reduction --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Pattern_Reduction_multi.csv

Output:

Pattern_Reduction_Hierarchical_int64

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/reduction --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Pattern_Reduction_multi.csv

Output:

Pattern_Reduction_NDRange_fp32

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/reduction --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Pattern_Reduction_multi.csv

Output:

ScalarProduct_Hierarchical_int32

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/scalar_prod --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/ScalarProduct_multi.csv

Output:

ScalarProduct_NDRange_int32

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/scalar_prod --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/ScalarProduct_multi.csv

Output:

ScalarProduct_Hierarchical_int64

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/scalar_prod --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/ScalarProduct_multi.csv

Output:

ScalarProduct_Hierarchical_fp32

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/scalar_prod --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/ScalarProduct_multi.csv

Output:

ScalarProduct_NDRange_int64

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/scalar_prod --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/ScalarProduct_multi.csv

Output:

ScalarProduct_NDRange_fp32

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/scalar_prod --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/ScalarProduct_multi.csv

Output:

Pattern_SegmentedReduction_Hierarchical_int16

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/segmentedreduction --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Pattern_SegmentedReduction_multi.csv

Output:

Pattern_SegmentedReduction_NDRange_int16

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/segmentedreduction --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Pattern_SegmentedReduction_multi.csv

Output:

Pattern_SegmentedReduction_NDRange_int32

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/segmentedreduction --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Pattern_SegmentedReduction_multi.csv

Output:

Pattern_SegmentedReduction_NDRange_int64

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/segmentedreduction --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Pattern_SegmentedReduction_multi.csv

Output:

Pattern_SegmentedReduction_Hierarchical_int64

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/segmentedreduction --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Pattern_SegmentedReduction_multi.csv

Output:

Pattern_SegmentedReduction_Hierarchical_int32

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/segmentedreduction --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Pattern_SegmentedReduction_multi.csv

Output:

Pattern_SegmentedReduction_NDRange_fp32

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/segmentedreduction --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Pattern_SegmentedReduction_multi.csv

Output:

Pattern_SegmentedReduction_Hierarchical_fp32

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/segmentedreduction --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Pattern_SegmentedReduction_multi.csv

Output:

USM_Latency_fp32_in_order__

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/usm_accessors_latency --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/USM_Latency_multi.csv

Output:

SYCL2020_Accessors_Latency_fp32_out_of_order__

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/usm_accessors_latency --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/USM_Latency_multi.csv

Output:

USM_Latency_fp32_out_of_order__

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/usm_accessors_latency --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/USM_Latency_multi.csv

Output:

SYCL2020_Accessors_Latency_fp32_in_order__

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/usm_accessors_latency --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/USM_Latency_multi.csv

Output:

USM_Allocation_latency_fp32_device

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/usm_allocation_latency --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/USM_Allocation_latency_multi.csv

Output:

USM_Allocation_latency_fp32_shared

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/usm_allocation_latency --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/USM_Allocation_latency_multi.csv

Output:

USM_Allocation_latency_fp32_host

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/usm_allocation_latency --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/USM_Allocation_latency_multi.csv

Output:

USM_Instr_Mix_fp32_host_1:1mix_no_init_no_prefetch

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/usm_instr_mix --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/USM_Instr_Mix_multi.csv

Output:

USM_Instr_Mix_fp32_device_1:1mix_no_init_no_prefetch

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/usm_instr_mix --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/USM_Instr_Mix_multi.csv

Output:

USM_Instr_Mix_fp32_shared_1:1mix_with_init_no_prefetch

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/usm_instr_mix --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/USM_Instr_Mix_multi.csv

Output:

USM_Instr_Mix_fp32_shared_1:1mix_no_init_no_prefetch

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/usm_instr_mix --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/USM_Instr_Mix_multi.csv

Output:

USM_Instr_Mix_fp32_shared_1:1mix_with_init_with_prefetch

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/usm_instr_mix --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/USM_Instr_Mix_multi.csv

Output:

USM_Instr_Mix_fp32_device_1:1mix_with_init_no_prefetch

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/usm_instr_mix --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/USM_Instr_Mix_multi.csv

Output:

USM_Instr_Mix_fp32_host_1:1mix_with_init_no_prefetch

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/usm_instr_mix --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/USM_Instr_Mix_multi.csv

Output:

USM_Instr_Mix_fp32_shared_1:1mix_no_init_with_prefetch

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/usm_instr_mix --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/USM_Instr_Mix_multi.csv

Output:

USM_Pinned_Overhead_fp32_HostDevice_Pinned_Init_1

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/usm_pinned_overhead --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/USM_Pinned_Overhead_multi.csv

Output:

USM_Pinned_Overhead_fp32_HostDevice_NonPinned_Init_1

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/usm_pinned_overhead --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/USM_Pinned_Overhead_multi.csv

Output:

USM_Pinned_Overhead_fp32_DeviceHost_NonPinned_Init_1

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/usm_pinned_overhead --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/USM_Pinned_Overhead_multi.csv

Output:

USM_Pinned_Overhead_fp32_DeviceHost_Pinned_Init_1

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/usm_pinned_overhead --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/USM_Pinned_Overhead_multi.csv

Output:

VectorAddition_int64

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/vec_add --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/VectorAddition_multi.csv

Output:

VectorAddition_fp32

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/vec_add --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/VectorAddition_multi.csv

Output:

VectorAddition_int32

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/vec_add --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/VectorAddition_multi.csv

Output:

Polybench_2DConvolution

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/2DConvolution --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/2DConvolution.csv

Output:

Polybench_2mm

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/2mm --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/2mm.csv --size=512

Output:

Polybench_3mm

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/3mm --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/3mm.csv --size=512

Output:

MicroBench_Arith_int32_512

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/arith --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Arith_int32_512.csv --size=16384

Output:

MicroBench_Arith_fp32_512

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/arith --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Arith_int32_512.csv --size=16384

Output:

Polybench_Atax

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/atax --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Atax.csv --size=8192

Output:

ReductionAtomic_fp64

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/atomic_reduction --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/ReductionAtomic_fp64.csv

Output:

ReductionAtomic_int32

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/atomic_reduction --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/ReductionAtomic_fp64.csv

Output:

ReductionAtomic_fp32

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/atomic_reduction --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/ReductionAtomic_fp64.csv

Output:

ReductionAtomic_int64

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/atomic_reduction --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/ReductionAtomic_fp64.csv

Output:

Polybench_Bicg

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/bicg --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Bicg.csv --size=8192

Output:

Polybench_Correlation

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/correlation --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Correlation.csv --size=512

Output:

Polybench_Covariance

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/covariance --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Covariance.csv --size=512

Output:

Polybench_Gemm

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/gemm --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Gemm.csv --size=1024

Output:

Polybench_Gesummv

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/gesummv --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Gesummv.csv --size=8192

Output:

Polybench_Gramschmidt

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/gramschmidt --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Gramschmidt.csv --size=512

Output:

Kmeans_fp32

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/kmeans --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Kmeans.csv --size=67108864

Output:

LinearRegressionCoeff_fp32

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/lin_reg_coeff --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/LinearRegressionCoeff.csv

Output:

LinearRegression_fp32

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/lin_reg_error --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/LinearRegression.csv

Output:

MatmulChain

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/matmulchain --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/MatmulChain.csv --size=1024

Output:

MolecularDynamics

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/mol_dyn --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/MolecularDynamics.csv

Output:

Polybench_Mvt

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/mvt --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Mvt.csv --size=16384

Output:

MicroBench_sf_fp32_16

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/sf --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/sf_16.csv --size=--size=100000000

Output:

Polybench_Syr2k

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/syr2k --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Syr2k.csv --size=1024

Output:

Polybench_Syrk

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/syrk --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Syrk.csv --size=1024

Output:

Copy link

Compute Benchmarks level_zero run (with params: ):
https://github.com/oneapi-src/unified-runtime/actions/runs/10960319405

Copy link

Compute Benchmarks level_zero run ():
https://github.com/oneapi-src/unified-runtime/actions/runs/10960319405
Job status: success. Test status: success.

Summary

result is better

Performance change in benchmark groups

"Relative perf in group Velocity-Bench: 0.9770643765903436"
Benchmark This PR baseline Relative perf Change -
Velocity-Bench Hashtable 206.771893 178.291413 116.0% 15.97% +++++++
Velocity-Bench CudaSift 280.091 283.294 101.1% 1.14% +
Velocity-Bench Bitcracker 35.6682 35.8407 100.5% 0.48% 0
Velocity-Bench Easywave 458 457.0 99.8% -0.22% 0
Velocity-Bench Sobel Filter 979.897 934.963 95.4% -4.59% --
Velocity-Bench QuickSilver 89.65 115.63 77.5% -22.47% ----------
"Relative perf in group Runtime: 1.0"
Benchmark This PR baseline Relative perf Change -
Runtime_BlockedTransform_iter_64_blocksize_256 0.356 -
Runtime_BlockedTransform_iter_256_blocksize_256 0.08399999999999999 -
Runtime_BlockedTransform_iter_128_blocksize_256 0.154 -
Runtime_BlockedTransform_iter_512_blocksize_256 0.081 -
Runtime_IndependentDAGTaskThroughput_BasicParallelFor 5.799 -
Runtime_IndependentDAGTaskThroughput_HierarchicalParallelFor 5.577999999999999 -
Runtime_IndependentDAGTaskThroughput_SingleTask 6.467 -
Runtime_IndependentDAGTaskThroughput_NDRangeParallelFor 5.588 -
Runtime_DAGTaskThroughput_BasicParallelFor 5.986 -
Runtime_DAGTaskThroughput_SingleTask 6.535 -
Runtime_DAGTaskThroughput_NDRangeParallelFor 4.887 -
Runtime_DAGTaskThroughput_HierarchicalParallelFor 5.298 -
"Relative perf in group MicroBench: 1.0"
Benchmark This PR baseline Relative perf Change -
MicroBench_LocalMem_fp32_4096 0.201 -
MicroBench_LocalMem_int32_4096 0.229 -
MicroBench_L2_fp32_2 0.029 -
MicroBench_L2_fp32_8 0.026 -
MicroBench_L2_fp32_4 0.026 -
MicroBench_L2_fp32_16 0.026 -
MicroBench_L2_int32_2 0.027 -
MicroBench_L2_int32_16 0.026 -
MicroBench_L2_int32_1 0.034 -
MicroBench_L2_int32_4 0.027 -
MicroBench_L2_fp32_1 0.026 -
MicroBench_L2_int32_8 0.027 -
MicroBench_Arith_fp32_512 0.032 -
MicroBench_Arith_int32_512 0.073 -
MicroBench_sf_fp32_16 0.025 -
"Relative perf in group Pattern: 1.0"
Benchmark This PR baseline Relative perf Change -
Pattern_Reduction_Hierarchical_int32 0.052 -
Pattern_Reduction_Hierarchical_fp32 0.052 -
Pattern_Reduction_Hierarchical_int64 0.051 -
Pattern_Reduction_NDRange_int32 0.075 -
Pattern_Reduction_NDRange_int64 0.052 -
Pattern_Reduction_NDRange_fp32 0.025 -
Pattern_SegmentedReduction_NDRange_fp32 0.014 -
Pattern_SegmentedReduction_NDRange_int16 0.046 -
Pattern_SegmentedReduction_NDRange_int32 0.027 -
Pattern_SegmentedReduction_Hierarchical_int64 0.029 -
Pattern_SegmentedReduction_Hierarchical_int16 0.030000000000000002 -
Pattern_SegmentedReduction_Hierarchical_int32 0.028 -
Pattern_SegmentedReduction_Hierarchical_fp32 0.030000000000000002 -
Pattern_SegmentedReduction_NDRange_int64 0.018000000000000002 -
"Relative perf in group ScalarProduct: 1.0"
Benchmark This PR baseline Relative perf Change -
ScalarProduct_Hierarchical_int64 0.063 -
ScalarProduct_NDRange_int64 0.098 -
ScalarProduct_NDRange_int32 0.152 -
ScalarProduct_Hierarchical_int32 0.062 -
ScalarProduct_Hierarchical_fp32 0.059 -
ScalarProduct_NDRange_fp32 0.04 -
"Relative perf in group SYCL2020: 1.0"
Benchmark This PR baseline Relative perf Change -
SYCL2020_Accessors_Latency_fp32_out_of_order__ 70.866 -
SYCL2020_Accessors_Latency_fp32_in_order__ 68.96900000000001 -
"Relative perf in group USM: 1.0"
Benchmark This PR baseline Relative perf Change -
USM_Latency_fp32_in_order__ 33.44 -
USM_Latency_fp32_out_of_order__ 46.696 -
USM_Allocation_latency_fp32_device 0.009000000000000001 -
USM_Allocation_latency_fp32_host 0.002 -
USM_Allocation_latency_fp32_shared 0.11900000000000001 -
USM_Instr_Mix_fp32_shared_1:1mix_with_init_with_prefetch 15.221 -
USM_Instr_Mix_fp32_shared_1:1mix_no_init_with_prefetch 15.362 -
USM_Instr_Mix_fp32_host_1:1mix_no_init_no_prefetch 1.868 -
USM_Instr_Mix_fp32_device_1:1mix_with_init_no_prefetch 3.2070000000000003 -
USM_Instr_Mix_fp32_device_1:1mix_no_init_no_prefetch 3.09 -
USM_Instr_Mix_fp32_host_1:1mix_with_init_no_prefetch 1.737 -
USM_Instr_Mix_fp32_shared_1:1mix_with_init_no_prefetch 14.111 -
USM_Instr_Mix_fp32_shared_1:1mix_no_init_no_prefetch 13.668 -
USM_Pinned_Overhead_fp32_HostDevice_NonPinned_Init_1 0.015000000000000001 -
USM_Pinned_Overhead_fp32_DeviceHost_NonPinned_Init_1 0.42700000000000005 -
USM_Pinned_Overhead_fp32_DeviceHost_Pinned_Init_1 0.019 -
USM_Pinned_Overhead_fp32_HostDevice_Pinned_Init_1 0.011 -
"Relative perf in group VectorAddition: 1.0"
Benchmark This PR baseline Relative perf Change -
VectorAddition_int64 0.041 -
VectorAddition_fp32 0.032 -
VectorAddition_int32 0.037 -
"Relative perf in group Polybench: 1.0"
Benchmark This PR baseline Relative perf Change -
Polybench_2DConvolution 0.23 -
Polybench_2mm 1.239 -
Polybench_3mm 1.7429999999999999 -
Polybench_Atax 6.901 -
Polybench_Bicg 5.122 -
Polybench_Correlation 94.67299999999999 -
Polybench_Covariance 94.436 -
Polybench_Gemm 3.973 -
Polybench_Gesummv 7.306 -
Polybench_Gramschmidt 285.066 -
Polybench_Mvt 3.636 -
Polybench_Syr2k 6.3020000000000005 -
Polybench_Syrk 3.201 -
"Relative perf in group ReductionAtomic: 1.0"
Benchmark This PR baseline Relative perf Change -
ReductionAtomic_fp32 0.041 -
ReductionAtomic_int64 0.041 -
ReductionAtomic_int32 0.041999999999999996 -
ReductionAtomic_fp64 0.043000000000000003 -
"Relative perf in group Kmeans: 1.0"
Benchmark This PR baseline Relative perf Change -
Kmeans_fp32 1.792 -
"Relative perf in group LinearRegressionCoeff: 1.0"
Benchmark This PR baseline Relative perf Change -
LinearRegressionCoeff_fp32 1.2910000000000001 -
"Relative perf in group LinearRegression: 1.0"
Benchmark This PR baseline Relative perf Change -
LinearRegression_fp32 0.357 -
"Relative perf in group MatmulChain: 1.0"
Benchmark This PR baseline Relative perf Change -
MatmulChain 11.030999999999999 -
"Relative perf in group MolecularDynamics: 1.0"
Benchmark This PR baseline Relative perf Change -
MolecularDynamics 0.066 -
"Relative perf in group api: 1.0"
Benchmark This PR baseline Relative perf Change -
api_overhead_benchmark_sycl SubmitKernel out of order - 50.631
api_overhead_benchmark_sycl SubmitKernel in order - 49.385
api_overhead_benchmark_ur SubmitKernel out of order - 31.93
api_overhead_benchmark_ur SubmitKernel in order - 28.586
api_overhead_benchmark_sycl ExecImmediateCopyQueue out of order from Device to Device, size 1024 - 4.506
api_overhead_benchmark_sycl ExecImmediateCopyQueue in order from Device to Host, size 1024 - 3.613
"Relative perf in group memory: 1.0"
Benchmark This PR baseline Relative perf Change -
memory_benchmark_sycl QueueInOrderMemcpy from Device to Device, size 1024 - 423.457
memory_benchmark_sycl QueueInOrderMemcpy from Host to Device, size 1024 - 253.906
memory_benchmark_sycl QueueMemcpy from Device to Device, size 1024 - 9.179
memory_benchmark_sycl StreamMemory, placement Device, type Triad, size 10240 - 1.854
"Relative perf in group miscellaneous: 1.0"
Benchmark This PR baseline Relative perf Change -
miscellaneous_benchmark_sycl VectorSum - 863.651

Charts

Velocity-Bench Hashtable
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Velocity-Bench Hashtable
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Velocity-Bench<br>Hashtable

        This PR (206.771893 M keys/sec)   : crit, 0, 206

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

    section hashtable

        baseline (178.291413 M keys/sec)   :  0, 178

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Velocity-Bench Bitcracker
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Velocity-Bench Bitcracker
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Velocity-Bench<br>Bitcracker

        This PR (35.6682 s)   : crit, 0, 35

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

    section bitcracker

        baseline (35.8407 s)   :  0, 35

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Velocity-Bench CudaSift
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Velocity-Bench CudaSift
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Velocity-Bench<br>CudaSift

        This PR (280.091 ms)   : crit, 0, 280

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

    section cudaSift

        baseline (283.294 ms)   :  0, 283

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Velocity-Bench Easywave
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Velocity-Bench Easywave
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Velocity-Bench<br>Easywave

        This PR (458 ms)   : crit, 0, 458

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

    section easywave

        baseline (457.0 ms)   :  0, 457

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Velocity-Bench QuickSilver
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Velocity-Bench QuickSilver
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Velocity-Bench<br>QuickSilver

        This PR (89.65 MMS/CTT)   : crit, 0, 89

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

    section QuickSilver

        baseline (115.63 MMS/CTT)   :  0, 115

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Velocity-Bench Sobel Filter
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Velocity-Bench Sobel Filter
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Velocity-Bench<br>Sobel<br>Filter

        This PR (979.897 ms)   : crit, 0, 979

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

    section sobel_filter

        baseline (934.963 ms)   :  0, 934

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Runtime_BlockedTransform_iter_64_blocksize_256
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Runtime_BlockedTransform_iter_64_blocksize_256
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Runtime_BlockedTransform_iter_64_blocksize_256

        This PR (0.356 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Runtime_BlockedTransform_iter_256_blocksize_256
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Runtime_BlockedTransform_iter_256_blocksize_256
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Runtime_BlockedTransform_iter_256_blocksize_256

        This PR (0.08399999999999999 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Runtime_BlockedTransform_iter_128_blocksize_256
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Runtime_BlockedTransform_iter_128_blocksize_256
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Runtime_BlockedTransform_iter_128_blocksize_256

        This PR (0.154 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Runtime_BlockedTransform_iter_512_blocksize_256
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Runtime_BlockedTransform_iter_512_blocksize_256
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Runtime_BlockedTransform_iter_512_blocksize_256

        This PR (0.081 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Runtime_IndependentDAGTaskThroughput_BasicParallelFor
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Runtime_IndependentDAGTaskThroughput_BasicParallelFor
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Runtime_IndependentDAGTaskThroughput_BasicParallelFor

        This PR (5.799 ms)   : crit, 0, 5

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Runtime_IndependentDAGTaskThroughput_HierarchicalParallelFor
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Runtime_IndependentDAGTaskThroughput_HierarchicalParallelFor
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Runtime_IndependentDAGTaskThroughput_HierarchicalParallelFor

        This PR (5.577999999999999 ms)   : crit, 0, 5

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Runtime_IndependentDAGTaskThroughput_SingleTask
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Runtime_IndependentDAGTaskThroughput_SingleTask
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Runtime_IndependentDAGTaskThroughput_SingleTask

        This PR (6.467 ms)   : crit, 0, 6

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Runtime_IndependentDAGTaskThroughput_NDRangeParallelFor
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Runtime_IndependentDAGTaskThroughput_NDRangeParallelFor
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Runtime_IndependentDAGTaskThroughput_NDRangeParallelFor

        This PR (5.588 ms)   : crit, 0, 5

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Runtime_DAGTaskThroughput_BasicParallelFor
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Runtime_DAGTaskThroughput_BasicParallelFor
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Runtime_DAGTaskThroughput_BasicParallelFor

        This PR (5.986 ms)   : crit, 0, 5

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Runtime_DAGTaskThroughput_SingleTask
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Runtime_DAGTaskThroughput_SingleTask
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Runtime_DAGTaskThroughput_SingleTask

        This PR (6.535 ms)   : crit, 0, 6

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Runtime_DAGTaskThroughput_NDRangeParallelFor
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Runtime_DAGTaskThroughput_NDRangeParallelFor
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Runtime_DAGTaskThroughput_NDRangeParallelFor

        This PR (4.887 ms)   : crit, 0, 4

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Runtime_DAGTaskThroughput_HierarchicalParallelFor
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Runtime_DAGTaskThroughput_HierarchicalParallelFor
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Runtime_DAGTaskThroughput_HierarchicalParallelFor

        This PR (5.298 ms)   : crit, 0, 5

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
MicroBench_LocalMem_fp32_4096
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title MicroBench_LocalMem_fp32_4096
    todayMarker off
    dateFormat  X
    axisFormat %s

    section MicroBench_LocalMem_fp32_4096

        This PR (0.201 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
MicroBench_LocalMem_int32_4096
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title MicroBench_LocalMem_int32_4096
    todayMarker off
    dateFormat  X
    axisFormat %s

    section MicroBench_LocalMem_int32_4096

        This PR (0.229 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
MicroBench_L2_fp32_2
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title MicroBench_L2_fp32_2
    todayMarker off
    dateFormat  X
    axisFormat %s

    section MicroBench_L2_fp32_2

        This PR (0.029 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
MicroBench_L2_fp32_8
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title MicroBench_L2_fp32_8
    todayMarker off
    dateFormat  X
    axisFormat %s

    section MicroBench_L2_fp32_8

        This PR (0.026 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
MicroBench_L2_fp32_4
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title MicroBench_L2_fp32_4
    todayMarker off
    dateFormat  X
    axisFormat %s

    section MicroBench_L2_fp32_4

        This PR (0.026 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
MicroBench_L2_fp32_16
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title MicroBench_L2_fp32_16
    todayMarker off
    dateFormat  X
    axisFormat %s

    section MicroBench_L2_fp32_16

        This PR (0.026 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
MicroBench_L2_int32_2
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title MicroBench_L2_int32_2
    todayMarker off
    dateFormat  X
    axisFormat %s

    section MicroBench_L2_int32_2

        This PR (0.027 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
MicroBench_L2_int32_16
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title MicroBench_L2_int32_16
    todayMarker off
    dateFormat  X
    axisFormat %s

    section MicroBench_L2_int32_16

        This PR (0.026 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
MicroBench_L2_int32_1
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title MicroBench_L2_int32_1
    todayMarker off
    dateFormat  X
    axisFormat %s

    section MicroBench_L2_int32_1

        This PR (0.034 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
MicroBench_L2_int32_4
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title MicroBench_L2_int32_4
    todayMarker off
    dateFormat  X
    axisFormat %s

    section MicroBench_L2_int32_4

        This PR (0.027 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
MicroBench_L2_fp32_1
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title MicroBench_L2_fp32_1
    todayMarker off
    dateFormat  X
    axisFormat %s

    section MicroBench_L2_fp32_1

        This PR (0.026 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
MicroBench_L2_int32_8
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title MicroBench_L2_int32_8
    todayMarker off
    dateFormat  X
    axisFormat %s

    section MicroBench_L2_int32_8

        This PR (0.027 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Pattern_Reduction_Hierarchical_int32
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Pattern_Reduction_Hierarchical_int32
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Pattern_Reduction_Hierarchical_int32

        This PR (0.052 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Pattern_Reduction_Hierarchical_fp32
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Pattern_Reduction_Hierarchical_fp32
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Pattern_Reduction_Hierarchical_fp32

        This PR (0.052 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Pattern_Reduction_Hierarchical_int64
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Pattern_Reduction_Hierarchical_int64
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Pattern_Reduction_Hierarchical_int64

        This PR (0.051 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Pattern_Reduction_NDRange_int32
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Pattern_Reduction_NDRange_int32
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Pattern_Reduction_NDRange_int32

        This PR (0.075 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Pattern_Reduction_NDRange_int64
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Pattern_Reduction_NDRange_int64
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Pattern_Reduction_NDRange_int64

        This PR (0.052 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Pattern_Reduction_NDRange_fp32
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Pattern_Reduction_NDRange_fp32
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Pattern_Reduction_NDRange_fp32

        This PR (0.025 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
ScalarProduct_Hierarchical_int64
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title ScalarProduct_Hierarchical_int64
    todayMarker off
    dateFormat  X
    axisFormat %s

    section ScalarProduct_Hierarchical_int64

        This PR (0.063 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
ScalarProduct_NDRange_int64
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title ScalarProduct_NDRange_int64
    todayMarker off
    dateFormat  X
    axisFormat %s

    section ScalarProduct_NDRange_int64

        This PR (0.098 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
ScalarProduct_NDRange_int32
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title ScalarProduct_NDRange_int32
    todayMarker off
    dateFormat  X
    axisFormat %s

    section ScalarProduct_NDRange_int32

        This PR (0.152 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
ScalarProduct_Hierarchical_int32
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title ScalarProduct_Hierarchical_int32
    todayMarker off
    dateFormat  X
    axisFormat %s

    section ScalarProduct_Hierarchical_int32

        This PR (0.062 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
ScalarProduct_Hierarchical_fp32
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title ScalarProduct_Hierarchical_fp32
    todayMarker off
    dateFormat  X
    axisFormat %s

    section ScalarProduct_Hierarchical_fp32

        This PR (0.059 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
ScalarProduct_NDRange_fp32
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title ScalarProduct_NDRange_fp32
    todayMarker off
    dateFormat  X
    axisFormat %s

    section ScalarProduct_NDRange_fp32

        This PR (0.04 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Pattern_SegmentedReduction_NDRange_fp32
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Pattern_SegmentedReduction_NDRange_fp32
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Pattern_SegmentedReduction_NDRange_fp32

        This PR (0.014 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Pattern_SegmentedReduction_NDRange_int16
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Pattern_SegmentedReduction_NDRange_int16
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Pattern_SegmentedReduction_NDRange_int16

        This PR (0.046 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Pattern_SegmentedReduction_NDRange_int32
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Pattern_SegmentedReduction_NDRange_int32
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Pattern_SegmentedReduction_NDRange_int32

        This PR (0.027 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Pattern_SegmentedReduction_Hierarchical_int64
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Pattern_SegmentedReduction_Hierarchical_int64
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Pattern_SegmentedReduction_Hierarchical_int64

        This PR (0.029 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Pattern_SegmentedReduction_Hierarchical_int16
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Pattern_SegmentedReduction_Hierarchical_int16
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Pattern_SegmentedReduction_Hierarchical_int16

        This PR (0.030000000000000002 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Pattern_SegmentedReduction_Hierarchical_int32
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Pattern_SegmentedReduction_Hierarchical_int32
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Pattern_SegmentedReduction_Hierarchical_int32

        This PR (0.028 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Pattern_SegmentedReduction_Hierarchical_fp32
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Pattern_SegmentedReduction_Hierarchical_fp32
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Pattern_SegmentedReduction_Hierarchical_fp32

        This PR (0.030000000000000002 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Pattern_SegmentedReduction_NDRange_int64
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Pattern_SegmentedReduction_NDRange_int64
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Pattern_SegmentedReduction_NDRange_int64

        This PR (0.018000000000000002 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
SYCL2020_Accessors_Latency_fp32_out_of_order__
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title SYCL2020_Accessors_Latency_fp32_out_of_order__
    todayMarker off
    dateFormat  X
    axisFormat %s

    section SYCL2020_Accessors_Latency_fp32_out_of_order__

        This PR (70.866 ms)   : crit, 0, 70

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
SYCL2020_Accessors_Latency_fp32_in_order__
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title SYCL2020_Accessors_Latency_fp32_in_order__
    todayMarker off
    dateFormat  X
    axisFormat %s

    section SYCL2020_Accessors_Latency_fp32_in_order__

        This PR (68.96900000000001 ms)   : crit, 0, 68

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
USM_Latency_fp32_in_order__
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title USM_Latency_fp32_in_order__
    todayMarker off
    dateFormat  X
    axisFormat %s

    section USM_Latency_fp32_in_order__

        This PR (33.44 ms)   : crit, 0, 33

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
USM_Latency_fp32_out_of_order__
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title USM_Latency_fp32_out_of_order__
    todayMarker off
    dateFormat  X
    axisFormat %s

    section USM_Latency_fp32_out_of_order__

        This PR (46.696 ms)   : crit, 0, 46

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
USM_Allocation_latency_fp32_device
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title USM_Allocation_latency_fp32_device
    todayMarker off
    dateFormat  X
    axisFormat %s

    section USM_Allocation_latency_fp32_device

        This PR (0.009000000000000001 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
USM_Allocation_latency_fp32_host
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title USM_Allocation_latency_fp32_host
    todayMarker off
    dateFormat  X
    axisFormat %s

    section USM_Allocation_latency_fp32_host

        This PR (0.002 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
USM_Allocation_latency_fp32_shared
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title USM_Allocation_latency_fp32_shared
    todayMarker off
    dateFormat  X
    axisFormat %s

    section USM_Allocation_latency_fp32_shared

        This PR (0.11900000000000001 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
USM_Instr_Mix_fp32_shared_1:1mix_with_init_with_prefetch
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title USM_Instr_Mix_fp32_shared_1:1mix_with_init_with_prefetch
    todayMarker off
    dateFormat  X
    axisFormat %s

    section USM_Instr_Mix_fp32_shared_1:1mix_with_init_with_prefetch

        This PR (15.221 ms)   : crit, 0, 15

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
USM_Instr_Mix_fp32_shared_1:1mix_no_init_with_prefetch
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title USM_Instr_Mix_fp32_shared_1:1mix_no_init_with_prefetch
    todayMarker off
    dateFormat  X
    axisFormat %s

    section USM_Instr_Mix_fp32_shared_1:1mix_no_init_with_prefetch

        This PR (15.362 ms)   : crit, 0, 15

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
USM_Instr_Mix_fp32_host_1:1mix_no_init_no_prefetch
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title USM_Instr_Mix_fp32_host_1:1mix_no_init_no_prefetch
    todayMarker off
    dateFormat  X
    axisFormat %s

    section USM_Instr_Mix_fp32_host_1:1mix_no_init_no_prefetch

        This PR (1.868 ms)   : crit, 0, 1

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
USM_Instr_Mix_fp32_device_1:1mix_with_init_no_prefetch
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title USM_Instr_Mix_fp32_device_1:1mix_with_init_no_prefetch
    todayMarker off
    dateFormat  X
    axisFormat %s

    section USM_Instr_Mix_fp32_device_1:1mix_with_init_no_prefetch

        This PR (3.2070000000000003 ms)   : crit, 0, 3

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
USM_Instr_Mix_fp32_device_1:1mix_no_init_no_prefetch
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title USM_Instr_Mix_fp32_device_1:1mix_no_init_no_prefetch
    todayMarker off
    dateFormat  X
    axisFormat %s

    section USM_Instr_Mix_fp32_device_1:1mix_no_init_no_prefetch

        This PR (3.09 ms)   : crit, 0, 3

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
USM_Instr_Mix_fp32_host_1:1mix_with_init_no_prefetch
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title USM_Instr_Mix_fp32_host_1:1mix_with_init_no_prefetch
    todayMarker off
    dateFormat  X
    axisFormat %s

    section USM_Instr_Mix_fp32_host_1:1mix_with_init_no_prefetch

        This PR (1.737 ms)   : crit, 0, 1

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
USM_Instr_Mix_fp32_shared_1:1mix_with_init_no_prefetch
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title USM_Instr_Mix_fp32_shared_1:1mix_with_init_no_prefetch
    todayMarker off
    dateFormat  X
    axisFormat %s

    section USM_Instr_Mix_fp32_shared_1:1mix_with_init_no_prefetch

        This PR (14.111 ms)   : crit, 0, 14

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
USM_Instr_Mix_fp32_shared_1:1mix_no_init_no_prefetch
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title USM_Instr_Mix_fp32_shared_1:1mix_no_init_no_prefetch
    todayMarker off
    dateFormat  X
    axisFormat %s

    section USM_Instr_Mix_fp32_shared_1:1mix_no_init_no_prefetch

        This PR (13.668 ms)   : crit, 0, 13

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
USM_Pinned_Overhead_fp32_HostDevice_NonPinned_Init_1
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title USM_Pinned_Overhead_fp32_HostDevice_NonPinned_Init_1
    todayMarker off
    dateFormat  X
    axisFormat %s

    section USM_Pinned_Overhead_fp32_HostDevice_NonPinned_Init_1

        This PR (0.015000000000000001 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
USM_Pinned_Overhead_fp32_DeviceHost_NonPinned_Init_1
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title USM_Pinned_Overhead_fp32_DeviceHost_NonPinned_Init_1
    todayMarker off
    dateFormat  X
    axisFormat %s

    section USM_Pinned_Overhead_fp32_DeviceHost_NonPinned_Init_1

        This PR (0.42700000000000005 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
USM_Pinned_Overhead_fp32_DeviceHost_Pinned_Init_1
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title USM_Pinned_Overhead_fp32_DeviceHost_Pinned_Init_1
    todayMarker off
    dateFormat  X
    axisFormat %s

    section USM_Pinned_Overhead_fp32_DeviceHost_Pinned_Init_1

        This PR (0.019 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
USM_Pinned_Overhead_fp32_HostDevice_Pinned_Init_1
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title USM_Pinned_Overhead_fp32_HostDevice_Pinned_Init_1
    todayMarker off
    dateFormat  X
    axisFormat %s

    section USM_Pinned_Overhead_fp32_HostDevice_Pinned_Init_1

        This PR (0.011 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
VectorAddition_int64
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title VectorAddition_int64
    todayMarker off
    dateFormat  X
    axisFormat %s

    section VectorAddition_int64

        This PR (0.041 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
VectorAddition_fp32
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title VectorAddition_fp32
    todayMarker off
    dateFormat  X
    axisFormat %s

    section VectorAddition_fp32

        This PR (0.032 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
VectorAddition_int32
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title VectorAddition_int32
    todayMarker off
    dateFormat  X
    axisFormat %s

    section VectorAddition_int32

        This PR (0.037 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Polybench_2DConvolution
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Polybench_2DConvolution
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Polybench_2DConvolution

        This PR (0.23 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Polybench_2mm
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Polybench_2mm
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Polybench_2mm

        This PR (1.239 ms)   : crit, 0, 1

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Polybench_3mm
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Polybench_3mm
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Polybench_3mm

        This PR (1.7429999999999999 ms)   : crit, 0, 1

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
MicroBench_Arith_fp32_512
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title MicroBench_Arith_fp32_512
    todayMarker off
    dateFormat  X
    axisFormat %s

    section MicroBench_Arith_fp32_512

        This PR (0.032 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
MicroBench_Arith_int32_512
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title MicroBench_Arith_int32_512
    todayMarker off
    dateFormat  X
    axisFormat %s

    section MicroBench_Arith_int32_512

        This PR (0.073 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Polybench_Atax
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Polybench_Atax
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Polybench_Atax

        This PR (6.901 ms)   : crit, 0, 6

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
ReductionAtomic_fp32
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title ReductionAtomic_fp32
    todayMarker off
    dateFormat  X
    axisFormat %s

    section ReductionAtomic_fp32

        This PR (0.041 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
ReductionAtomic_int64
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title ReductionAtomic_int64
    todayMarker off
    dateFormat  X
    axisFormat %s

    section ReductionAtomic_int64

        This PR (0.041 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
ReductionAtomic_int32
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title ReductionAtomic_int32
    todayMarker off
    dateFormat  X
    axisFormat %s

    section ReductionAtomic_int32

        This PR (0.041999999999999996 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
ReductionAtomic_fp64
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title ReductionAtomic_fp64
    todayMarker off
    dateFormat  X
    axisFormat %s

    section ReductionAtomic_fp64

        This PR (0.043000000000000003 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Polybench_Bicg
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Polybench_Bicg
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Polybench_Bicg

        This PR (5.122 ms)   : crit, 0, 5

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Polybench_Correlation
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Polybench_Correlation
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Polybench_Correlation

        This PR (94.67299999999999 ms)   : crit, 0, 94

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Polybench_Covariance
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Polybench_Covariance
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Polybench_Covariance

        This PR (94.436 ms)   : crit, 0, 94

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Polybench_Gemm
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Polybench_Gemm
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Polybench_Gemm

        This PR (3.973 ms)   : crit, 0, 3

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Polybench_Gesummv
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Polybench_Gesummv
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Polybench_Gesummv

        This PR (7.306 ms)   : crit, 0, 7

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Polybench_Gramschmidt
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Polybench_Gramschmidt
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Polybench_Gramschmidt

        This PR (285.066 ms)   : crit, 0, 285

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Kmeans_fp32
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Kmeans_fp32
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Kmeans_fp32

        This PR (1.792 ms)   : crit, 0, 1

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
LinearRegressionCoeff_fp32
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title LinearRegressionCoeff_fp32
    todayMarker off
    dateFormat  X
    axisFormat %s

    section LinearRegressionCoeff_fp32

        This PR (1.2910000000000001 ms)   : crit, 0, 1

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
LinearRegression_fp32
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title LinearRegression_fp32
    todayMarker off
    dateFormat  X
    axisFormat %s

    section LinearRegression_fp32

        This PR (0.357 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
MatmulChain
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title MatmulChain
    todayMarker off
    dateFormat  X
    axisFormat %s

    section MatmulChain

        This PR (11.030999999999999 ms)   : crit, 0, 11

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
MolecularDynamics
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title MolecularDynamics
    todayMarker off
    dateFormat  X
    axisFormat %s

    section MolecularDynamics

        This PR (0.066 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Polybench_Mvt
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Polybench_Mvt
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Polybench_Mvt

        This PR (3.636 ms)   : crit, 0, 3

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
MicroBench_sf_fp32_16
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title MicroBench_sf_fp32_16
    todayMarker off
    dateFormat  X
    axisFormat %s

    section MicroBench_sf_fp32_16

        This PR (0.025 ms)   : crit, 0, 0

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Polybench_Syr2k
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Polybench_Syr2k
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Polybench_Syr2k

        This PR (6.3020000000000005 ms)   : crit, 0, 6

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
Polybench_Syrk
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title Polybench_Syrk
    todayMarker off
    dateFormat  X
    axisFormat %s

    section Polybench_Syrk

        This PR (3.201 ms)   : crit, 0, 3

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
api_overhead_benchmark_sycl SubmitKernel out of order
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title api_overhead_benchmark_sycl SubmitKernel out of order
    todayMarker off
    dateFormat  X
    axisFormat %s

    section SubmitKernel(api=sycl<br>Profiling=0<br>Ioq=0<br>DiscardEvents=0<br>NumKernels=10<br>KernelExecTime=1<br>MeasureCompletion=0)

        baseline (50.631 μs)   :  0, 50

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
api_overhead_benchmark_sycl SubmitKernel in order
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title api_overhead_benchmark_sycl SubmitKernel in order
    todayMarker off
    dateFormat  X
    axisFormat %s

    section SubmitKernel(api=sycl<br>Profiling=0<br>Ioq=1<br>DiscardEvents=0<br>NumKernels=10<br>KernelExecTime=1<br>MeasureCompletion=0)

        baseline (49.385 μs)   :  0, 49

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
api_overhead_benchmark_ur SubmitKernel out of order
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title api_overhead_benchmark_ur SubmitKernel out of order
    todayMarker off
    dateFormat  X
    axisFormat %s

    section SubmitKernel(api=ur<br>Profiling=0<br>Ioq=0<br>DiscardEvents=0<br>NumKernels=10<br>KernelExecTime=1<br>MeasureCompletion=0)

        baseline (31.93 μs)   :  0, 31

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
api_overhead_benchmark_ur SubmitKernel in order
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title api_overhead_benchmark_ur SubmitKernel in order
    todayMarker off
    dateFormat  X
    axisFormat %s

    section SubmitKernel(api=ur<br>Profiling=0<br>Ioq=1<br>DiscardEvents=0<br>NumKernels=10<br>KernelExecTime=1<br>MeasureCompletion=0)

        baseline (28.586 μs)   :  0, 28

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
memory_benchmark_sycl QueueInOrderMemcpy from Device to Device, size 1024
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title memory_benchmark_sycl QueueInOrderMemcpy from Device to Device, size 1024
    todayMarker off
    dateFormat  X
    axisFormat %s

    section QueueInOrderMemcpy(api=sycl<br>IsCopyOnly=0<br>sourcePlacement=Device<br>destinationPlacement=Device<br>size=1KB<br>count=100)

        baseline (423.457 μs)   :  0, 423

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
memory_benchmark_sycl QueueInOrderMemcpy from Host to Device, size 1024
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title memory_benchmark_sycl QueueInOrderMemcpy from Host to Device, size 1024
    todayMarker off
    dateFormat  X
    axisFormat %s

    section QueueInOrderMemcpy(api=sycl<br>IsCopyOnly=0<br>sourcePlacement=Host<br>destinationPlacement=Device<br>size=1KB<br>count=100)

        baseline (253.906 μs)   :  0, 253

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
memory_benchmark_sycl QueueMemcpy from Device to Device, size 1024
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title memory_benchmark_sycl QueueMemcpy from Device to Device, size 1024
    todayMarker off
    dateFormat  X
    axisFormat %s

    section QueueMemcpy(api=sycl<br>sourcePlacement=Device<br>destinationPlacement=Device<br>size=1KB)

        baseline (9.179 μs)   :  0, 9

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
memory_benchmark_sycl StreamMemory, placement Device, type Triad, size 10240
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title memory_benchmark_sycl StreamMemory, placement Device, type Triad, size 10240
    todayMarker off
    dateFormat  X
    axisFormat %s

    section StreamMemory(api=sycl<br>type=Triad<br>size=10KB<br>useEvents=0<br>contents=Zeros<br>memoryPlacement=Device)

        baseline (1.854 μs)   :  0, 1

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
api_overhead_benchmark_sycl ExecImmediateCopyQueue out of order from Device to Device, size 1024
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title api_overhead_benchmark_sycl ExecImmediateCopyQueue out of order from Device to Device, size 1024
    todayMarker off
    dateFormat  X
    axisFormat %s

    section ExecImmediateCopyQueue(api=sycl<br>IsCopyOnly=1<br>MeasureCompletionTime=0<br>src=Device<br>dst=Device<br>size=1KB<br>ioq=0)

        baseline (4.506 μs)   :  0, 4

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
api_overhead_benchmark_sycl ExecImmediateCopyQueue in order from Device to Host, size 1024
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title api_overhead_benchmark_sycl ExecImmediateCopyQueue in order from Device to Host, size 1024
    todayMarker off
    dateFormat  X
    axisFormat %s

    section ExecImmediateCopyQueue(api=sycl<br>IsCopyOnly=1<br>MeasureCompletionTime=0<br>src=Host<br>dst=Host<br>size=1KB<br>ioq=1)

        baseline (3.613 μs)   :  0, 3

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading
miscellaneous_benchmark_sycl VectorSum
---
config:
    gantt:
        rightPadding: 10
        leftPadding: 120
        sectionFontSize: 10
        numberSectionStyles: 2
---
gantt
    title miscellaneous_benchmark_sycl VectorSum
    todayMarker off
    dateFormat  X
    axisFormat %s

    section VectorSum(api=sycl<br>numberOfElementsX=512<br>numberOfElementsY=256<br>numberOfElementsZ=256)

        baseline (863.651 μs)   :  0, 863

    -   : 0, 0

    -   : 0, 0

    -   : 0, 0

Loading

Details

Velocity-Bench Hashtable

Environment Variables:

Command:

/home/test-user/bench_workdir/hashtable/hashtable_sycl --no-verify

Output:

hashtable - total time for whole calculation: 0.64911 s
206.771893 million keys/second

Velocity-Bench Bitcracker

Environment Variables:

Command:

/home/test-user/bench_workdir/bitcracker/bitcracker -f /home/test-user/bench_workdir/velocity-bench-repo/bitcracker/hash_pass/img_win8_user_hash.txt -d /home/test-user/bench_workdir/velocity-bench-repo/bitcracker/hash_pass/user_passwords_60000.txt -b 60000

Output:

---------> BitCracker: BitLocker password cracking tool <---------

==================================
Retrieving Info

Reading hash file "/home/test-user/bench_workdir/velocity-bench-repo/bitcracker/hash_pass/img_win8_user_hash.txt"

              Attack

================================================
Type of attack: User Password
Psw per thread: 1
max_num_pswd_per_read: 60000
Dictionary: /home/test-user/bench_workdir/velocity-bench-repo/bitcracker/hash_pass/user_passwords_60000.txt
MAC Comparison (-m): Yes

Iter: 1, num passwords read: 60000
Kernel execution:
Effective passwords: 60000
Passwords Range:
npknpByH7N2m3OnLNH1X9DJxLrzIFWk
.....
dL_7uuf3QCz-c6K3xDu0

================================================
Bitcracker attack completed
Total passwords evaluated: 60000
Password not found!

time to subtract from total: 0.0153835 s
bitcracker - total time for whole calculation: 35.6682 s

Velocity-Bench CudaSift

Environment Variables:

Command:

/home/test-user/bench_workdir/cudaSift/cudaSift

Output:

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1227 1260 33.3152% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1229 1263 33.3695% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1234 1272 33.5053% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1231 1267 33.4238% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1218 1258 33.0709% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1048 1260 28.4551% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1168 1262 31.7133% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1225 1265 33.2609% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1232 1266 33.451% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1232 1267 33.451% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1236 1270 33.5596% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1219 1256 33.098% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1235 1271 33.5324% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1231 1264 33.4238% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1222 1256 33.1795% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1070 1263 29.0524% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1101 1253 29.8941% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1233 1272 33.4781% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1240 1279 33.6682% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1112 1265 30.1928% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1230 1266 33.3967% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1235 1268 33.5324% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1091 1264 29.6226% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1221 1253 33.1523% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1227 1259 33.3152% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1159 1268 31.4689% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1235 1269 33.5324% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1234 1268 33.5053% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1237 1274 33.5868% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1233 1270 33.4781% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1233 1270 33.4781% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1155 1261 31.3603% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1064 1270 28.8895% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1226 1276 33.2881% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1096 1270 29.7583% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1230 1269 33.3967% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1230 1269 33.3967% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1233 1268 33.4781% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1089 1259 29.5683% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1181 1258 32.0662% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1231 1274 33.4238% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1221 1256 33.1523% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1225 1259 33.2609% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1232 1266 33.451% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1110 1264 30.1385% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1222 1257 33.1795% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1227 1267 33.3152% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1226 1259 33.2881% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1243 1277 33.7497% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1229 1264 33.3695% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Avg workload time = 280.091 ms

Velocity-Bench Easywave

Environment Variables:

Command:

/home/test-user/bench_workdir/easywave/easyWave_sycl -grid /home/test-user/bench_workdir/data/easywave/examples/e2Asean.grd -source /home/test-user/bench_workdir/data/easywave/examples/BengkuluSept2007.flt -time 120

Output:

MAIN: Starting SYCL main program
MAIN: Attempting to clean up previous eWave tsunami files
MAIN: Clean up completed
SYCL: SYCL Queue initialization successful
SYCL: Using SYCL device : Intel(R) Data Center GPU Max 1100 (Driver version 1.3.29735+27)
SYCL: Platform : Intel(R) oneAPI Unified Runtime over Level-Zero
MAIN: Program successfully completed

Velocity-Bench QuickSilver

Environment Variables:

QS_DEVICE=GPU

Command:

/home/test-user/bench_workdir/QuickSilver/qs -i /home/test-user/bench_workdir/velocity-bench-repo/QuickSilver/Examples/AllScattering/scatteringOnly.inp

Output:

Copyright (c) 2016
Lawrence Livermore National Security, LLC
All Rights Reserved
Quicksilver Version :
Quicksilver Git Hash :
MPI Version : 3.0
Number of MPI ranks : 1
Number of OpenMP Threads: 1
Number of OpenMP CPUs : 1

Loading params
Finished loading params
Simulation:
dt: 1e-08
fMax: 0.1
inputFile: /home/test-user/bench_workdir/velocity-bench-repo/QuickSilver/Examples/AllScattering/scatteringOnly.inp
energySpectrum:
boundaryCondition: octant
loadBalance: 1
cycleTimers: 0
debugThreads: 0
lx: 100
ly: 100
lz: 100
nParticles: 10000000
batchSize: 0
nBatches: 10
nSteps: 10
nx: 10
ny: 10
nz: 10
seed: 1029384756
xDom: 0
yDom: 0
zDom: 0
eMax: 20
eMin: 1e-09
nGroups: 230
lowWeightCutoff: 0.001
bTally: 1
fTally: 1
cTally: 1
coralBenchmark: 0
crossSectionsOut:

Geometry:
material: sourceMaterial
shape: brick
xMax: 100
xMin: 0
yMax: 100
yMin: 0
zMax: 100
zMin: 0

Material:
name: sourceMaterial
mass: 1000
nIsotopes: 10
nReactions: 9
sourceRate: 1e+10
totalCrossSection: 0.1
absorptionCrossSection: flat
fissionCrossSection: flat
scatteringCrossSection: flat
absorptionCrossSectionRatio: 0
fissionCrossSectionRatio: 0
scatteringCrossSectionRatio: 1

CrossSection:
name: flat
A: 0
B: 0
C: 0
D: 0
E: 1
nuBar: 2.4
setting GPU
setting parameters
Building partition 0
Building partition 1
Building partition 2
Building partition 3
Building MC_Domain 0
Building MC_Domain 1
Building MC_Domain 2
Building MC_Domain 3
Starting Consistency Check
Finished Consistency Check
Finished initMesh
Started copyMaterialDatabase_device
Finished copyMaterialDatabase_device
Finished copyNuclearData_device
Finished copyDomainDevice
cycle start source rr split absorb scatter fission produce collisn escape census num_seg scalar_flux cycleInit cycleTracking cycleFinalize
0 0 1000000 0 9000000 0 18533189 0 0 18533189 1151780 8848220 55527935 1.854923e+09 7.278690e-01 8.530250e-01 1.000000e-06
1 8848220 1000000 0 151478 0 34281997 0 0 34281997 1664159 8335539 94633679 5.047651e+09 5.622930e-01 9.899600e-01 0.000000e+00
2 8335539 1000000 0 663717 0 34354432 0 0 34354432 1366771 8632485 95010375 7.705930e+09 5.316640e-01 9.984510e-01 1.000000e-06
3 8632485 1000000 0 367978 0 34302727 0 0 34302727 1242216 8758247 94953591 9.992076e+09 5.844880e-01 1.108128e+00 1.000000e-06
4 8758247 1000000 0 242076 0 34141236 0 0 34141236 1168452 8831871 94599337 1.199834e+10 5.256670e-01 1.041786e+00 0.000000e+00
5 8831871 1000000 0 168070 0 33948724 0 0 33948724 1121156 8878785 94148236 1.377636e+10 5.254230e-01 9.982630e-01 0.000000e+00
6 8878785 1000000 0 120572 0 33760567 0 0 33760567 1089103 8910254 93689264 1.535668e+10 5.256160e-01 9.970410e-01 0.000000e+00
7 8910254 1000000 0 89810 0 33552179 0 0 33552179 1065203 8934861 93216931 1.676993e+10 5.272600e-01 1.036005e+00 0.000000e+00
8 8934861 1000000 0 65491 0 33384605 0 0 33384605 1047720 8952632 92768273 1.804559e+10 5.235320e-01 1.035470e+00 0.000000e+00
9 8952632 1000000 0 47165 0 33198494 0 0 33198494 1033968 8965829 92324678 1.920208e+10 5.217610e-01 9.906470e-01 0.000000e+00

Timer Cumulative Cumulative Cumulative Cumulative Cumulative Cumulative
Name number microSecs microSecs microSecs microSecs Efficiency
of calls min avg max stddev Rating
main 1 1.561e+07 1.561e+07 1.561e+07 0.000e+00 100.00
cycleInit 10 5.556e+06 5.556e+06 5.556e+06 0.000e+00 100.00
cycleTracking 10 1.005e+07 1.005e+07 1.005e+07 0.000e+00 100.00
cycleTracking_Kernel 104 4.939e+06 4.939e+06 4.939e+06 0.000e+00 100.00
cycleTracking_MPI 117 3.068e+05 3.068e+05 3.068e+05 0.000e+00 100.00
cycleTracking_Test_Done 0 0.000e+00 0.000e+00 0.000e+00 0.000e+00 0.00
cycleFinalize 20 8.140e+02 8.140e+02 8.140e+02 0.000e+00 100.00
Figure Of Merit 89.65 [Num Mega Segments / Cycle Tracking Time]

Velocity-Bench Sobel Filter

Environment Variables:

OPENCV_IO_MAX_IMAGE_PIXELS=1677721600

Command:

/home/test-user/bench_workdir/sobel_filter/sobel_filter -i /home/test-user/bench_workdir/data/sobel_filter/sobel_filter_data/silverfalls_32Kx32K.png -n 5

Output:

SYMN: Welcome to the SYCL version of Sobel filter workload.
SYMN: Input image file: /home/test-user/bench_workdir/data/sobel_filter/sobel_filter_data/silverfalls_32Kx32K.png
SYMN: Launching SYCL kernel with # of iterations: 5
time to subtract from total: 15.0229 s
sobelfilter - total time for whole calculation: 0.979897 s

Runtime_BlockedTransform_iter_64_blocksize_256

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/blocked_transform --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/BlockedTransform_multi.csv --size=512

Output:

Runtime_BlockedTransform_iter_256_blocksize_256

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/blocked_transform --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/BlockedTransform_multi.csv --size=512

Output:

Runtime_BlockedTransform_iter_128_blocksize_256

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/blocked_transform --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/BlockedTransform_multi.csv --size=512

Output:

Runtime_BlockedTransform_iter_512_blocksize_256

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/blocked_transform --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/BlockedTransform_multi.csv --size=512

Output:

Runtime_IndependentDAGTaskThroughput_BasicParallelFor

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/dag_task_throughput_independent --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/IndependentDAGTaskThroughput_multi.csv --size=512

Output:

Runtime_IndependentDAGTaskThroughput_HierarchicalParallelFor

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/dag_task_throughput_independent --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/IndependentDAGTaskThroughput_multi.csv --size=512

Output:

Runtime_IndependentDAGTaskThroughput_SingleTask

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/dag_task_throughput_independent --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/IndependentDAGTaskThroughput_multi.csv --size=512

Output:

Runtime_IndependentDAGTaskThroughput_NDRangeParallelFor

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/dag_task_throughput_independent --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/IndependentDAGTaskThroughput_multi.csv --size=512

Output:

Runtime_DAGTaskThroughput_BasicParallelFor

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/dag_task_throughput_sequential --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/DAGTaskThroughput_multi.csv --size=512

Output:

Runtime_DAGTaskThroughput_SingleTask

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/dag_task_throughput_sequential --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/DAGTaskThroughput_multi.csv --size=512

Output:

Runtime_DAGTaskThroughput_NDRangeParallelFor

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/dag_task_throughput_sequential --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/DAGTaskThroughput_multi.csv --size=512

Output:

Runtime_DAGTaskThroughput_HierarchicalParallelFor

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/dag_task_throughput_sequential --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/DAGTaskThroughput_multi.csv --size=512

Output:

MicroBench_LocalMem_fp32_4096

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/local_mem --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/LocalMem_multi.csv --size=512

Output:

MicroBench_LocalMem_int32_4096

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/local_mem --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/LocalMem_multi.csv --size=512

Output:

MicroBench_L2_fp32_2

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/pattern_L2 --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/L2_multi.csv

Output:

MicroBench_L2_fp32_8

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/pattern_L2 --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/L2_multi.csv

Output:

MicroBench_L2_fp32_4

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/pattern_L2 --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/L2_multi.csv

Output:

MicroBench_L2_fp32_16

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/pattern_L2 --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/L2_multi.csv

Output:

MicroBench_L2_int32_2

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/pattern_L2 --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/L2_multi.csv

Output:

MicroBench_L2_int32_16

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/pattern_L2 --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/L2_multi.csv

Output:

MicroBench_L2_int32_1

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/pattern_L2 --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/L2_multi.csv

Output:

MicroBench_L2_int32_4

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/pattern_L2 --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/L2_multi.csv

Output:

MicroBench_L2_fp32_1

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/pattern_L2 --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/L2_multi.csv

Output:

MicroBench_L2_int32_8

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/pattern_L2 --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/L2_multi.csv

Output:

Pattern_Reduction_Hierarchical_int32

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/reduction --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Pattern_Reduction_multi.csv

Output:

Pattern_Reduction_Hierarchical_fp32

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/reduction --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Pattern_Reduction_multi.csv

Output:

Pattern_Reduction_Hierarchical_int64

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/reduction --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Pattern_Reduction_multi.csv

Output:

Pattern_Reduction_NDRange_int32

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/reduction --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Pattern_Reduction_multi.csv

Output:

Pattern_Reduction_NDRange_int64

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/reduction --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Pattern_Reduction_multi.csv

Output:

Pattern_Reduction_NDRange_fp32

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/reduction --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Pattern_Reduction_multi.csv

Output:

ScalarProduct_Hierarchical_int64

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/scalar_prod --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/ScalarProduct_multi.csv

Output:

ScalarProduct_NDRange_int64

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/scalar_prod --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/ScalarProduct_multi.csv

Output:

ScalarProduct_NDRange_int32

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/scalar_prod --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/ScalarProduct_multi.csv

Output:

ScalarProduct_Hierarchical_int32

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/scalar_prod --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/ScalarProduct_multi.csv

Output:

ScalarProduct_Hierarchical_fp32

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/scalar_prod --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/ScalarProduct_multi.csv

Output:

ScalarProduct_NDRange_fp32

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/scalar_prod --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/ScalarProduct_multi.csv

Output:

Pattern_SegmentedReduction_NDRange_fp32

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/segmentedreduction --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Pattern_SegmentedReduction_multi.csv

Output:

Pattern_SegmentedReduction_NDRange_int16

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/segmentedreduction --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Pattern_SegmentedReduction_multi.csv

Output:

Pattern_SegmentedReduction_NDRange_int32

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/segmentedreduction --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Pattern_SegmentedReduction_multi.csv

Output:

Pattern_SegmentedReduction_Hierarchical_int64

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/segmentedreduction --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Pattern_SegmentedReduction_multi.csv

Output:

Pattern_SegmentedReduction_Hierarchical_int16

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/segmentedreduction --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Pattern_SegmentedReduction_multi.csv

Output:

Pattern_SegmentedReduction_Hierarchical_int32

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/segmentedreduction --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Pattern_SegmentedReduction_multi.csv

Output:

Pattern_SegmentedReduction_Hierarchical_fp32

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/segmentedreduction --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Pattern_SegmentedReduction_multi.csv

Output:

Pattern_SegmentedReduction_NDRange_int64

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/segmentedreduction --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Pattern_SegmentedReduction_multi.csv

Output:

SYCL2020_Accessors_Latency_fp32_out_of_order__

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/usm_accessors_latency --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/USM_Latency_multi.csv

Output:

SYCL2020_Accessors_Latency_fp32_in_order__

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/usm_accessors_latency --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/USM_Latency_multi.csv

Output:

USM_Latency_fp32_in_order__

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/usm_accessors_latency --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/USM_Latency_multi.csv

Output:

USM_Latency_fp32_out_of_order__

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/usm_accessors_latency --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/USM_Latency_multi.csv

Output:

USM_Allocation_latency_fp32_device

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/usm_allocation_latency --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/USM_Allocation_latency_multi.csv

Output:

USM_Allocation_latency_fp32_host

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/usm_allocation_latency --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/USM_Allocation_latency_multi.csv

Output:

USM_Allocation_latency_fp32_shared

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/usm_allocation_latency --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/USM_Allocation_latency_multi.csv

Output:

USM_Instr_Mix_fp32_shared_1:1mix_with_init_with_prefetch

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/usm_instr_mix --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/USM_Instr_Mix_multi.csv

Output:

USM_Instr_Mix_fp32_shared_1:1mix_no_init_with_prefetch

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/usm_instr_mix --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/USM_Instr_Mix_multi.csv

Output:

USM_Instr_Mix_fp32_host_1:1mix_no_init_no_prefetch

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/usm_instr_mix --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/USM_Instr_Mix_multi.csv

Output:

USM_Instr_Mix_fp32_device_1:1mix_with_init_no_prefetch

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/usm_instr_mix --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/USM_Instr_Mix_multi.csv

Output:

USM_Instr_Mix_fp32_device_1:1mix_no_init_no_prefetch

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/usm_instr_mix --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/USM_Instr_Mix_multi.csv

Output:

USM_Instr_Mix_fp32_host_1:1mix_with_init_no_prefetch

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/usm_instr_mix --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/USM_Instr_Mix_multi.csv

Output:

USM_Instr_Mix_fp32_shared_1:1mix_with_init_no_prefetch

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/usm_instr_mix --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/USM_Instr_Mix_multi.csv

Output:

USM_Instr_Mix_fp32_shared_1:1mix_no_init_no_prefetch

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/usm_instr_mix --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/USM_Instr_Mix_multi.csv

Output:

USM_Pinned_Overhead_fp32_HostDevice_NonPinned_Init_1

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/usm_pinned_overhead --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/USM_Pinned_Overhead_multi.csv

Output:

USM_Pinned_Overhead_fp32_DeviceHost_NonPinned_Init_1

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/usm_pinned_overhead --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/USM_Pinned_Overhead_multi.csv

Output:

USM_Pinned_Overhead_fp32_DeviceHost_Pinned_Init_1

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/usm_pinned_overhead --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/USM_Pinned_Overhead_multi.csv

Output:

USM_Pinned_Overhead_fp32_HostDevice_Pinned_Init_1

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/usm_pinned_overhead --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/USM_Pinned_Overhead_multi.csv

Output:

VectorAddition_int64

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/vec_add --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/VectorAddition_multi.csv

Output:

VectorAddition_fp32

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/vec_add --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/VectorAddition_multi.csv

Output:

VectorAddition_int32

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/vec_add --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/VectorAddition_multi.csv

Output:

Polybench_2DConvolution

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/2DConvolution --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/2DConvolution.csv

Output:

Polybench_2mm

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/2mm --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/2mm.csv --size=512

Output:

Polybench_3mm

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/3mm --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/3mm.csv --size=512

Output:

MicroBench_Arith_fp32_512

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/arith --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Arith_int32_512.csv --size=16384

Output:

MicroBench_Arith_int32_512

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/arith --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Arith_int32_512.csv --size=16384

Output:

Polybench_Atax

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/atax --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Atax.csv --size=8192

Output:

ReductionAtomic_fp32

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/atomic_reduction --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/ReductionAtomic_fp64.csv

Output:

ReductionAtomic_int64

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/atomic_reduction --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/ReductionAtomic_fp64.csv

Output:

ReductionAtomic_int32

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/atomic_reduction --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/ReductionAtomic_fp64.csv

Output:

ReductionAtomic_fp64

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/atomic_reduction --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/ReductionAtomic_fp64.csv

Output:

Polybench_Bicg

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/bicg --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Bicg.csv --size=8192

Output:

Polybench_Correlation

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/correlation --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Correlation.csv --size=512

Output:

Polybench_Covariance

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/covariance --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Covariance.csv --size=512

Output:

Polybench_Gemm

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/gemm --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Gemm.csv --size=1024

Output:

Polybench_Gesummv

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/gesummv --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Gesummv.csv --size=8192

Output:

Polybench_Gramschmidt

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/gramschmidt --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Gramschmidt.csv --size=512

Output:

Kmeans_fp32

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/kmeans --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Kmeans.csv --size=67108864

Output:

LinearRegressionCoeff_fp32

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/lin_reg_coeff --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/LinearRegressionCoeff.csv

Output:

LinearRegression_fp32

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/lin_reg_error --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/LinearRegression.csv

Output:

MatmulChain

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/matmulchain --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/MatmulChain.csv --size=1024

Output:

MolecularDynamics

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/mol_dyn --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/MolecularDynamics.csv

Output:

Polybench_Mvt

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/mvt --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Mvt.csv --size=16384

Output:

MicroBench_sf_fp32_16

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/sf --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/sf_16.csv --size=--size=100000000

Output:

Polybench_Syr2k

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/syr2k --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Syr2k.csv --size=1024

Output:

Polybench_Syrk

Environment Variables:

Command:

/home/test-user/bench_workdir/sycl-bench-build/syrk --warmup-run --num-runs=3 --output=/home/test-user/bench_workdir/Syrk.csv --size=1024

Output:

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
ci/cd Continuous integration/devliery
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants