I've compiled main with Vulkan support.
Running on Windows with
.\main.exe \path\to\audio.wav -m .\models\ggml-base.bin --processors 2
crashes. The same command seems to work fine with both the CUDA and Metal backends.
In a debug build I get a failed assertion on _CrtIsValidHeapPointer(block).
When running in a debugger I get various call stacks such as the following two:
I also seem to get random hangs in vkWaitForFences.
If the Vulkan backend doesn't support multiple processors then that should be documented somewhere. It would be unfortunate if that is the case, though: in my testing Vulkan is a lot slower than CUDA on the same GPU and doesn't max out the GPU, so allowing multiple processors could potentially improve performance. When using multiple processors with CUDA I can transcribe 10 minutes of audio in approximately 10 seconds (approximately 20 seconds with 1 processor). With Vulkan it takes approximately 45 seconds with a single processor.
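To put those timings in perspective, here is a rough sketch of the arithmetic, expressed as real-time factors (audio duration divided by wall-clock time). The numbers are the approximate ones quoted above. The dictionary labels and the `realtime_factor` helper are names made up for this illustration, and the CUDA processor count is left unspecified because only "multiple" was measured.

```python
# Approximate timings quoted above for a 10-minute clip (not precise benchmarks).
AUDIO_SECONDS = 10 * 60

reported_wall_seconds = {
    "CUDA, multiple processors": 10.0,
    "CUDA, 1 processor": 20.0,
    "Vulkan, 1 processor": 45.0,
}


def realtime_factor(audio_seconds: float, wall_seconds: float) -> float:
    """Hypothetical helper: how many seconds of audio are processed per second."""
    return audio_seconds / wall_seconds


factors = {
    name: realtime_factor(AUDIO_SECONDS, wall)
    for name, wall in reported_wall_seconds.items()
}

# How much slower single-processor Vulkan is than single-processor CUDA.
vulkan_vs_cuda_single = (
    reported_wall_seconds["Vulkan, 1 processor"]
    / reported_wall_seconds["CUDA, 1 processor"]
)

for name, factor in factors.items():
    print(f"{name}: {factor:.1f}x real time")
print(f"Vulkan vs CUDA (1 processor each): {vulkan_vs_cuda_single:.2f}x slower")
```

On these figures, single-processor Vulkan is a little over twice as slow as single-processor CUDA, which is why working multi-processor support on Vulkan would matter.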