Replies: 1 comment 1 reply
-
We are aware of this limitation and will work towards supporting runtime CPU detection and dynamic loading of the backends. No ETA for now - it might take some time |
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Right now if we build whisper with CUDA enabled, it is impossible to run the same binary on machines without nvidia GPU for CPU-only execution. For applications shipping whisper to consumer devices, this is a hassle as the apps have to ship two versions of whisper binaries with CUDA and without CUDA. This is due to the fact that CUDA libraries are linked dynamically, instead of loading conditionally(using
dlopen
on Linux orLoadLibrary
on Windows) based on the presence of Nvidia GPU.Is it possible to move to conditional loading of CUDA libraries and using function pointers for invoking CUDA functions? This will make the compiled binaries portable across multiple hardwares.
Beta Was this translation helpful? Give feedback.
All reactions