Conditional loading of CUDA libaries #2246

jkarthic · 2024-06-18T19:11:05Z

jkarthic
Jun 18, 2024

Right now if we build whisper with CUDA enabled, it is impossible to run the same binary on machines without nvidia GPU for CPU-only execution. For applications shipping whisper to consumer devices, this is a hassle as the apps have to ship two versions of whisper binaries with CUDA and without CUDA. This is due to the fact that CUDA libraries are linked dynamically, instead of loading conditionally(using dlopen on Linux or LoadLibrary on Windows) based on the presence of Nvidia GPU.
Is it possible to move to conditional loading of CUDA libraries and using function pointers for invoking CUDA functions? This will make the compiled binaries portable across multiple hardwares.

ggerganov · 2024-06-19T09:48:43Z

ggerganov
Jun 19, 2024
Maintainer

We are aware of this limitation and will work towards supporting runtime CPU detection and dynamic loading of the backends. No ETA for now - it might take some time

1 reply

jkarthic Jun 19, 2024
Author

Thanks for the quick and clear reply!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Conditional loading of CUDA libaries #2246

{{title}}

Replies: 1 comment 1 reply

{{title}}

{{title}}

Select a reply

Conditional loading of CUDA libaries #2246

jkarthic Jun 18, 2024

Replies: 1 comment · 1 reply

ggerganov Jun 19, 2024 Maintainer

jkarthic Jun 19, 2024 Author

jkarthic
Jun 18, 2024

Replies: 1 comment 1 reply

ggerganov
Jun 19, 2024
Maintainer

jkarthic Jun 19, 2024
Author