CUDA version and updated installation instructions #785

sidharthrajaram · 2024-04-09T23:51:25Z

Edits to the README regarding installation/CUDA based on discussion on these issues: #783 and #717

Updated libraries to install for CUDA 12 (because of latest ctranslate2 support for only CUDA 12)
Added note regarding CUDA 11 support.

jimydavis · 2024-04-10T01:35:37Z

pip install nvidia-cudnn-cu12 needs to be pinned to either

~=8.9
^=8.9

because in v9, libcudnn_ops_infer.so.8 seems to be replaced with libcudnn_ops.so.9 amidst other changes. I am not familiar enough with nvidia's practice on whether it should be a tilde or a caret.

For cublas, at least as of commit 91c8307, I have tested that nvidia-cublas-cu12==12.4.5.8 is fully working.

README.md

Purfview · 2024-04-10T04:21:51Z

Btw, @nguyendc-systran mentioned that they will keep the support for CUDA 11 for a while, maybe they dropped that idea:

OpenNMT/CTranslate2#1590 (comment)

minhthuc2502 · 2024-04-10T16:04:23Z

We tried to support CUDA 12 and 11 but releasing 2 versions in parallel quite complicated to maintain. In the end, we decided to only support CUDA 12 but the Ctranslate2 source can always build with CUDA 11.

README.md

Purfview

Looks good.

bil-ash · 2024-04-11T02:55:41Z

May be alongside update ctranslate2 dependency to latest 4.2.0 because it supports flash attention as well as performance improvements for quantized models on CPU.

sidharthrajaram · 2024-04-11T05:38:57Z

May be alongside update ctranslate2 dependency to latest 4.2.0 because it supports flash attention as well as performance improvements for quantized models on CPU.

@bil-ash this PR mainly contains updates to the installation instructions due to lack of CUDA 11 support in the latest versions of ctranslate2. Upgrading the ctranslate2 dependency would be beyond the scope of this particular PR I think.

sidharthrajaram · 2024-04-19T18:15:01Z

Is this good to go, @Purfview ?

Purfview · 2024-04-19T18:18:48Z

@sidharthrajaram FYI, I'm not a maintainer of this repo.

CUDA version note and updated instructions in README

c75e602

Purfview reviewed Apr 10, 2024

View reviewed changes

README.md Outdated Show resolved Hide resolved

ctranslate2 downgrade note, cuDNN v9 consideration

4adf102

sidharthrajaram requested a review from Purfview April 10, 2024 19:10

Purfview reviewed Apr 10, 2024

View reviewed changes

README.md Outdated Show resolved Hide resolved

Purfview reviewed Apr 10, 2024

View reviewed changes

README.md Outdated Show resolved Hide resolved

clearer note on cuDNN v9 package

0651ccf

sidharthrajaram requested a review from Purfview April 10, 2024 21:21

Purfview reviewed Apr 10, 2024

View reviewed changes

regularfry mentioned this pull request Apr 17, 2024

Logging fixes ufal/whisper_streaming#80

Merged

trungkienbkhn merged commit 3d1de60 into SYSTRAN:master May 4, 2024
3 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

CUDA version and updated installation instructions #785

CUDA version and updated installation instructions #785

sidharthrajaram commented Apr 9, 2024

jimydavis commented Apr 10, 2024

Purfview commented Apr 10, 2024

minhthuc2502 commented Apr 10, 2024

Purfview left a comment

bil-ash commented Apr 11, 2024

sidharthrajaram commented Apr 11, 2024

sidharthrajaram commented Apr 19, 2024

Purfview commented Apr 19, 2024

CUDA version and updated installation instructions #785

CUDA version and updated installation instructions #785

Conversation

sidharthrajaram commented Apr 9, 2024

jimydavis commented Apr 10, 2024

Purfview commented Apr 10, 2024

minhthuc2502 commented Apr 10, 2024

Purfview left a comment

Choose a reason for hiding this comment

bil-ash commented Apr 11, 2024

sidharthrajaram commented Apr 11, 2024

sidharthrajaram commented Apr 19, 2024

Purfview commented Apr 19, 2024