Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

error in modulated_deformable_im2col_cuda: no kernel image is available for execution on the device #29

Open
desires19 opened this issue Dec 8, 2021 · 6 comments

Comments

@desires19
Copy link

Hello, my bro
I have met a problem,,when I reproduced the excellent research, DCPose
python:3.6.12
cuda:11.0
GTX3080

error in modulated_deformable_im2col_cuda: no kernel image is available for execution on the device
error in modulated_deformable_col2im_coord_cuda: no kernel image is available for execution on the device

@Whj-cv
Copy link

Whj-cv commented Dec 12, 2021

I have the same problem. Have you solved it?

@chenhaomingbob
Copy link
Member

Hi, @desires19 @Whj-cv
Thank you for your interest in our work.
We develop DCPose using CUDA 10.0 and do not test in the environment of CUDA 11.0. Maybe you can try to run DCPose in the environment of CUDA 10.0.
Hope this will solve this problem.

@desires19
Copy link
Author

I have sloved it by changing GTX 1080Ti, cuda 10.1, pytorch 1.6.0

@peteruhrig
Copy link

We develop DCPose using CUDA 10.0 and do not test in the environment of CUDA 11.0. Maybe you can try to run DCPose in the environment of CUDA 10.0. Hope this will solve this problem.

@chenhaomingbob
All recent Nvidia cards using the Ampere Architecture (RTX 3060/3070/3080, A40, A100, and many more; full list here) require CUDA 11.1. I fear that staying with CUDA 10.0 will seriously reduce the adoption of DCPose by researchers.

@peteruhrig
Copy link

@chenhaomingbob
Compatibility with CUDA 11 and the corresponding Pytorch 1.11 (with CUDA 11 support) only requires minimal changes (use torch_check instead of at_check). See here:
#34 (comment)

@peteruhrig
Copy link

I have to add here that we SOMETIMES run into this problem: When processing a video with roughly 36,000 frames, only around 33,000 are actually processed. For the others, we get the error message:

error in modulated_deformable_im2col_cuda: no kernel image is available for execution on the device

This is rather surprising, because there is definitely a kernel image available for most of the frames, so at the moment we think that this error message is misleading. I appreciate any help with this!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants