Batched operations for Python bindings? #384

Open
ethanhs opened this issue May 4, 2020 · 3 comments
ethanhs commented May 4, 2020

Hi! Really neat stuff here, I'm curious as to whether it would be possible to expose the batched gemm/axpy operations to Python. I'm hoping to use them in a project I'm working on.

Thanks for the awesome project!

Owner

CNugteren commented May 6, 2020

I checked, and indeed none of the special (non-standard BLAS) routines are in pyclblast. I see no reason why they shouldn't be, other than that it takes a little bit of work. I'll look at this in the coming weeks when I have time.

Author

ethanhs commented May 6, 2020

Thank you! I realized that I will probably need to use the OpenCL kernels directly after all for my particular use case, so don't feel any pressure.

Owner

CNugteren commented May 10, 2020

Code is updated (see #386), and an example is included for batched-AXPY.

However, I'm having trouble updating the official PyCLBlast package because of an issue with dependencies and the way I've built them locally: following this doesn't work, since it results in the legacy linux_x86_64 tag, but I believe we need a manylinux tag, see e.g. here. If someone is willing to do that for me, please go ahead. In the meantime I have attached a Python 3.7 Linux wheel here, or you can of course check out the source and build it yourself if the wheel is not compatible.
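
For readers wondering what the batched AXPY discussed above computes, here is a minimal NumPy reference sketch. It is not the pyclblast API and does not call CLBlast at all: the function name `axpy_batched_reference` and its parameter layout are assumptions made for illustration, loosely modelled on an offset-based batched interface in which every batch has its own scalar and its own start offsets into flat `x` and `y` buffers.

```python
import numpy as np


def axpy_batched_reference(n, alphas, x, x_offsets, y, y_offsets, x_inc=1, y_inc=1):
    """Hypothetical CPU reference: y_b <- alphas[b] * x_b + y_b for every batch b.

    x and y are flat 1-D NumPy arrays; each batch reads n elements starting at
    its own offset, with strides x_inc and y_inc. y is updated in place.
    """
    if not (len(alphas) == len(x_offsets) == len(y_offsets)):
        raise ValueError("alphas, x_offsets and y_offsets must have equal length")
    for alpha, x_off, y_off in zip(alphas, x_offsets, y_offsets):
        xs = x[x_off : x_off + n * x_inc : x_inc]  # view of this batch's x vector
        ys = y[y_off : y_off + n * y_inc : y_inc]  # view into y, so += writes through
        if xs.size != n or ys.size != n:
            raise ValueError("offset, increment and n exceed the buffer size")
        ys += alpha * xs
    return y


# Two batches of length 3, stored back to back in flat buffers.
x = np.arange(6, dtype=np.float32)   # [0, 1, 2, 3, 4, 5]
y = np.ones(6, dtype=np.float32)     # [1, 1, 1, 1, 1, 1]
result = axpy_batched_reference(
    n=3, alphas=[2.0, -1.0], x=x, x_offsets=[0, 3], y=y, y_offsets=[0, 3]
)
print(result)
```

A reference like this can be handy for checking the output of a GPU batched routine against a known-good CPU result on small inputs.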
