[JS/WebGPU] Add GatherBlockQuantized op support #21734

Merged
satyajandhyala merged 48 commits into main from sajandhy/webgpu_add_block_quantized_gather on Aug 26, 2024

Conversation

Contributor

@satyajandhyala satyajandhyala commented Aug 14, 2024

Description

Add GatherBlockQuantized operator to JSEP.
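
For readers unfamiliar with the operator, here is a minimal CPU-side sketch of what a block-quantized gather computes: it selects rows by index and dequantizes each element with the scale and zero point of its block. This is not the WebGPU shader from this PR. The function name, the 2-D row-major layout, gathering along axis 0, quantizing along the last axis, already-unpacked integer inputs, and treating a missing zero point as 0 are all assumptions made for illustration.

```typescript
// Illustrative reference only; hypothetical helper, not code from this PR.
// Assumptions: `data` is a row-major [rows, cols] matrix of already-unpacked
// quantized integers, quantization runs along the last axis in blocks of
// `blockSize`, and rows are gathered along axis 0.
function gatherBlockQuantizedSketch(
  data: number[],
  rows: number,
  cols: number,
  indices: number[],
  scales: number[], // shape [rows, ceil(cols / blockSize)]
  zeroPoints: number[] | null, // same shape as scales; null is treated as 0 here
  blockSize: number,
): number[] {
  const blocksPerRow = Math.ceil(cols / blockSize);
  const out: number[] = [];
  for (const rawIndex of indices) {
    // Negative indices count from the end (an assumption mirroring ONNX Gather).
    const row = rawIndex < 0 ? rawIndex + rows : rawIndex;
    if (row < 0 || row >= rows) {
      throw new RangeError(`index ${rawIndex} out of range for ${rows} rows`);
    }
    for (let c = 0; c < cols; c++) {
      // Every element in the same block shares one scale and one zero point.
      const block = row * blocksPerRow + Math.floor(c / blockSize);
      const zp = zeroPoints ? zeroPoints[block] : 0;
      out.push((data[row * cols + c] - zp) * scales[block]);
    }
  }
  return out;
}
```

The result has shape [indices.length, cols]. Only the gathered rows are dequantized, so the full table can stay in its quantized form.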

Motivation and Context

The Gemma model requires this operator.

@satyajandhyala satyajandhyala added the ep:WebGPU ort-web webgpu provider label Aug 14, 2024
@satyajandhyala satyajandhyala marked this pull request as ready for review August 14, 2024 13:06
Contributor

@guschmue guschmue left a comment

Works for me. There is some weird issue when using this with gemma-2-2b, but I don't think it is related to this PR.

@satyajandhyala satyajandhyala merged commit af18824 into main Aug 26, 2024
97 checks passed
@satyajandhyala satyajandhyala deleted the sajandhy/webgpu_add_block_quantized_gather branch August 26, 2024 21:46