Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Dev] Fix a but within FP8 E4M3 Fast Decoding #54

Merged
merged 29 commits into from
Jun 6, 2024

Conversation

LeiWang1999
Copy link
Contributor

This pull request primarily focuses on enhancing the functionality of the bitblas Python package and updating the version number. The main changes include the addition of MatmulConfigWithSplitK and MatmulWithSplitK in the bitblas module, updates to the gemv and gemv_dequantize modules to support more iterations, and modifications to the quantization module for better handling of floating point numbers. The version number has also been updated from 0.0.1.dev9 to 0.0.1.dev12.

Version Update:

  • VERSION and python/bitblas/__init__.py: Updated the version number from 0.0.1.dev9 to 0.0.1.dev12. [1] [2]

Enhancements to bitblas module:

Updates to gemv and gemv_dequantize modules:

Modifications to quantization module:

Other Changes:

LeiWang199 and others added 29 commits May 21, 2024 11:51
@LeiWang1999 LeiWang1999 merged commit c090df6 into microsoft:main Jun 6, 2024
3 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant