Add more quantization support for burn-jit #2275

Merged: 21 commits into main from feat/jit/quantize on Sep 17, 2024
Conversation

@laggui (Member) commented on Sep 12, 2024

Checklist

  • Confirmed that the run-checks all script has been executed.

Changes

Quantization support for burn-jit.

  • Completed the QJitTensor implementation with contained qparams
    • Multiple quantized values are packed into a u32 tensor representation
  • Added quantize_per_tensor and dequantize_per_tensor cube kernels
    • Handles affine/symmetric quantization with different vectorization factors based on the input tensor size (see the sketch after this list)
  • Added from_data and into_data conversions
  • Added unit tests

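To make the packed u32 representation and the affine scheme above concrete, here is a minimal CPU-side sketch in plain Rust. It is not the actual CubeCL cube kernel and ignores vectorization factors; the names (AffineQParams, pack_i8x4, unpack_i8x4) and the example qparams are hypothetical and only illustrate the arithmetic.

```rust
// Minimal CPU-side sketch of affine per-tensor quantization with four i8 values
// packed into a single u32 word. Plain Rust for illustration only, not the
// CubeCL cube kernel added in this PR; all names and qparams are hypothetical.

/// Affine quantization parameters: value ≈ (q - offset) * scale.
/// Symmetric quantization corresponds to offset = 0.
struct AffineQParams {
    scale: f32,
    offset: i32,
}

/// Quantize one f32 value to i8 using the affine scheme.
fn quantize_affine(value: f32, qparams: &AffineQParams) -> i8 {
    let q = (value / qparams.scale).round() as i32 + qparams.offset;
    q.clamp(i8::MIN as i32, i8::MAX as i32) as i8
}

/// Dequantize one i8 value back to f32.
fn dequantize_affine(q: i8, qparams: &AffineQParams) -> f32 {
    (q as i32 - qparams.offset) as f32 * qparams.scale
}

/// Pack four i8 values into a u32 (little-endian byte order).
fn pack_i8x4(values: [i8; 4]) -> u32 {
    u32::from_le_bytes(values.map(|v| v as u8))
}

/// Unpack a u32 back into four i8 values.
fn unpack_i8x4(packed: u32) -> [i8; 4] {
    packed.to_le_bytes().map(|b| b as i8)
}

fn main() {
    let qparams = AffineQParams { scale: 0.05, offset: -10 };
    let input = [-1.8_f32, 0.0, 0.4, 1.2];

    // Quantize, pack into one u32, then unpack and dequantize (round trip).
    let quantized = input.map(|v| quantize_affine(v, &qparams));
    let packed = pack_i8x4(quantized);
    let restored = unpack_i8x4(packed).map(|q| dequantize_affine(q, &qparams));

    println!("quantized: {quantized:?}, packed: {packed:#010x}, restored: {restored:?}");
}
```
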
Testing

Unit tests.

codecov bot commented Sep 12, 2024

Codecov Report

Attention: Patch coverage is 60.58520% with 229 lines in your changes missing coverage. Please review.

Project coverage is 85.79%. Comparing base (58ce502) to head (d46076e).
Report is 13 commits behind head on main.

Files with missing lines Patch % Lines
...rates/burn-jit/src/kernel/quantization/quantize.rs 44.79% 106 Missing ⚠️
...tes/burn-jit/src/kernel/quantization/dequantize.rs 45.40% 101 Missing ⚠️
crates/burn-jit/src/ops/qtensor.rs 76.66% 21 Missing ⚠️
crates/burn-tensor/src/tensor/data.rs 83.33% 1 Missing ⚠️
Additional details and impacted files
@@            Coverage Diff             @@
##             main    #2275      +/-   ##
==========================================
- Coverage   85.92%   85.79%   -0.13%     
==========================================
  Files         750      754       +4     
  Lines       94328    95189     +861     
==========================================
+ Hits        81047    81671     +624     
- Misses      13281    13518     +237     


@laggui (Member, Author) commented:
Had to change the equality assertion for quantized values: on macOS some tests produced floating-point values that were very close, but the exact assertion on the quantization parameter values still failed.
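For context, the kind of tolerance-based check this refers to could look like the sketch below. It is illustrative only, not the actual test utility: it compares dequantized outputs within a small absolute tolerance instead of requiring bit-identical results across platforms.

```rust
/// Illustrative helper (not the actual test utility): assert that two slices of
/// dequantized values match within an absolute tolerance, rather than requiring
/// exact equality of the quantization parameters across platforms.
fn assert_approx_eq(lhs: &[f32], rhs: &[f32], tolerance: f32) {
    assert_eq!(lhs.len(), rhs.len(), "length mismatch");
    for (i, (a, b)) in lhs.iter().zip(rhs.iter()).enumerate() {
        assert!(
            (a - b).abs() <= tolerance,
            "values differ at index {i}: {a} vs {b} (tolerance {tolerance})"
        );
    }
}
```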

@nathanielsimard (Member) left a comment:
LGTM

@laggui merged commit aa79e36 into main on Sep 17, 2024 (11 checks passed)
@laggui deleted the feat/jit/quantize branch on September 17, 2024 at 14:08
3 participants