Skip to content

Commit

Permalink
change naming
Browse files Browse the repository at this point in the history
  • Loading branch information
SunMarc committed Aug 9, 2023
1 parent 744c249 commit 66d7104
Show file tree
Hide file tree
Showing 3 changed files with 3 additions and 2 deletions.
4 changes: 2 additions & 2 deletions docs/source/_toctree.yml
Original file line number Diff line number Diff line change
Expand Up @@ -126,9 +126,9 @@
title: BetterTransformer
isExpanded: false
- sections:
- local: optimization_toolbox/usage_guides/quantization
- local: llm_quantization/usage_guides/quantization
title: GPTQ quantization
title: Optimization toolbox
title: LLM quantization
isExpanded: false
- sections:
- local: utils/dummy_input_generators
Expand Down
1 change: 1 addition & 0 deletions docs/source/concept_guides/quantization.mdx
Original file line number Diff line number Diff line change
Expand Up @@ -185,6 +185,7 @@ models while respecting accuracy and latency constraints.
[PyTorch quantization functions](https://pytorch.org/docs/stable/quantization-support.html#torch-quantization-quantize-fx)
to allow graph-mode quantization of 🤗 Transformers models in PyTorch. This is a lower-level API compared to the two
mentioned above, giving more flexibility, but requiring more work on your end.
- The `optimum.llm_quantization` package allows to [quantize and run LLM models](https://huggingface.co/docs/optimum/llm_quantization/usage_guides/quantization)

## Going further: How do machines represent numbers?

Expand Down

0 comments on commit 66d7104

Please sign in to comment.