documentation update for splade++ huggingface -> onnx #2556

AndreSlavescu · 2024-07-31T19:14:07Z

Updated documentation for up to date e2e splade conversion to onnx (huggingface -> onnx)

lintool · 2024-07-31T19:15:19Z

@cadurosar can you please take a look at this?

carlos-lassance

This looks very good, just a few comments

docs/onnx-conversion.md

carlos-lassance · 2024-08-01T08:24:42Z

docs/onnx-conversion.md

+
+---
+
+Another important component is being able to specify the dimensions of the input and output tensors. This is achieved by the following code:


I would say more that we are giving the dynamic axis names here and not specifying the dimensions

docs/onnx-conversion.md

carlos-lassance · 2024-08-01T08:28:05Z

docs/onnx-conversion.md

+cd src/main/python/onnx
+# Now run the script
+python3 run_onnx_model_inference.py --model_path models/splade-cocondenser-ensembledistil-optimized.onnx \
+                                    --model_name naver/splade-cocondenser-ensembledistil


maybe an explanation here that we get the model nome to use the tokenizer?
Another possibility is using the model name to load the hf version and then do a comparison between both to check that things are ok (always reassuring to someone using this for the first time)

Sounds good. I can add more details for this

lintool · 2024-08-01T11:47:56Z

@AndreSlavescu Can you also add a blurb on how to test the newly generated SPLADE++ ED model e2e? E.g., point to the repro, "swap in" the new model in ~/.cache/, etc.

documentation update for splade++ huggingface -> onnx

c457e61

lintool requested a review from cadurosar July 31, 2024 19:15

AndreSlavescu added 4 commits July 31, 2024 16:39

scripts for converting hf models easily to onnx

ca3a0b7

makdedir for models

f6246e7

updated docs

ab0614b

inference docs

485ea47

carlos-lassance suggested changes Aug 1, 2024

View reviewed changes

AndreSlavescu added 2 commits August 1, 2024 13:03

update docs + change to float(0.0) for thresholding

5a391c7

added final section for splade regressions

60e1197

lintool approved these changes Aug 2, 2024

View reviewed changes

lintool merged commit 1073485 into castorini:master Aug 2, 2024
1 check passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

documentation update for splade++ huggingface -> onnx #2556

documentation update for splade++ huggingface -> onnx #2556

AndreSlavescu commented Jul 31, 2024

lintool commented Jul 31, 2024

carlos-lassance left a comment

carlos-lassance Aug 1, 2024

carlos-lassance Aug 1, 2024

AndreSlavescu Aug 1, 2024

lintool commented Aug 1, 2024


		---

		Another important component is being able to specify the dimensions of the input and output tensors. This is achieved by the following code:

documentation update for splade++ huggingface -> onnx #2556

documentation update for splade++ huggingface -> onnx #2556

Conversation

AndreSlavescu commented Jul 31, 2024

lintool commented Jul 31, 2024

carlos-lassance left a comment

Choose a reason for hiding this comment

carlos-lassance Aug 1, 2024

Choose a reason for hiding this comment

carlos-lassance Aug 1, 2024

Choose a reason for hiding this comment

AndreSlavescu Aug 1, 2024

Choose a reason for hiding this comment

lintool commented Aug 1, 2024