Retrieve image text pairs #14
base: main
Conversation
…dates in jsonl format
Thank you for your contributions!
- For the InteractiveRetriever, it seems that MSCOCO-specific values are hardcoded. I think we should make it a tool applicable to all tasks and datasets.
- We can add a run_interactive_retriever_pipeline.sh that demonstrates the entire pipeline.

Please see the detailed comments for the review :)
src/common/interactive_retriever.py
Outdated
```python
# MSCOCO's dataset id is hardcoded since the dataset id and query/candidate
# modalities determine the instruction part of the prompt.
# MSCOCO's dataset supports prompt instructions for both image->text and
# text->image query->candidate modalities.
self.dataset_id = 9
```
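One way to address the hardcoding, sketched below with hypothetical signatures (the actual MBEIR code may structure this differently), is to make the ids constructor arguments so the MSCOCO values become mere defaults:

```python
class InteractiveRetriever:
    """Sketch: dataset/task ids are passed in rather than hardcoded.

    The dataset id and the query/candidate modalities together determine
    the instruction part of the prompt, so the caller chooses the dataset.
    """

    MSCOCO_DATASET_ID = 9  # the value hardcoded in the original snippet

    def __init__(self, dataset_id=MSCOCO_DATASET_ID, task_id=None):
        self.dataset_id = dataset_id
        self.task_id = task_id


# The default preserves the current MSCOCO behaviour; other datasets
# simply pass their own id instead.
retriever = InteractiveRetriever()
```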
Is the InteractiveRetriever specifically designed for the MSCOCO dataset? I observed that the `self.dataset_id` and `task_id` assignments appear to be hardcoded.
I changed the InteractiveRetriever to be generic, but the way it is currently integrated with the mbeir_retriever is for retrieving complement candidates to create image-text pairs, and MSCOCO is a dataset that supports both text->image and image->text queries. The mbeir_retriever now sets the dataset to MSCOCO for this task.
```python
IMAGE = "image"


class InteractiveRetriever:
```
I noticed that the InteractiveRetriever requires a pre-built candidate index file to function correctly. To assist users with this setup, could we consider adding a script, such as `run_interactive_retriever_pipeline.sh`, that demonstrates the entire pipeline? This script would cover embedding, indexing, loading the index for the interactive retriever, and retrieving demo queries. Additionally, a step-by-step guide in the README could greatly enhance the user experience.
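The three stages the pipeline script would cover can be sketched end to end. This is a numpy-only stand-in (random vectors instead of a real encoder, a saved array instead of the project's FAISS index; file names are hypothetical), just to show the flow of embed, index, then load-and-retrieve:

```python
import tempfile
from pathlib import Path

import numpy as np

# --- 1. Embed: stand-in encoder producing unit-norm candidate vectors. ---
rng = np.random.default_rng(0)
cand_emb = rng.normal(size=(100, 8)).astype(np.float32)
cand_emb /= np.linalg.norm(cand_emb, axis=1, keepdims=True)

# --- 2. Index: the real pipeline writes a FAISS index; we save an array. ---
index_path = Path(tempfile.mkdtemp()) / "cand_index.npy"
np.save(index_path, cand_emb)

# --- 3. Load + retrieve: brute-force inner-product search for a demo query. --
index = np.load(index_path)
query = index[0]                  # demo query: reuse candidate 0's embedding
scores = index @ query            # cosine similarity (vectors are unit norm)
top_k = np.argsort(-scores)[:5]   # ids of the 5 most similar candidates
```

The demo query is a candidate embedding itself, so the top hit is that candidate with similarity 1.0, which makes the retrieval step easy to sanity-check.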
Done, I created a `unirag` folder next to `inbatch` for BLIP_FF Large and CLIP_SF Large. It has embed, index, and retrieval configs and the run script, as you requested.
@lim142857 I applied the requested changes, PTAL
[3/3] Retrieve complement candidates when enabled in the config. The complement candidate of an original retrieved candidate has the complement modality, so that a candidate and its complement always form an image-text pair. The complement candidate for each original candidate is retrieved by using the original candidate as an interactive query; it is the most relevant candidate with the complement modality that is different from the original query.
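The selection rule described in [3/3] can be sketched with a toy candidate pool (hypothetical data layout; the real retriever ranks over index search results):

```python
# Toy candidate pool: (candidate_id, modality). scores[i] is the relevance
# of candidates[i] to the interactive query (the original retrieved candidate).
candidates = [(0, "image"), (1, "text"), (2, "text"), (3, "image")]
scores = [0.9, 0.8, 0.7, 0.6]


def complement(candidate_modality, original_query_id):
    """Top-scoring candidate of the complement modality, excluding the
    original query, so candidate + complement form an image-text pair."""
    want = "text" if candidate_modality == "image" else "image"
    pool = [(score, cid) for (cid, modality), score in zip(candidates, scores)
            if modality == want and cid != original_query_id]
    return max(pool)[1] if pool else None


# For an image candidate whose original query was text candidate 1, the
# top text candidate (1) is skipped and the next one (2) is returned.
best = complement("image", original_query_id=1)
```

The exclusion clause is what implements "different from the original query": without it, the complement of an image candidate retrieved for a text query could simply be that same text query again.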