Finetuning

We provide the scripts as well as the trained models on the downstream tasks as follows. Note that before you run the finetuning scripts, you need to download the corresponding pretrained ROSITA model here.

VQA

The following script run the finetuning on the train+trainvalsplit of VQAv2.

$ bash scripts/train-vqa-vqav2.sh

As the training stage complete, you may run the following script to run evaluation on the test split and generate a result.json file under the output folder. This file can be submitted to the online server to obtain the performance.

$ bash scripts/test-vqa-vqav2.sh

We also provide the checkpoint model to reproduce the following results on test-dev and test-std split using the testing script above.

Name	Test-dev				Test-std				Downloads
Name	All	Y/N	Num	Other	All	Y/N	Num	Other	Downloads
ROSITA-base	73.91	89.91	56.07	64.29	73.97	89.76	55.81	64.39	model

REC

RefCOCO

The following script run the finetuning on the train split of RefCOCO.

$ bash scripts/train-rec-refcoco.sh

As the training stage complete, you may run the following script to run evaluation on the val/testA/testB split of RefCOCO.

bash scripts/test-rec-refcoco.sh

We also provide the checkpoint model to reproduce the following results on val and testA/B split using the testing script above.

Name	RefCOCO			Downloads
Name	val	testA	testB	Downloads
ROSITA-base	84.79	87.99	78.28	model

RefCOCO+

The following script run the finetuning on the train split of RefCOCO+.

bash scripts/train-rec-refcoco+.sh

As the training stage complete, you may run the following script to run evaluation on the val/testA/testB split of RefCOCO+.

bash scripts/test-rec-refcoco+.sh

We also provide the checkpoint model to reproduce the following results on val and testA/B split using the testing script above.

Name	RefCOCO+			Downloads
Name	val	testA	testB	Downloads
ROSITA-base	76.06	82.01	67.40	model

RefCOCOg

The following script run the finetuning on the train split of RefCOCOg.

bash scripts/train-rec-refcocog.sh

As the training stage complete, you may run the following script to run evaluation on the val/test split of RefCOCOg.

bash scripts/test-rec-refcocog.sh

We also provide the checkpoint model to reproduce the following results on val and testA/B split using the testing script above.

Name	RefCOCOg		Downloads
Name	val	test	Downloads
ROSITA-base	78.23	78.25	model

ITR

ITR-COCO

The following script run the finetuning on the train split of ITR-COCO.

bash scripts/train-itr-coco.sh

As the training stage complete, you may run the following script to run evaluation on the testall split of ITR-COCO.

bash scripts/test-itr-coco.sh

We also provide the checkpoint model to reproduce the following results on testall split using the testing script above.

Name	IR-COCO			TR-COCO			Downloads
Name	R@1	R@5	R@10	R@1	R@5	R@10	Downloads
ROSITA-base	54.40	80.92	88.60	71.26	91.62	95.58	model

ITR-Flickr

The following script run the finetuning on the train split of ITR-Flickr.

bash scripts/train-itr-flickr.sh

As the training stage complete, you may run the following script to run evaluation on the test split of ITR-Flickr.

bash scripts/test-itr-flickr.sh

We also provide the checkpoint model to reproduce the following results on test split using the testing script above.

Name	IR-Flickr			TR-Flickr			Downloads
Name	R@1	R@5	R@10	R@1	R@5	R@10	Downloads
ROSITA-base	74.08	92.44	96.08	88.90	98.10	99.30	model

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

finetune.md

finetune.md

Finetuning

Table of Contents

VQA

REC

RefCOCO

RefCOCO+

RefCOCOg

ITR

ITR-COCO

ITR-Flickr

Files

finetune.md

Latest commit

History

finetune.md

File metadata and controls

Finetuning

Table of Contents

VQA

REC

RefCOCO

RefCOCO+

RefCOCOg

ITR

ITR-COCO

ITR-Flickr