Skip to content

Commit

Permalink
ViLT
Browse files Browse the repository at this point in the history
ViLT
  • Loading branch information
TarrySingh committed Jul 10, 2024
1 parent 0c0aa4c commit 608ca25
Show file tree
Hide file tree
Showing 8 changed files with 35,840 additions and 0 deletions.
Binary file modified .DS_Store
Binary file not shown.
Binary file modified deep-learning/.DS_Store
Binary file not shown.
22,864 changes: 22,864 additions & 0 deletions deep-learning/Transformer-Tutorials/ViLT/Fine_tuning_ViLT_for_VQA.ipynb

Large diffs are not rendered by default.

Large diffs are not rendered by default.

Large diffs are not rendered by default.

10 changes: 10 additions & 0 deletions deep-learning/Transformer-Tutorials/ViLT/README.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,10 @@
# ViLT notebooks
In this directory, you can find several notebooks that illustrate how to use NAVER AI Lab's [ViLT](https://arxiv.org/abs/2102.03334) both for fine-tuning on custom data as well as inference. It currently includes the following notebooks:

- fine-tuning ViLT for visual question answering (VQA) (based on the [VQAv2 dataset](https://visualqa.org/))
- performing inference with ViLT to illustrate visual question answering (VQA)
- masked language modeling (MLM) with a pre-trained ViLT model
- performing inference with ViLT for image-text retrieval
- performing inference with ViLT to illustrate natural language for visual reasoning (based on the [NLVRv2 dataset](https://lil.nlp.cornell.edu/nlvr/)).

All models can be found on the [hub](https://huggingface.co/models?search=vilt).
Loading

0 comments on commit 608ca25

Please sign in to comment.