FastFiD: Improve Inference Efficiency of Open Domain Question Answering via Sentence Selection

This is the official PyTorch implementation of the ACL 2024 paper "FastFiD: Improve Inference Efficiency of Open Domain Question Answering via Sentence Selection".

Overview

In this work, we propose FastFiD, which performs sentence selection on the encoded passages to address the inference efficiency problem of FiD and similar RAG systems. This retains valuable information while reducing the context length required for generating answers. Experiments on three commonly used datasets (Natural Questions, TriviaQA, and ASQA) show that our method speeds up inference by 2.3X-5.7X while maintaining the model's performance.

Requirements

Python: 3.8.12

pip install -r requirements.txt

Note that you should install the PyTorch version that matches your CUDA version. See the official PyTorch website for instructions.

Model and Data Preparation

Required checkpoints and embeddings

Retriever Model:

NQ Retriever: https://github.com/facebookresearch/FiD/blob/main/get-model.sh
TQA Retriever: https://github.com/facebookresearch/FiD/blob/main/get-model.sh
ASQA uses the same retriever as NQ.

Pretrained Model:

T5-base: https://huggingface.co/google-t5/t5-base
T5-large: https://huggingface.co/google-t5/t5-large
Llama2-7B: https://huggingface.co/meta-llama/Llama-2-7b

Required data files

Wikipedia evidence passages, NQ, TriviaQA, and ASQA:

Tsinghua Cloud
Google Drive

Data Processing

Build the retrieval index for the Wikipedia passages using the retriever models.
```
bash scripts/build_index.sh
```

Process the data to get the retrieved passages for each QA pair.

bash scripts/evaluate_retriever.sh
bash scripts/build_qap_dataset.sh

Process the data to identify supporting sentences in the retrieved passages.
```
bash scripts/build_qap_sentence_dataset.sh
```
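To give a rough idea of what sentence-level labels can look like, here is a minimal sketch that marks a sentence as supporting when it contains one of the gold answer strings. This is an assumption made for illustration, not a description of what `build_qap_sentence_dataset.sh` actually does: the function names, the regex-based sentence splitter, and the substring matching rule are all hypothetical.

```python
import re
from typing import List


def split_sentences(passage: str) -> List[str]:
    """Naive splitter (assumption): break after '.', '!' or '?' followed by whitespace."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", passage) if s.strip()]


def normalize(text: str) -> str:
    """Lowercase, replace punctuation with spaces, and collapse whitespace."""
    return " ".join(re.sub(r"[^\w\s]", " ", text.lower()).split())


def label_supporting_sentences(passage: str, answers: List[str]) -> List[int]:
    """Return one 0/1 label per sentence: 1 if any normalized answer occurs in it."""
    normalized_answers = [normalize(a) for a in answers]
    normalized_answers = [a for a in normalized_answers if a]  # drop empty answers
    labels = []
    for sentence in split_sentences(passage):
        norm = normalize(sentence)
        labels.append(int(any(a in norm for a in normalized_answers)))
    return labels


passage = (
    "Paris is the capital of France. "
    "It hosted the 1900 Olympics. "
    "The Seine runs through it."
)
labels = label_supporting_sentences(passage, ["Paris"])
print(labels)
```

Plain substring matching is deliberately simple and can over-match (a short answer may appear inside a longer word), so treat the labels it produces as a rough approximation.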

Main Experiments

First Stage Training

In the first stage, we use multi-task training that combines sentence selection and answer generation. The resulting model can both predict the final answer from the question and retrieved passages and select the valuable information in those passages.

bash scripts/train_hybrid.sh
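The multi-task objective can be pictured as a weighted sum of an answer generation loss and a sentence selection loss. The sketch below assumes a binary cross-entropy selection loss and a single scalar weight; both are illustrative assumptions, and `hybrid_loss`, `selection_weight`, and the example numbers are hypothetical rather than taken from `train_hybrid.sh`.

```python
import math
from typing import List


def binary_cross_entropy(probs: List[float], labels: List[int]) -> float:
    """Mean binary cross-entropy between predicted probabilities and 0/1 labels."""
    eps = 1e-12  # clamp to avoid log(0)
    total = 0.0
    for p, y in zip(probs, labels):
        p = min(max(p, eps), 1.0 - eps)
        total += -(y * math.log(p) + (1 - y) * math.log(1.0 - p))
    return total / len(probs)


def hybrid_loss(
    generation_loss: float,
    selection_probs: List[float],
    selection_labels: List[int],
    selection_weight: float = 1.0,  # hypothetical trade-off coefficient
) -> float:
    """Combine the two task losses into one scalar training objective."""
    selection_loss = binary_cross_entropy(selection_probs, selection_labels)
    return generation_loss + selection_weight * selection_loss


# Toy values: a generation loss of 2.0 and two sentences scored at 0.5 each.
loss = hybrid_loss(2.0, [0.5, 0.5], [1, 0], selection_weight=1.0)
print(loss)
```

In an actual PyTorch training loop both terms would be tensors so that gradients flow through the shared encoder; plain floats are used here only to keep the arithmetic visible.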

Second Stage Training

In the second stage, we train the model to generate answers from only the selected information, which is what accelerates inference.

bash scripts/train_select_generation.sh
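The effect of selection on context length can be sketched as follows. This is a simplified illustration under stated assumptions: sentences are represented as lists of tokens standing in for encoder outputs, the selection rule is a plain top-k over per-sentence scores, and the names `select_sentences` and `build_reduced_context` are hypothetical. The repository's actual selection criterion may differ.

```python
from typing import List, Tuple

# (selection score, tokens); tokens stand in for encoded token representations
Sentence = Tuple[float, List[str]]


def select_sentences(sentences: List[Sentence], top_k: int) -> List[int]:
    """Return indices of the top_k highest-scoring sentences in original order."""
    ranked = sorted(range(len(sentences)), key=lambda i: sentences[i][0], reverse=True)
    return sorted(ranked[:top_k])


def build_reduced_context(sentences: List[Sentence], top_k: int) -> List[str]:
    """Concatenate only the selected sentences to form the shorter context."""
    keep = select_sentences(sentences, top_k)
    return [token for i in keep for token in sentences[i][1]]


sentences = [
    (0.1, ["a", "b", "c"]),
    (0.9, ["d", "e"]),
    (0.4, ["f"]),
    (0.8, ["g", "h"]),
]
full_length = sum(len(tokens) for _, tokens in sentences)
reduced = build_reduced_context(sentences, top_k=2)
print(full_length, len(reduced), reduced)
```

The generation step then attends over the shorter sequence only, which is where the inference savings come from.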
