From dc651db69f6cb66c862d846267bbdaa36cccd213 Mon Sep 17 00:00:00 2001
From: chiayewken
Date: Tue, 28 Mar 2023 23:16:02 +0800
Subject: [PATCH] c

---
 README.md | 11 +++++++----
 1 file changed, 7 insertions(+), 4 deletions(-)

diff --git a/README.md b/README.md
index 9da5812..60b7e5e 100644
--- a/README.md
+++ b/README.md
@@ -3,10 +3,13 @@
 This repository contains code for extending the [Stanford Alpaca](https://github.com/tatsu-lab/stanford_alpaca) synthetic instruction tuning to existing instruction-tuned models such as [Flan-T5](https://arxiv.org/abs/2210.11416).
 The pretrained models and demos are available on HuggingFace 🤗 :
-[Base](https://huggingface.co/declare-lab/flan-alpaca-base) (220M),
-[Large](https://huggingface.co/declare-lab/flan-alpaca-large) (770M),
-[XL](https://huggingface.co/declare-lab/flan-alpaca-xl) (3B),
-XXL (11B, Coming soon)
+
+| Model                                                                     | Parameters | Training GPUs   |
+|---------------------------------------------------------------------------|------------|-----------------|
+| [Flan-Alpaca-Base](https://huggingface.co/declare-lab/flan-alpaca-base)   | 220M       | 1x A6000        |
+| [Flan-Alpaca-Large](https://huggingface.co/declare-lab/flan-alpaca-large) | 770M       | 1x A6000        |
+| [Flan-Alpaca-XL](https://huggingface.co/declare-lab/flan-alpaca-xl)       | 3B         | 1x A6000        |
+| [Flan-Alpaca-XXL](https://huggingface.co/declare-lab/flan-alpaca-xxl)     | 11B        | 4x A6000 (FSDP) |
 
 ### Why?