- Hosts gemma 2b-it (without a bitsandbytes config by default): uncomment the `quantization_config=bnb_config,` line to load the quantized model instead
- A POST endpoint accepts any research paper's abstract as input and tries to produce a title that maximises clickbait as output
- The gemma 2b-it model is loaded with a PEFT adapter obtained by QLoRA fine-tuning gemma 2b-it on the dataset [https://huggingface.co/datasets/ashishkgpian/astromistral], which contains the abstracts and titles of scientific/research papers shown to have maximum reads even without citations (i.e., clickbait potential)
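The optional quantized load described above can be sketched as below. This is an illustrative outline, not the repo's actual `main.py`: the 4-bit settings and the adapter path in the usage comment are assumptions.

```python
# Sketch of how the model-loading kwargs might be assembled, mirroring the
# commented-out `quantization_config=bnb_config,` line mentioned above.

def build_load_kwargs(quantize: bool = False) -> dict:
    """Build keyword arguments for AutoModelForCausalLM.from_pretrained."""
    kwargs = {"device_map": "auto"}
    if quantize:
        # Requires the bitsandbytes package; settings here are illustrative.
        from transformers import BitsAndBytesConfig
        kwargs["quantization_config"] = BitsAndBytesConfig(load_in_4bit=True)
    return kwargs

# Usage (downloads model weights, so run only with dependencies installed):
# from transformers import AutoModelForCausalLM
# from peft import PeftModel
# base = AutoModelForCausalLM.from_pretrained("google/gemma-2b-it", **build_load_kwargs())
# model = PeftModel.from_pretrained(base, "path/to/qlora-adapter")  # adapter path is hypothetical
```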
To run it locally:
- Clone the repository
git clone https://github.com/ashishakkumar/HuggingFace-FastAPI-Uvicorn.git
- Create a virtual environment in the directory
>> python -m venv virtual_environment_name
- Activate the environment
>> source virtual_environment_name/bin/activate
- Install all the required libraries
>> pip install -r requirements.txt
- Start the server
>> uvicorn main:app --reload
Once the server is running, open http://127.0.0.1:8000/docs in the browser to explore the API's inputs and outputs in the interactive docs.
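With the server running, the endpoint can also be called programmatically. The sketch below uses only the standard library; the endpoint path (`/predict`) and JSON field name (`abstract`) are assumptions, so check the /docs page for the actual request schema.

```python
# Minimal client sketch for the title-generation API (standard library only).
# NOTE: the "/predict" path and "abstract" field are assumptions, not taken
# from the repo; verify them against http://127.0.0.1:8000/docs.
import json
import urllib.request

def build_request(abstract: str,
                  url: str = "http://127.0.0.1:8000/predict") -> urllib.request.Request:
    """Package a paper abstract as a JSON POST request."""
    body = json.dumps({"abstract": abstract}).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )

# With the server up:
# with urllib.request.urlopen(build_request("We study exoplanet atmospheres...")) as resp:
#     print(json.load(resp))
```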