Skip to content

Latest commit

 

History

History
81 lines (51 loc) · 3.63 KB

README.md

File metadata and controls

81 lines (51 loc) · 3.63 KB

Tutorials for RAG usage with an LLM locally or in Google Colab

Simple RAG tutorials that can be run locally with an LLM or using Google Colab (only Pro version).

These notebooks can be executed locally or in Google Colab. Either way, you have to install Ollama to run it.

RAG diagram

Tutorials

Technologies used

For these tutorials, we use LangChain, LlamaIndex, and HuggingFace for generating the RAG application code, Ollama for serving the LLM model, and a Jupyter or Google Colab notebook.

Langchain Logo LlamaIndex Logo HuggingFace Logo Ollama Logo Jupyter Logo Google Colab Logo

Intructions to run the example locally

  • Download and install Ollama:

Go to this URL and install it: https://ollama.com/download

  • Pull the LLM model. In this case, llama3:
ollama pull llama3

More details about llama3 in the official release blog and in Ollama documentation.

Intructions to run the example using Google Colab (Pro account needed)

  • Install Ollama from the command line:

(Press the button on the bottom-left part of the notebook to open a Terminal)

curl -fsSL https://ollama.com/install.sh | sh
  • Pull the LLM model. In this case, llama3
ollama serve & ollama pull llama3
  • Serve the model locally so the code can access it.
ollama serve & ollama run llama3

If an error is raised related to docarray, refer to this solution: https://stackoverflow.com/questions/76880224/error-using-using-docarrayinmemorysearch-in-langchain-could-not-import-docarray