HoloChat Local #70

Closed
wants to merge 1 commit into from

NigelNelson
Contributor

HoloChat Local is a chat application that lets users interact with 'HoloChat', an LLM that is able to answer a wide array of questions about the Holoscan SDK.

To accomplish this, the app uses LangChain to store the Holoscan SDK's repo and user guide in a vector database. HoloChat then queries this vector database to retrieve relevant documentation to aid in answering user questions.
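For reviewers unfamiliar with the retrieval step, here is a minimal sketch of the idea in plain Python. It is not the code in this PR and does not use LangChain's API: `embed`, `ToyVectorStore`, and `build_prompt` are hypothetical names, the bag-of-words "embedding" is a toy stand-in for a real embedding model, and the document chunks are placeholder text rather than actual SDK documentation.

```python
import math
import re
from collections import Counter


def embed(text):
    """Toy embedding: bag-of-words term counts (stand-in for a real embedding model)."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse term-count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


class ToyVectorStore:
    """Hypothetical in-memory store: keeps each chunk alongside its vector."""

    def __init__(self, chunks):
        self.entries = [(chunk, embed(chunk)) for chunk in chunks]

    def query(self, question, k=2):
        """Return the k chunks most similar to the question."""
        q = embed(question)
        ranked = sorted(self.entries, key=lambda e: cosine(q, e[1]), reverse=True)
        return [chunk for chunk, _ in ranked[:k]]


def build_prompt(question, store, k=2):
    """Prepend the retrieved chunks to the question before it is sent to the LLM."""
    context = "\n".join(store.query(question, k))
    return f"Context:\n{context}\n\nQuestion: {question}"


# Placeholder chunks, not real documentation.
docs = [
    "An operator is the basic unit of work in a pipeline.",
    "Use docker run to start the container.",
    "The scheduler decides when each operator executes.",
]
store = ToyVectorStore(docs)
prompt = build_prompt("What is an operator?", store, k=1)
```

In the sketch, the question is embedded the same way as the stored chunks, the closest chunks are selected by cosine similarity, and they are placed ahead of the question in the prompt. A real setup would swap the toy embedding for a learned model and the list scan for a proper vector index.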

To build this repo, the BGE embedding model is used via LangChain, which uses Hugging Face's Transformers library as the back end. This requires torch >= 2.0, and I was unable to find a wheel that supported ARM64 with dGPU, CUDA 11.8, and Python 3.10. So, I built the wheel from source and included it in the repo.

However, I'm interested in potentially deploying this in a container, such as nvcr.io/nvidia/cuda:12.2.0-runtime-ubuntu22.04, which could remove all of the setup steps and allow a user to simply execute docker run.

@NigelNelson NigelNelson self-assigned this Aug 31, 2023
@NigelNelson NigelNelson reopened this Sep 5, 2023
@NigelNelson NigelNelson closed this Sep 5, 2023
@NigelNelson NigelNelson deleted the nigeln/local-llama branch September 6, 2023 03:21