Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

What embeddings model is used for the uploaded documents? #61

Open
algsupport opened this issue Oct 2, 2023 · 6 comments
Open

What embeddings model is used for the uploaded documents? #61

algsupport opened this issue Oct 2, 2023 · 6 comments
Labels
question Further information is requested

Comments

@algsupport
Copy link

I saw the project on the youtube. Seems excellent.

I was wondering. When uploading a new document, what is the embeddings model used to convert it?

Is it possible to chose a custom one? If so, how can it be done?

Thank you.

@timothycarambat
Copy link
Member

Custom embedding models arent supported at this time just because we haven't expanded the scope yet. Right now its just the normal 1536 text-ada-embedding-002 by OpenAI. Obviously not everyone uses that and if you tried to edit or add a doc and the dimensions dont match it would stop you from doing so.

Are you using a Hugging face model for embeddings?

@timothycarambat timothycarambat added the question Further information is requested label Oct 2, 2023
@algsupport
Copy link
Author

Yes, I would want to use a hugging face model if possible.
text-ada-embedding-002 works too, but would be more convinient to select the embeddings model.

Would you mind if I try to add it myself (Of course I will make a pull request if I succeed.)? Could you point me towards the part of the code that would be responsible for the embeddings CRUD?

Thank you

@timothycarambat
Copy link
Member

It is used in several areas (job for each vectorDB at this time).

If you look for anywhere openAi.embedTextChunk or openAi.embedTextChunks is used - that is the only place embeddings are currently used!

@andsty
Copy link

andsty commented Nov 28, 2023

can we have support on hugging face embeddings as well? or is not possible?

@timothycarambat
Copy link
Member

You can, but it is supported via LocalAI and not via the HuggingFace API directly. Is that what you are using for embedding currently?

@hasani114
Copy link

Any plan on adding this feature? Since this post, openai also released another model for embeddings. Also there are more specialized embeddings models being developed by companies like voyage ai etc which we'd like to be able to use.

Thanks!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
question Further information is requested
Projects
None yet
Development

No branches or pull requests

4 participants