Support embeddings workflows #36

alexanderatallah · 2023-04-18T23:58:53Z

Support workflows involving embeddings and vector databases. Need to discuss:

whether the storage of embeddings should be left to the developer
if not, ephemeral vs persistent embeddings use cases
most expensive "tinkering" use cases that require vector databases, that could be made low/zero cost for devs
cases where multiple apps are computing embeddings on the same data
costs of switching between embedding models (recomputing embeddings)

handrew · 2023-04-20T02:40:54Z

Working on a demo of embeddings w/ Window, so I have a few thoughts here.

I think it should be fairly easy for Window to be neutral here and allow the developer to store embeddings wherever they want on the backend.
For instance, the thing I'm working on saves embeddings to disk locally. It'd be easy to imagine saving this to some vector store or plain old SQL db in a hosted service and letting the developer handle CRUD operations, caching, etc.
However, it would be nice if Window had some way to store embeddings on the client side as well :). Either in the client's browser's local storage, or even just in memory. To this end, there are a few tools — Huggingface.js and Chroma come to mind — that could be used in Window “out of the box”.

I don't have a strong view or prior on what use cases might prefer one or the other yet, but per the above I think it could be fairly simple to enable flexibility of in-memory / ephemeral vs. locally-stored / managed embeddings. Could be a potential "Infura"-like offering too?

My prior on this is that OpenAI's embeddings are so cheap that the "zero cost" value prop might not be as acutely felt. Also, I've seen some vector dbs, e.g., Chroma use free, off-the-shelf HuggingFace embedding models, so those might be an option to get started out with.

Will of course depend on the provider. Unless I'm misunderstanding, if I were you, I'd probably leave this up to the developer.

alexanderatallah added this to window.ai Roadmap Apr 15, 2023

alexanderatallah converted this from a draft issue Apr 18, 2023

handrew mentioned this issue Apr 20, 2023

Example demo app with multiple custom models, including backend embedding #39

Merged

Provide feedback