
feat: issue-1028, fetch models when user enters api key #3251

Open · wants to merge 9 commits into main
Conversation

@normunds-wipo (Contributor) commented Jul 3, 2024

Summary

This solution is intended only for custom configurations. Each time the user enters an API key from the frontend, the models are reloaded, including from the endpoint that uses the new key.

Relevant changes:

  • in the frontend API layer, updating a user key invalidates the cached models, causing them to be re-fetched when next used (a sketch of the invalidation follows this list)
  • in ModelsController, the get method no longer serves from the cache; it retrieves the models on every call
  • in loadConfigModels, if api_key is defined as user_provided, check whether the user has a valid key and, if so, use it to fetch models
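A minimal sketch of the frontend invalidation, assuming the client uses a recent @tanstack/react-query; the mutation hook, endpoint path, and query key below are illustrative, not LibreChat's actual API:

```js
import { useMutation, useQueryClient } from '@tanstack/react-query';

// Sketch only: '/api/keys' and the ['models'] query key are assumptions.
export function useUpdateUserKey() {
  const queryClient = useQueryClient();
  return useMutation({
    mutationFn: (payload) =>
      fetch('/api/keys', {
        method: 'PUT',
        headers: { 'Content-Type': 'application/json' },
        body: JSON.stringify(payload),
      }),
    onSuccess: () => {
      // Mark the cached models query stale so the next consumer
      // re-fetches /api/models with the newly saved key in effect.
      queryClient.invalidateQueries({ queryKey: ['models'] });
    },
  });
}
```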

This may also work for default endpoints, since the models are now refreshed each time a key is entered, but this has not been tested. A sketch of the loadConfigModels change follows.
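A minimal sketch of the loadConfigModels change, under the assumption of hypothetical helpers (getCustomConfig, getUserKey, and fetchModels are illustrative names, not the exact LibreChat API):

```js
// Sketch only: helper names are assumptions, not the actual LibreChat API.
async function loadConfigModels(req) {
  const modelsConfig = {};
  const customEndpoints = getCustomConfig()?.endpoints?.custom ?? [];

  for (const endpoint of customEndpoints) {
    let apiKey = endpoint.apiKey;
    if (apiKey === 'user_provided') {
      // Look up the key this user entered in the UI (hypothetical lookup).
      apiKey = await getUserKey({ userId: req.user.id, name: endpoint.name });
    }

    if (apiKey) {
      // A valid key is available: fetch the live model list with it.
      modelsConfig[endpoint.name] = await fetchModels({ baseURL: endpoint.baseURL, apiKey });
    } else {
      // No usable key yet: fall back to the statically configured models.
      modelsConfig[endpoint.name] = endpoint.models?.default ?? [];
    }
  }

  return modelsConfig;
}
```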

Change Type

  • New feature (non-breaking change which adds functionality)

Testing

Tested only by running the code; we are using only a custom configuration.

Checklist


  • My code adheres to this project's style guidelines
  • I have performed a self-review of my own code
  • I have commented in any complex areas of my code
  • Local unit tests pass with my changes (tests related to tokenSplit fail with or without my changes)

@normunds-wipo normunds-wipo marked this pull request as ready for review July 3, 2024 11:44
-const customModelsConfig = await loadConfigModels(req);
-
-const modelConfig = { ...defaultModelsConfig, ...customModelsConfig };
+const modelConfig = { ...(await loadDefaultModels(req)), ...(await loadConfigModels(req)) };
Owner commented:
With this change, the cache is never used for models; this will lead to slow page loads every time.

@normunds-wipo (Contributor, Author) replied:

Without this change it will not work. Now what?

Right, the cache is never used when the UI issues an /api/models request, and that happens in only two cases:

  • the first load of the UI (this is your worry: we do not need to re-query endpoints if the key is fixed)
  • after a user key update (which is exactly the behavior we want)

The cache is still used when some other part of the code requests models via getModelsConfig().

We can say that all models retrieved with a non-user-provided key stay the same across users. In that case we could split endpoints into those using fixed keys and those using user-provided keys.

Then maybe we could cache the fixed set and the user-provided set separately. I guess the split could not simply be between default models and custom models, since both can use fixed as well as user-provided keys, right?

At first glance the model queries are not easy to split up this way. Probably doable, but it hardly feels worth it.

Alternatively, we could cache models per endpoint+key. Then only one request is needed: for the endpoint whose key the user just changed. That sounds best to me, but it involves changes across all endpoints/providers. A sketch of the idea follows.
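A sketch of the per endpoint+key idea; the in-memory Map, hashing scheme, and TTL are assumptions (a production version would likely use the project's existing cache store):

```js
const crypto = require('crypto');

// In-memory stand-in for a real cache; the raw API key is hashed
// so secrets never appear in cache keys.
const modelCache = new Map();

const cacheKey = (baseURL, apiKey) =>
  `${baseURL}:${crypto.createHash('sha256').update(apiKey).digest('hex')}`;

async function getModelsCached({ baseURL, apiKey, fetchModels, ttlMs = 5 * 60 * 1000 }) {
  const key = cacheKey(baseURL, apiKey);
  const hit = modelCache.get(key);
  if (hit && Date.now() - hit.fetchedAt < ttlMs) {
    return hit.models; // cache hit: no remote request
  }
  const models = await fetchModels({ baseURL, apiKey });
  modelCache.set(key, { models, fetchedAt: Date.now() });
  return models;
}
```

With this scheme, changing the key for one endpoint only misses that endpoint's entry, so a single remote request refreshes it.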

@normunds-wipo (Contributor, Author) commented Jul 8, 2024:

Looking a bit deeper, it seems we cannot cache ALL models as one block, as is currently done in ModelsController, if we need to allow different users to have different sets of models. A user key entered from the UI implies exactly that. Say we have a custom configuration with a user-provided key:

  • you get a key with 5 models; you log in, enter the key, and we cache N+5 models
  • I get a key with 2 models; I log in, enter the key, and we cache N+2 models
  • you perform an operation that requires model validation; if validation uses the cache, your operation will be validated against my 2 models :-(

The solution is to cache models per url+key and, whenever we need a list of models, rebuild the whole model set (rather than use the fixed one in CONFIG_STORE). It looks like hardly any models are fetched from a remote endpoint except those in the custom config (and those loaded via getOpenAIModels, which already caches models per endpoint).
So all the changes will go into loadConfigModels (and I will remove any usage of the cache in ModelsController); I will update the PR with these changes shortly.
There is also some draft code in overrideController() that intends to update the cache; it will need to be aligned if/when it is implemented. A sketch of the rebuild follows.
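A sketch of that rebuild, reusing the per url+key cache idea above (getUserKey, fetchModels, and getModelsCached are the same illustrative names, not the actual code):

```js
// Sketch only: rebuild the requesting user's model set on every
// /api/models call from per url+key cache entries, instead of
// serving one global block from CONFIG_STORE.
async function buildModelConfig(req, endpoints) {
  const modelConfig = {};
  for (const endpoint of endpoints) {
    // Resolve either the fixed key from the config or the user's stored key.
    const apiKey =
      endpoint.apiKey === 'user_provided'
        ? await getUserKey({ userId: req.user.id, name: endpoint.name })
        : endpoint.apiKey;
    modelConfig[endpoint.name] = await getModelsCached({
      baseURL: endpoint.baseURL,
      apiKey,
      fetchModels,
    });
  }
  return modelConfig;
}
```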

@mkagit commented Sep 12, 2024:

Is it working without issues, @normunds-wipo?

@normunds-wipo (Contributor, Author) replied:

Yes. I just merged the main branch and had to adjust for the encryption/decryption utility's return-value change (from string to Promise), but otherwise we have been using this code for a couple of months without problems. The adjustment is sketched below.
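The adjustment, sketched under the assumption that the utility is called decrypt: it now returns a Promise<string> rather than a string, so call sites must await it:

```js
// Before the change: const apiKey = decrypt(storedEncryptedKey);
const apiKey = await decrypt(storedEncryptedKey);
```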

@mkagit commented Sep 12, 2024:

> Yes. I just merged the main branch and had to adjust for the encryption/decryption utility's return-value change (from string to Promise), but otherwise we have been using this code for a couple of months without problems.

I'll try it. What's your use case for the reloading fetch?
I was looking to fetch models from the LiteLLM virtual-keys proxy server through a custom "user_provided" key.

@normunds-wipo (Contributor, Author) replied:

The same: querying models from LiteLLM. Different users have different keys and potentially different sets of models, so we cannot cache all models and instead need to query them per user. Also, once the user enters a key, we need to reload the models corresponding to that new key. It seems to be doing this correctly.

@dansavu commented Sep 16, 2024:

This is a useful feature; looking forward to seeing the PR approved and merged.
