
Cache Mongo-DB calls (in memory only) #998

Open
wants to merge 6 commits into base: master

Conversation

jason-fox
Contributor

jason-fox commented Mar 2, 2021

Mongo-DB access is slow. This PR minimizes the need to make database calls by caching the last 1000 device calls and 100 group calls in a refreshable cache. This in turn reduces network traffic and increases maximum throughput.

  • Add cache-manager
  • Wrap mongo-db GET calls
  • Bust cache on any provisioning updates/deletes
  • Update unit tests
  • Add documentation

All parameters are settable as config or Docker ENV variables.

Adding cache=true as part of any Device or Group configuration means that the data can potentially be served from the cache rather than always being retrieved from the MongoDB database.
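
For illustration, a minimal sketch of what an opt-in config group provisioning body might look like (the cache flag follows the description above; the payload shape and the other field values are generic placeholders, not taken from this PR's diff):

// Hypothetical example only: a config group provisioned with caching enabled.
// The `cache: true` flag follows the PR description; the other fields are
// generic placeholder values.
const provisionGroupBody = {
    services: [
        {
            apikey: 'myapikey',
            entity_type: 'Thing',
            resource: '/iot/d',
            cache: true // opt this config group into the in-memory cache
        }
    ]
};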

@jason-fox
Contributor Author

Duplicate of #926, but without the Redis cache.

@mapedraza
Collaborator

Thanks for your contribution, Jason.

Regarding the config parameters, the option dontCache unfortunately does not provide backwards compatibility.

A good example could be something like the JEXL configuration approach:

  • A global parameter that configures the default mode (cache enabled or disabled for all config groups, by default)
  • A parameter on the config group that specifies the cache mode and overrides the default configuration

With these two parameters, an existing deployment that updates the IoT Agent can use the cache if needed, offering backwards compatibility with config groups already provisioned. The expected behaviour is described below.

Regarding the cache distribution, it should allow segmentation (as mentioned in #926 (comment)). As far as I saw in the code, the cache is shared across all tenants. In multi-tenant environments there is a risk that one tenant could consume all of the resources. With this in mind, we can differentiate between two types of cache:

  • Group cache: it is associated with a tenant (aka FIWARE-Service). Since neither the IoT Agent’s data model nor the API currently models any “entity” or operation that references a Service (tenant), a first approach to configuring the group cache should be:
    • A general on/off switch for the group cache.
    • Group cache size (all tenants would have the same group cache size).
  • Device cache: each device group has to have its own independent device cache.

The architecture discussed above is illustrated in the following diagram:

Diagram
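
A rough sketch of that segmentation (all names and structure here are illustrative, not code from the PR), assuming the cache-manager memory store mentioned in the PR description:

// Illustrative sketch only: one group cache per tenant (FIWARE-Service) and one
// device cache per config group, so a single tenant cannot evict other tenants'
// entries. Sizes/TTLs would come from the IOTA_GROUP_CACHE_* / IOTA_DEVICE_CACHE_* settings.
const cacheManager = require('cache-manager');

const groupCaches = new Map();  // tenant (FIWARE-Service) -> group cache
const deviceCaches = new Map(); // config group key -> device cache

function getGroupCache(service) {
    if (!groupCaches.has(service)) {
        groupCaches.set(service, cacheManager.caching({ store: 'memory', max: 100, ttl: 60 }));
    }
    return groupCaches.get(service);
}

function getDeviceCache(service, subservice, apikey) {
    const key = service + ':' + subservice + ':' + apikey;
    if (!deviceCaches.has(key)) {
        deviceCaches.set(key, cacheManager.caching({ store: 'memory', max: 100, ttl: 60 }));
    }
    return deviceCaches.get(key);
}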

To summarise, we would need the following env vars, as well as the corresponding config.js parameters:

  • Group Cache
    • IOTA_GROUP_CACHE_MODE. Possible values: none, inMemory
    • IOTA_GROUP_CACHE_SIZE
    • IOTA_GROUP_CACHE_TTL
  • Device cache. These configurations can be overridden through device group provisioning.
    • IOTA_DEVICE_CACHE_DEFAULT_MODE
    • IOTA_DEVICE_CACHE_DEFAULT_SIZE
    • IOTA_DEVICE_CACHE_DEFAULT_TTL
    • IOTA_DEVICE_CACHE_MAX_SIZE

The device group provisioning JSON should also include the following parameters. These parameters should override the env var or config.js configuration.

  • cacheTTL
  • cacheMode: may have one of the following values: none, inMemory
  • cacheSize: if the value provided is greater than IOTA_DEVICE_CACHE_MAX_SIZE, it will be clamped to IOTA_DEVICE_CACHE_MAX_SIZE (see the sketch after this list).
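
A minimal sketch of the intended clamping (assuming the provisioned value arrives as cacheSize and the configured limit as deviceCacheMaxSize; the helper name is illustrative):

// Hypothetical helper: clamp a provisioned cache size to the configured maximum.
function effectiveDeviceCacheSize(group, config) {
    const requested = group.cacheSize || config.deviceCacheDefaultSize;
    return Math.min(requested, config.deviceCacheMaxSize);
}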

Example of cache config file section:

{
    "groupCacheMode":"inMemory",
    "groupCacheSize":100,
    "groupCacheTTL":100000,
    "deviceCacheDefaultMode":"inMemory",
    "deviceCacheDefaultSize":100,
    "deviceCacheDefaultTTL":100000,
    "deviceCacheMaxSize":1000
}

Equivalence with env vars:

Environment variable Configuration attribute
IOTA_GROUP_CACHE_MODE cache.groupCacheMode
IOTA_GROUP_CACHE_SIZE cache.groupCacheSize
IOTA_GROUP_CACHE_TTL cache.groupCacheTTL
IOTA_DEVICE_CACHE_DEFAULT_MODE cache.deviceCacheDefaultMode
IOTA_DEVICE_CACHE_DEFAULT_SIZE cache.deviceCacheDefaultSize
IOTA_DEVICE_CACHE_DEFAULT_TTL cache.deviceCacheDefaultTTL
IOTA_DEVICE_CACHE_MAX_SIZE cache.deviceCacheMaxSize

Another point to consider is the cache replacement mode. Depending on the scenario, it may be more interesting to have an LRU, MRU or random replacement policy. Which mode is used right now? It would be interesting to be able to configure it.
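
For reference, if the PR relies on cache-manager's default memory store (an assumption from the commit list; as far as I understand, that store is backed by lru-cache), the replacement policy would be LRU, roughly behaving like this sketch:

// Sketch of LRU behaviour, assuming cache-manager's in-memory (lru-cache based) store.
const cacheManager = require('cache-manager');
const memoryCache = cacheManager.caching({ store: 'memory', max: 2, ttl: 60 });

memoryCache.set('a', 1);
memoryCache.set('b', 2);
memoryCache.get('a', function () {}); // touching 'a' marks it as recently used
memoryCache.set('c', 3);              // 'b' (the least recently used entry) is evicted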

@SBlechmann

Thanks for this interesting PR! I really hope this can in fact improve read queries from the agents!
Since I'm not a coding expert (I'm more like a simple user) and haven't reviewed the code in full, please allow me these questions:

  • If I understood correctly, the group cache is supposed to be the maximum granted cache per tenant (fiware-service) and the device cache is the maximum granted cache per group (endpoint /iot/services)? If I am right, I find this naming pretty confusing. How about renaming "group cache" to "tenant cache" or "db cache" and "device cache" to "group cache"?
  • Is there a check that the "device cache", or rather the sum of "device caches" per tenant, doesn't exceed the "group cache"?

Thanks for your time, really appreciate it!

@mapedraza
Collaborator

First, I want to clarify that my previous comment, with the description and diagrams, shows the desired behaviour expected from a cache system.

  • If I understood correctly, the group cache is supposed to be the maximum granted cache per tenant (fiware-service) and the device cache is the maximum granted cache per group (endpoint /iot/services)? If I am right, I find this naming pretty confusing. How about renaming "group cache" to "tenant cache" or "db cache" and "device cache" to "group cache"?

I know it is a bit confusing, but the reason for naming it "group cache" is that it is a cache that stores groups (and it is related to each tenant). The reason the device cache is named that way is that devices are stored in that cache (and it is also linked to a group). Depending on whether the name refers to what data is being stored or to what the cache belongs to, one naming or the other may make more sense.

  • Is there a check that the "device cache", or rather the sum of "device caches" per tenant, doesn't exceed the "group cache"?

They are different types of caches. The group cache only stores config groups (also named provision groups) and does not store devices. You just have two different limits (IOTA_GROUP_CACHE_SIZE and IOTA_DEVICE_CACHE_MAX_SIZE).

@SBlechmann

Thanks for the explanation, I think I got your naming now. So device and group cache are independent from each other.

Allow me one last question:

  • Is the group cache at a fixed level for each tenant/database?

I assume there are checks in the background ensuring that the sum of the group and device caches doesn't exceed the memory of MongoDB?

@jason-fox
Contributor Author

Regarding the cache distribution, it should allow segmentation (as mentioned in #926 (comment)). As far as I saw in the code, the cache is shared across all tenants. In multi-tenant environments there is a risk that one tenant could consume all of the resources. With this in mind, we can differentiate between two types of cache:

I think the architecture you described won't work with a pure in-memory cache (which is what this PR now is), but it is something to be achieved in PR #926. All that this PR does for now is substitute an in-memory record of the last n hits; it is very, very simple, but very fast to access. I would assume the per-tenant config would be on Redis as a series of Redis caches.

The same goes for replacement mode - not this PR but the other one.

@jason-fox
Contributor Author

jason-fox commented Apr 8, 2021

Regarding the config parameters, the option dontCache unfortunately does not provide backwards compatibility.

Getting this right is something for the first PR, laying the groundwork so to speak. How is dontCache not backwards compatible?
The default behaviour is "Don't use any caches", the same as before. If and only if caching is deliberately enabled, it is used.
dontCache is a flag on provisioning to bypass any caching for important devices that must remain consistent.

Can you clarify what changes to the current behaviour you are looking for here? I assume something needs fixing; I'm just not sure what.

A good example could be something like JEXL configuration approach:

A global parameter that configures the default mode (cache enabled or disabled for all config groups, by default)

This is currently IOTA_MEMCACHE_ENABLED or memCache.enabled in the config.

A parameter on the config group that specifies the cache mode and overrides the default configuration

The current architecture assumes each cache can be enabled separately, e.g.:

  • IOTA_MEMCACHE_ENABLED
  • IOTA_REDISCACHE_ENABLED
  • etc.

The local in-memory cache is the fastest and smallest. If IOTA_MEMCACHE_ENABLED=false it is not used; if both are true, then Redis will run if the in-memory cache fails.
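
A sketch of that cascade (the lookup order and the helper names here are assumptions about the intended behaviour, not code from the PR):

// Hypothetical lookup cascade: in-memory cache first, then Redis, then MongoDB.
// `memCache`, `redisCache` and `db` are illustrative stand-ins.
async function getDevice(id, config, memCache, redisCache, db) {
    if (config.memCache && config.memCache.enabled) {
        const hit = await memCache.get(id);
        if (hit) {
            return hit;
        }
    }
    if (config.redisCache && config.redisCache.enabled) {
        const hit = await redisCache.get(id);
        if (hit) {
            return hit;
        }
    }
    return db.findDevice(id); // finally fall back to MongoDB
}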

With these two parameters, an existing deployment that updates the IoT Agent can use the cache if needed, offering backwards compatibility with config groups already provisioned.

If we ignore Redis for now, does the quick-and-dirty in-memory cache do enough or not?

@jason-fox
Contributor Author

jason-fox commented Apr 13, 2021

@mapedraza - The in-provisioning flag has been switched from dontCache to cache. The default is therefore not to cache unless explicitly provisioned to do so at the device or device group level. Does this change address the comment below?

Regarding the config parameters, the option dontCache unfortunately does not provide backwards compatibility.

@Blobonat

Is there still active work on this topic? For a horizontally scalable deployment of IOTAs with one common MongoDB cluster this would be a gigantic performance boost latency-wise.

@jason-fox
Contributor Author

Rebased as requested. This part is actually a much smaller change than it appears, since it also corrects the location of the mongoDB test tool, and runs cache flushing when necessary.

const mongoUtils = require('../../tools/mongoDBUtils');

@Blobonat

Will this feature introduce breaking changes for the IOTA implementations like IOTA-JSON and IOTA-UL, or are the changes transparent so that a simple version bump will enable the use of this functionality?

@jason-fox
Contributor Author

jason-fox commented Aug 23, 2022

It is opt-in, so it is only enabled if you set the configuration to do so. Even if you are using an in-memory cache, you could provision individual devices not to use it - it just depends on whether you want lower latency or whether you are worried that IoT Agent A might use older cached in-memory info when a provisioning update has occurred through IoT Agent B.

@jason-fox
Contributor Author

@mapedraza - is this PR still in the queue to be reviewed? It is opt-in, so without setting the parameters, the PR itself is harmless. It is use-case dependent as to whether you want full consistency across multiple IoT Agent instances or lower latency and fewer database look-ups. The text is quite clear about this:

The memCache data is not shared across instances and therefore should be reserved for short-term data storage. Multiple
IoT Agents would potentially hold inconsistent provisioning data until the cache has expired.
