
Add warnings when the index size exceeds work_mem #200

Closed
wants to merge 10 commits

Conversation

ezra-varady (Collaborator)

This adds checks for the work_mem and maintenance_work_mem GUC variables. Checks for both are fairly uncommon in the upstream extensions. It seems that maintenance_work_mem is checked more often than work_mem; this may be because its greater size makes it less prohibitive to respect. Neither is actually enforced by Postgres. I'll list some examples below.

maintenance_work_mem is referenced more infrequently, but its use is pretty straightforward when it is. In src/backend/access/gin/gininsert.c, the build callback checks maintenance_work_mem:

if (buildstate->accum.allocatedMemory >= (Size) maintenance_work_mem * 1024L)

Similarly, nbtree checks it in its parallel build functions in src/backend/access/nbtree/nbtsort.c:

sortmem = maintenance_work_mem / btshared->scantuplesortstates;
_bt_parallel_scan_and_sort(btspool, btspool2, btshared, sharedsort,
                           sharedsort2, sortmem, false);

It is also checked in src/backend/commands/vacuumparallel.c. It is never checked in contrib. I think the bloom in the earlier listing refers to an internal bloom filter, not the extension. Notably, though, pgvector does have a check.

work_mem is also checked very infrequently, although within the optimizer/executor there are a number of checks. In src/backend/access/gin/ginfast.c it gets checked:

workMemory = work_mem;
...
/*
 * Is it time to flush memory to disk?  Flush if we are at the end of
 * the pending list, or if we have a full row and memory is getting
 * full.
 */
if (GinPageGetOpaque(page)->rightlink == InvalidBlockNumber ||
    (GinPageHasFullRow(page) &&
     (accum.allocatedMemory >= workMemory * 1024L)))

It gets checked in src/backend/access/nbtree/nbtpage.c as well, albeit in a function that is only called during vacuums:

maxbufsize = (work_mem * 1024L) / sizeof(BTPendingFSM);
maxbufsize = Min(maxbufsize, INT_MAX);
maxbufsize = Min(maxbufsize, MaxAllocSize / sizeof(BTPendingFSM));
/* Stay sane with small work_mem */
maxbufsize = Max(maxbufsize, vstate->bufsize);
vstate->maxbufsize = maxbufsize;

I have, however, found the following calling pattern in several places, including contrib/tablefunc/tablefunc.c, contrib/dblink/dblink.c, contrib/adminpack/adminpack.c, and also pgvector (albeit only in ivfscans):

tuplestore_begin_heap(random_access, false, work_mem);

This seems to be a data structure that holds tuples to be returned by a scan; it doesn't account for memory allocated elsewhere, though. Overall, the lack of enforcement seems to make checking these values somewhat uncommon. I think it makes sense to enforce maintenance_work_mem because building an index is relatively infrequent, but enforcing runtime checks for work_mem may be overkill.
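
For illustration, a hedged sketch of this pattern (not code from this PR; the helper name is hypothetical):

#include "postgres.h"
#include "miscadmin.h"          /* work_mem GUC */
#include "utils/tuplestore.h"

/* The tuplestore keeps up to work_mem kilobytes of tuples in memory and
 * transparently spills the rest to a temp file, so the cap applies only
 * to this one structure, not to memory allocated elsewhere. */
static Tuplestorestate *
begin_scan_output(bool random_access)
{
    return tuplestore_begin_heap(random_access, false, work_mem);
}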

@Ngalstyan4 (Contributor) left a comment


Looks good!

  • Could you reduce code duplication?
    It seems very similar code blocks are added in insert, scan, and build. If possible, we should move these checks into a helper function.

  • It seems the general strategy in this PR is to instrument memory allocation points with a conditional check for memory limits.

    1. Could we use CurrentMemoryContext to get a more accurate measure of used memory?
      (I think at any given point in time we can ask Postgres to tell us how much memory is allocated at CurrentMemoryContext; see the sketch after this list.)

    2. Is there a way to cap the maximum memory allowed in a memory context when we create the memory context? We create such contexts for builds, inserts, etc., and if we can enforce memory constraints there, we will have less code to maintain.
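
For point 1, a minimal sketch (assuming PG 13 or later, where MemoryContextMemAllocated() exists; the helper name is illustrative):

#include "postgres.h"
#include "utils/memutils.h"

/* Ask Postgres how many bytes are allocated under CurrentMemoryContext,
 * including child contexts (the second argument recurses). PG 13+ only. */
static Size
current_context_bytes(void)
{
#if PG_VERSION_NUM >= 130000
    return MemoryContextMemAllocated(CurrentMemoryContext, true);
#else
    return 0;                   /* no per-context accounting before PG 13 */
#endif
}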

src/hnsw/build.c Outdated
double M = ldb_HnswGetM(index);
double mL = 1 / log(M);
usearch_metadata_t meta = usearch_metadata(buildstate->usearch_index, &error);
uint32 node_size = UsearchNodeBytes(&meta, meta.dimensions * sizeof(float), (int)(mL + .5));

where is the mL+.5 term coming from?
I assume this is the expected node size, given the distribution in the paper. Could you write a note in the comment around it?
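
(A hedged guess at the connection, based on the HNSW paper's level distribution rather than anything stated in this PR: a node's level is drawn as l = floor(-ln(u) * mL) with u uniform on (0, 1); since E[-ln(u)] = 1, the expected level is mL, so (int)(mL + .5) rounds that expected level to the nearest integer.)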

src/hnsw/build.c Outdated
double M = ldb_HnswGetM(index);
double mL = 1 / log(M);
usearch_metadata_t meta = usearch_metadata(buildstate->usearch_index, &error);
uint32 node_size = UsearchNodeBytes(&meta, meta.dimensions * sizeof(float), (int)(mL + .5));

Note that once #19 is merged, sizeof(float) will no longer be correct, since with type casts vector elements can be smaller than sizeof(float). Could you add a note (todo:: update sizeof(float) to the correct vector size once #19 is merged) so we do not overlook it later?
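
The requested note might look like this (a sketch against the line quoted above):

// todo:: update sizeof(float) to the correct vector size once #19 is merged
uint32 node_size = UsearchNodeBytes(&meta, meta.dimensions * sizeof(float), (int)(mL + .5));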

src/hnsw/build.c Outdated
if(2 * usearch_size(buildstate->usearch_index, &error) * node_size
>= (size_t)maintenance_work_mem * 1024L) {
usearch_free(buildstate->usearch_index, &error);
elog(ERROR, "index size exceeded maintenance_work_mem during index construction");

For now, let's leave this as a warning.
We are carrying out fault tolerance tests now and we should throw errors like this once we are sure we handle them well.
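
Concretely, the suggested downgrade might look like this (a sketch of the same check quoted above with the severity lowered; since the build continues, the index is no longer freed):

if(2 * usearch_size(buildstate->usearch_index, &error) * node_size
   >= (size_t)maintenance_work_mem * 1024L) {
    elog(WARNING, "index size exceeded maintenance_work_mem during index construction");
}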

src/hnsw/build.c Outdated
uint32 node_size = UsearchNodeBytes(&meta, opts.dimensions * sizeof(float), (int)(mL + .5));
// accuracy could be improved by not rounding mL, but otherwise this will never be fully accurate
if(node_size * estimated_row_count > maintenance_work_mem * 1024L) {
elog(ERROR, "index size exceeded maintenance_work_mem during index construction");

Same as above, about the error.

src/hnsw/build.c Outdated
@@ -452,6 +463,14 @@ static void BuildIndex(
// Unlock and release buffer
UnlockReleaseBuffer(buffer);
}
double M = ldb_HnswGetM(index);
double mL = 1 / log(M);
usearch_metadata_t meta = usearch_metadata(buildstate->usearch_index, &error);

Is this the exact same code as above?
Can we factor it out into a Check__ function?

@ezra-varady (Collaborator, Author)

Rebased onto main. Broke out the code that checks memory use into a helper function in utils.c. I added code to include memory allocated in the current context and its children in these calculations for versions of Postgres greater than 12; before pg13 there's no clear way to check how much memory a single context has allocated. The call site in the node retriever will get called a lot, so it may be worth trying to optimize a bit around this.
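
A hedged sketch of what such a helper could look like (the name, message, and guard are illustrative, not the actual utils.c code):

#include "postgres.h"
#include "miscadmin.h"          /* maintenance_work_mem */
#include "utils/memutils.h"

/* Warn when memory allocated in the current context (and its children)
 * exceeds a limit given in kilobytes. Before PG 13 there is no
 * per-context accounting, so the check compiles to a no-op. */
static void
ldb_check_mem_limit(int limit_kb)
{
#if PG_VERSION_NUM >= 130000
    Size used = MemoryContextMemAllocated(CurrentMemoryContext, true);

    if(used >= (Size) limit_kb * 1024L)
        elog(WARNING, "memory use exceeds limit during index operation");
#else
    (void) limit_kb;
#endif
}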

@Ngalstyan4 (Contributor)

Looks good! Moved to #205. Will change one test, check code coverage, and merge from there.

@Ngalstyan4 Ngalstyan4 closed this Oct 15, 2023
@Ngalstyan4 Ngalstyan4 mentioned this pull request Oct 15, 2023