Limit HNSW build's shared memory size for small tables

Mar 14, 2026

Problem

It occurs to me that it's silly that we allocate an area based on `maintenance_work_mem`, even if the table is small. For example, if `maintenance_work_mem` is set to 10 GB and the table has 1000 rows, clearly we don't need to allocate 10 GB. There's some precedent for this in parallel VACUUM: it allocates an array to hold the TIDs of dead tuples based on `maintenance_work_mem`, but it applies an upper limit derived from the worst-case assumption that every page is full of tiny tuples, all deleted (see `dead_items_max_items`). It'd be nice to do something similar in the HNSW build.
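The parallel VACUUM precedent described above can be sketched as follows. This is a hypothetical illustration of the capping idea, not PostgreSQL's actual code; `MAX_TUPLES_PER_PAGE` and `BYTES_PER_ELEMENT` are assumed values for the example:

```javascript
// Hypothetical sketch of the dead_items_max_items idea applied to HNSW build.
// The constants below are illustrative assumptions, not PostgreSQL's values.
const MAX_TUPLES_PER_PAGE = 291; // worst case: a page full of tiny tuples
const BYTES_PER_ELEMENT = 64;    // assumed per-element cost in the build area

function cappedBuildMemory(maintenanceWorkMem, tablePages) {
  // Never allocate more than the worst case the table could possibly need.
  const worstCaseBytes = tablePages * MAX_TUPLES_PER_PAGE * BYTES_PER_ELEMENT;
  return Math.min(maintenanceWorkMem, worstCaseBytes);
}

// A 5-page table never needs 10 GB:
console.log(cappedBuildMemory(10 * 1024 ** 3, 5)); // 93120
```

The key point is that the cap is computed from the table's physical size, so a huge `maintenance_work_mem` setting is harmless for small tables while large tables still get the full budget.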

Implement Upper Limit on HNSW Build Memory Allocation

Medium Risk

The current HNSW build implementation allocates memory based solely on the `maintenance_work_mem` setting, which can lead to excessive memory usage for small tables. Because the allocation does not account for the actual size of the data being processed, a small table can trigger a far larger allocation than it could ever use, wasting resources.


  1. Define Maximum Memory Allocation

    Introduce an upper limit on memory allocation for HNSW builds, derived from the number of rows in the table. This ensures that even if `maintenance_work_mem` is set high, the actual memory used is capped appropriately for small tables.

    javascript
    // Illustrative sketch, not pgvector's actual C code:
    const MAX_MEMORY_PER_ROW = 1024; // assumed worst-case bytes needed per row
    // Cap the allocation at the smaller of maintenance_work_mem and the
    // table's per-row worst case.
    const maxMemory = Math.min(maintenance_work_mem, numberOfRows * MAX_MEMORY_PER_ROW);
  2. Modify HNSW Build Function

    Update the HNSW build function to use the newly defined cap instead of `maintenance_work_mem` directly. This involves changing the memory allocation logic to reference the calculated `maxMemory`.

    javascript
    function buildHNSW(data) {
        const numberOfRows = data.length;
        // Cap the allocation rather than using maintenance_work_mem directly.
        const memoryToAllocate = Math.min(maintenance_work_mem, numberOfRows * MAX_MEMORY_PER_ROW);
        allocateMemory(memoryToAllocate);
        // ... proceed with the HNSW build using the allocated memory ...
        return memoryToAllocate; // returned so callers and tests can inspect the cap
    }
  3. Test Memory Allocation Logic

    Create unit tests to validate that the memory allocation logic correctly caps the memory usage for various table sizes. Ensure that for small tables, the memory allocated does not exceed the defined limits.

    javascript
    describe('HNSW Memory Allocation', () => {
        it('should allocate capped memory for small tables', () => {
            const smallTableRows = 1000;
            // buildHNSW expects the table data, so pass an array of that length.
            const allocatedMemory = buildHNSW(new Array(smallTableRows));
            expect(allocatedMemory).toBeLessThanOrEqual(smallTableRows * MAX_MEMORY_PER_ROW);
        });
    });
  4. Update Documentation

    Revise the documentation to cover the new memory allocation strategy for HNSW builds. Clearly describe the new parameter and how it interacts with `maintenance_work_mem`, so users understand the change and can tune their configurations.

Validation

To confirm the fix works, run HNSW builds on tables of varying sizes and monitor memory usage. Verify that the allocation does not exceed the defined cap for small tables, and that build performance does not regress for large tables.
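The allocation side of that check can be sketched as a small harness; the names and values here are illustrative, reusing the hypothetical per-row cap from the steps above:

```javascript
// Hypothetical validation harness: verify the planned allocation never
// exceeds either maintenance_work_mem or the per-row cap.
const MAINTENANCE_WORK_MEM = 10 * 1024 ** 3; // 10 GB, as in the example above
const MAX_MEMORY_PER_ROW = 1024;             // assumed bytes per row

function plannedAllocation(rows) {
  return Math.min(MAINTENANCE_WORK_MEM, rows * MAX_MEMORY_PER_ROW);
}

for (const rows of [1000, 100000, 100000000]) {
  const alloc = plannedAllocation(rows);
  if (alloc > MAINTENANCE_WORK_MEM || alloc > rows * MAX_MEMORY_PER_ROW) {
    throw new Error(`allocation for ${rows} rows exceeds its cap`);
  }
  console.log(`${rows} rows -> ${alloc} bytes`);
}
```

In a real validation run the planned allocation would come from the build itself (e.g. via logging), not from recomputing the formula.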


Submitted by Alex Chen

Tags: pgvector, embeddings, vector-search