
IVFFLAT QPS too low

Asked by Fresh, over 1 year ago (Mar 14, 2026). Confidence score: 86%

Problem

I am using IVFFlat for 1200-dimensional embeddings of `vector` type, over 20 million rows. The slow query takes a user-provided vector and finds the top 100 matching vectors. The index has 4,200 lists and I use 10 probes. The query takes 30 seconds when it's cold and 100 ms if it's a repeat query. I confirmed with EXPLAIN that the index is being used.

The issue is very slow I/O: all of the query time is spent reading in the blocks that are buffer misses during the `Index Scan` operation, at a throughput of about 6 MB/s. The hardware configuration is an r5.2xlarge on AWS RDS. Here is an example `EXPLAIN (ANALYZE, BUFFERS)` result:

[code block]

Given that the I/O should be hitting SSD, I am puzzled by the extremely low throughput. What am I doing wrong? Should I be using HNSW? Is it a big performance issue that most of my rows are stored as TOAST?
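As a sanity check on the numbers in the question, a short back-of-envelope sketch (the 4-byte float32 component size matches how pgvector stores `vector` values; the 8-byte per-vector header is an assumption):

```python
# Back-of-envelope check on the reported scan cost. pgvector stores each
# vector as 4-byte float32 components plus a small header; the 8-byte
# per-vector overhead used here is an assumption.

DIMS = 1200
ROWS = 20_000_000
LISTS = 4200
PROBES = 10
OBSERVED_BPS = 6e6              # ~6 MB/s of mostly-random reads

bytes_per_vector = DIMS * 4 + 8                   # 4,808 bytes
vectors_scanned = ROWS / LISTS * PROBES           # ~47,600 vectors per query
working_set = vectors_scanned * bytes_per_vector  # ~229 MB touched when cold
cold_seconds = working_set / OBSERVED_BPS         # ~38 s

print(round(vectors_scanned), round(working_set / 1e6), round(cold_seconds))
# → 47619 229 38
```

At ~6 MB/s, reading a ~229 MB working set cold takes tens of seconds, in the same ballpark as the observed 30 s, so the arithmetic is consistent with slow random I/O rather than a broken index.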


Canonical Fix · 84% confidence · 100% success rate · 3 verifications · Last verified Mar 14, 2026

Solution: IVFFLAT QPS too low

Low Risk


@bantmen An r5.2xlarge has 64 GB of RAM, and if you're using the PostgreSQL defaults, this means 16 GB is allocated for shared buffers.

An IVFFlat index for this dataset would be ~89 GiB in size (excluding the size of the data in the table itself), so your entire index won't fit into memory. Additionally, the data access pattern for IVFFlat can be oversimplified as "random": using your description above, out of 4,200 centers you're trying to find the 10 closest to a query vector. Amongst those centroids, you're looking at ~48K vectors, which is ~220 MB of data. Given each query could lead to a completely different set of 10 out of the 4,200 centers, you could end up in a situation where you're swapping data in and out of memory fairly often.

Could HNSW help? In this case, you're likely keeping the top layers of your graph in memory, and while the lower layers may involve more fetches to disk, the total is likely to be lower. For example, if you're using `hnsw.ef_search` at the default setting (40), you'll likely scan far fewer vectors (without seeing empirical data around your embedding model, I can't give an estimated convergence). I can't comment on whether you'll meet your recall target without more information.
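The ~89 GiB figure in the answer can be reproduced with a quick sketch (the 8-byte per-vector overhead is an assumption; real index tuples carry additional per-tuple overhead, so this is a lower bound):

```python
# Why the index can't stay cached: total IVFFlat index footprint vs. the
# default shared_buffers on an r5.2xlarge (64 GB RAM, ~25% for shared
# buffers). The 8-byte per-vector overhead is an assumption.

DIMS = 1200
ROWS = 20_000_000
RAM_GIB = 64

index_gib = ROWS * (DIMS * 4 + 8) / 2**30    # ~89.6 GiB of vector data
shared_buffers_gib = RAM_GIB * 0.25          # 16 GiB by the common 25% rule
cacheable = shared_buffers_gib / index_gib   # fraction of index that fits

print(round(index_gib, 1), f"{cacheable:.0%}")
# → 89.6 18%
```

With only ~18% of the index fitting in shared buffers and each query touching a near-random 10 of the 4,200 lists, most cold probes miss the cache, which matches the buffer-miss pattern in the question's `EXPLAIN (ANALYZE, BUFFERS)` output.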

Validation

Resolved in pgvector/pgvector GitHub issue #661. Community reactions: 2 upvotes.
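For anyone exploring the HNSW route discussed above, a minimal migration sketch, assuming a hypothetical `items` table with an `embedding vector(1200)` column (the `m`/`ef_construction` values shown are pgvector's defaults, not tuned recommendations, and the opclass should match whichever distance operator your query uses):

```sql
-- Build an HNSW index instead of IVFFlat; 1200 dimensions is within
-- pgvector's 2000-dimension limit for indexing the vector type.
CREATE INDEX ON items USING hnsw (embedding vector_l2_ops)
    WITH (m = 16, ef_construction = 64);

-- Per-session search breadth; 40 is the default mentioned in the answer.
-- Raise it to trade query time for recall.
SET hnsw.ef_search = 40;
```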


Submitted by: Alex Chen (2,450 rep)

Tags: pgvector, embeddings, vector-search