higher qps with similar recall on larger datasets #575

Open
TyangJN opened this issue Jul 8, 2024 · 1 comment

TyangJN commented Jul 8, 2024

@yurymalkov Thanks for your great work.

I'd be really grateful if you could tell me:

I ran HNSW on the 1M SIFT dataset with M=32, efc=200, efs=256, which gave good QPS and recall.

But when I moved to the 15M SIFT dataset, the QPS dropped sharply.

Which parameters give the best QPS on larger datasets, and how can I tune the parameter settings for them?
Thanks a lot.
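
For anyone tuning along these lines, here is a minimal sketch of sweeping the search-time `ef` (efs) value and recording recall and QPS at each setting. It is not taken from the library. `set_ef` and `search` are hypothetical callables that you would wire to your own index, and `truth` is assumed to hold the exact nearest-neighbour ids for each query.

```python
import time

def recall_at_k(found, truth, k):
    """Fraction of the true top-k neighbours that appear in the returned top-k."""
    hits = sum(len(set(f[:k]) & set(t[:k])) for f, t in zip(found, truth))
    return hits / (k * len(truth))

def sweep_ef(set_ef, search, queries, truth, ef_values, k=10):
    """Return a list of (ef, recall, qps) tuples, one per ef value.

    set_ef(ef)         -- hypothetical hook that sets the search-time ef
    search(queries, k) -- hypothetical hook returning one id list per query
    """
    rows = []
    for ef in ef_values:
        set_ef(ef)
        start = time.perf_counter()
        found = search(queries, k)
        elapsed = max(time.perf_counter() - start, 1e-9)  # guard against zero
        rows.append((ef, recall_at_k(found, truth, k), len(queries) / elapsed))
    return rows
```

Comparing the smallest `ef` that reaches a target recall on the 1M and 15M sets can help separate "the larger index needs a higher efs" from "each query is slower at the same efs".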

@searchivarius
Member

Hi @TyangJN, does "rapidly" mean that going from, say, 14M to 15M you see a sudden large (e.g., 2-3x) drop?

How much memory and L3 cache do you have, and how many CPU cores? I assume you use all of them for querying. One test to run is to measure QPS with fewer threads, even just one. Do you still see a sharp decrease in QPS when you use a single thread?
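
The thread-count test suggested above could be sketched roughly as follows. This is an illustration, not code from the library. `search` is a hypothetical callable that runs the whole query batch with a given number of threads, and `scaling_efficiency` assumes a single-thread run is included.

```python
import time

def qps_by_threads(search, queries, thread_counts, repeats=3):
    """Measure QPS for each thread count, keeping the fastest of `repeats` runs.

    search(queries, num_threads) -- hypothetical hook that runs the full batch
    """
    results = {}
    for n in thread_counts:
        best = float("inf")
        for _ in range(repeats):
            start = time.perf_counter()
            search(queries, n)
            best = min(best, time.perf_counter() - start)
        results[n] = len(queries) / max(best, 1e-9)  # guard against zero
    return results

def scaling_efficiency(results):
    """QPS relative to perfect linear scaling from the single-thread run."""
    base = results[1]  # requires that thread count 1 was measured
    return {n: qps / (n * base) for n, qps in results.items()}
```

With the hnswlib Python bindings, the hook would presumably be something like `lambda q, n: index.knn_query(q, k=10, num_threads=n)`. If single-thread QPS also falls sharply between the 1M and 15M indexes, the slowdown is per query (for example, cache misses on a larger graph) rather than contention between threads.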
