Update big5 workload config to use the ordered documents snapshot #16005

Merged 1 commit on Sep 19, 2024

Conversation

rishabh6788
Contributor

Description

The big5 workload uses 8 ingestion clients by default to ingest the data.
During our analysis we found that the timestamp order of the data is not maintained during ingestion, which skews the latencies of queries that use time filters.

We have re-created the big5 snapshots after ingesting with a single client to maintain the timestamp order of the
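
To illustrate why the client count matters, here is a minimal Python sketch of the ordering effect described above. It is not the benchmark tool's actual code. It assumes the corpus is split into contiguous partitions, one per client, and that the clients take turns sending one document each; `ingest_order` and `is_time_ordered` are hypothetical helper names used only for this example.

```python
from itertools import zip_longest


def ingest_order(timestamps, clients):
    """Return the order in which documents would reach the index.

    Assumption (illustrative only): the corpus is split into `clients`
    contiguous partitions and the clients send one document per turn
    in round-robin fashion.
    """
    size = -(-len(timestamps) // clients)  # ceiling division
    partitions = [timestamps[i:i + size] for i in range(0, len(timestamps), size)]
    arrived = []
    # Each turn takes the next document from every partition that still has one.
    for batch in zip_longest(*partitions):
        arrived.extend(ts for ts in batch if ts is not None)
    return arrived


def is_time_ordered(seq):
    """True if every timestamp is >= the one before it."""
    return all(a <= b for a, b in zip(seq, seq[1:]))


corpus = list(range(16))  # stand-in timestamps, already sorted

print(is_time_ordered(ingest_order(corpus, clients=1)))  # True
print(is_time_ordered(ingest_order(corpus, clients=8)))  # False
```

With one client the arrival order matches the source order; with eight, documents from later partitions land between earlier ones, so a time-range filter no longer maps to a contiguous run of documents.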

Related Issues

Resolves #[Issue number to be closed when this PR is merged]

Check List

  • [ ] Functionality includes testing.
  • [ ] API changes companion pull request created, if applicable.
  • [ ] Public documentation issue/PR created, if applicable.

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.
For more information on following Developer Certificate of Origin and signing off your commits, please check here.

@rishabh6788 rishabh6788 merged commit 94222f1 into opensearch-project:main Sep 19, 2024
37 of 38 checks passed
@peternied
Member

@rishabh6788 Can you check whether this needs to be backfilled to the 2.x branches?

@rishabh6788
Contributor Author

Not required, as workflows that are triggered when a comment is added run only for the default branch, which is main in our case.
