uv_append: Initialize barrier upon creation. #462

MathieuBordere · 2023-07-27T10:31:58Z

Fixes #461.

barrier->blocking was not initialized, resulting in barrier->blocking being true in most of the cases, causing the effect seen in the issue.

The tests didn't catch the bug as the barriers are properly initialized in the tests.

edit: not going to spend time on test for now.
~~Will still add a test that uses a slow UvSnapshot call and ensures append requests get through.~~

Signed-off-by: Mathieu Borderé <[email protected]>

MathieuBordere · 2023-07-27T11:20:51Z

We haven't released anything since then, but I'd rather merge this quickly as LXD builds from master, will just merge it, looks like an obvious improvement.

cole-miller · 2023-07-27T13:47:53Z

Wow, this is a nasty bug!

MathieuBordere · 2023-07-27T14:32:47Z

Wow, this is a nasty bug!

yeah :-(

freeekanayaka · 2023-07-27T15:59:04Z

Wow, this is a nasty bug!

yeah :-(

Was the snapshot throttling experiment run with the un-fixed code? IOW does this bug explain why latency increased instead of decreasing? Or is there something else yet to be investigated?

MathieuBordere · 2023-07-27T16:31:42Z

Wow, this is a nasty bug!

yeah :-(

Was the snapshot throttling experiment run with the un-fixed code? IOW does this bug explain why latency increased instead of decreasing? Or is there something else yet to be investigated?

I ran those tests on June 21st, which predates the work that introduced that bug. I'll revisit the throttling anyway as I want to understand the mechanism behind the phenomenon and double-check I didn't mess something up during the test :-)

cole-miller · 2023-07-27T16:33:21Z

Mathieu re-ran the benchmark against a raft commit that wasn't affected by this issue, and confirmed that tail latency for client requests improves when snapshotting does less I/O.

MathieuBordere · 2023-07-27T16:34:29Z

Mathieu re-ran the benchmark against a raft commit that wasn't affected by this issue, and confirmed that tail latency for client requests improves when snapshotting does less I/O.

yes, but I think Free is talking about the throttling of the snapshot write.

freeekanayaka · 2023-07-28T08:58:55Z

Mathieu re-ran the benchmark against a raft commit that wasn't affected by this issue, and confirmed that tail latency for client requests improves when snapshotting does less I/O.

@cole-miller @MathieuBordere btw even after our emails and conversations I still don't know what the real-world problem being solved is, and/or the details about possible performance issues that Canonical is hitting with dqlite (e.g. which application has issues? what are these issues exactly? was the underlying problem/cause unequivocally identified and measured?).

However, assuming that Canonical has an application for which:

The dataset of the application can reasonably fit in memory, for all use cases of the application now and in the foreseeable future.
The issue being observed is that, when snapshotting, request latency increases to an unacceptable level for the application.
When snapshotting is disabled altogether (no snapshots are taken at all), latency always remains within acceptable levels for the application.

then one would reasonably expect that throttling the snapshot I/O to a low enough rate should be enough to maintain latency within acceptable values, even without implementing incremental snapshots in order to reduce the total amount of I/O being performed (vs just its rate). Compression via lz4 should probably be disabled in order to reduce CPU load, as already mentioned by Mathieu IIRC.

This expectation is based on the fact that I/O should be the only resource that becomes contended when taking a snapshot without compression, while CPU and memory should be fine (I believe the only cost involved there is a memory copy of the WAL, since the database pages are not being copied anymore these days after canonical/dqlite#356). Or are there other impactful costs, e.g. checkpoint-related?

If assumption 1. is not true, one would intuitively look at a disk-based dqlite FSM as a solution. However, in case of really large datasets, dqlite might not be an appropriate database choice anymore considering that:

it's single writer (that might prevent scaling, at least for some use cases)
automatic role management would mean moving a lot of data between nodes anyway

unless of course one plans to change/fix those two aspects too.

MathieuBordere · 2023-07-28T14:08:02Z

Wow, this is a nasty bug!

yeah :-(

Was the snapshot throttling experiment run with the un-fixed code? IOW does this bug explain why latency increased instead of decreasing? Or is there something else yet to be investigated?

I ran those tests on June 21st, which predates the work that introduced that bug. I'll revisit the throttling anyway as I want to understand the mechanism behind the phenomenon and double-check I didn't mess something up during the test :-)

Will rerun the throttling tests with code from this tree https://github.com/MathieuBordere/raft/tree/throttle It's hacked together, so don't pay too much attention to the quality ...

freeekanayaka · 2023-07-28T15:43:00Z

Wow, this is a nasty bug!

yeah :-(

Was the snapshot throttling experiment run with the un-fixed code? IOW does this bug explain why latency increased instead of decreasing? Or is there something else yet to be investigated?

I ran those tests on June 21st, which predates the work that introduced that bug. I'll revisit the throttling anyway as I want to understand the mechanism behind the phenomenon and double-check I didn't mess something up during the test :-)

Will rerun the throttling tests with code from this tree https://github.com/MathieuBordere/raft/tree/throttle It's hacked together, so don't pay too much attention to the quality ...

Thanks, didn't look at the details but looks good. It'd be interesting to experiment a bit with different values of sleep time and batch size, to see how they affect latency.

Expectation would be of course that the smaller the batch size and the higher the sleep time, the lower will be the latency. If results don't match the expectation and latency is perfectly stable without snapshots at all, then there's probably something else going on that needs investigation and understanding.

uv_append: Initialize barrier upon creation.

36c2388

Signed-off-by: Mathieu Borderé <[email protected]>

MathieuBordere requested a review from cole-miller July 27, 2023 10:53

MathieuBordere merged commit 8ba37b3 into canonical:master Jul 27, 2023
17 checks passed

MathieuBordere deleted the init-barrier branch July 27, 2023 14:35

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

uv_append: Initialize barrier upon creation. #462

uv_append: Initialize barrier upon creation. #462

MathieuBordere commented Jul 27, 2023 •

edited

Loading

MathieuBordere commented Jul 27, 2023 •

edited

Loading

cole-miller commented Jul 27, 2023

MathieuBordere commented Jul 27, 2023

freeekanayaka commented Jul 27, 2023

MathieuBordere commented Jul 27, 2023

cole-miller commented Jul 27, 2023

MathieuBordere commented Jul 27, 2023 •

edited

Loading

freeekanayaka commented Jul 28, 2023

MathieuBordere commented Jul 28, 2023 •

edited

Loading

freeekanayaka commented Jul 28, 2023

uv_append: Initialize barrier upon creation. #462

uv_append: Initialize barrier upon creation. #462

Conversation

MathieuBordere commented Jul 27, 2023 • edited Loading

MathieuBordere commented Jul 27, 2023 • edited Loading

cole-miller commented Jul 27, 2023

MathieuBordere commented Jul 27, 2023

freeekanayaka commented Jul 27, 2023

MathieuBordere commented Jul 27, 2023

cole-miller commented Jul 27, 2023

MathieuBordere commented Jul 27, 2023 • edited Loading

freeekanayaka commented Jul 28, 2023

MathieuBordere commented Jul 28, 2023 • edited Loading

freeekanayaka commented Jul 28, 2023

MathieuBordere commented Jul 27, 2023 •

edited

Loading

MathieuBordere commented Jul 27, 2023 •

edited

Loading

MathieuBordere commented Jul 27, 2023 •

edited

Loading

MathieuBordere commented Jul 28, 2023 •

edited

Loading