perf: use smallvec for keys #31

cchudant · 2024-05-03T15:08:18Z

I have a few things to note here and questions

-> I don't like the SByteVec name, but I don't have a better one
-> do we keep the BonsaiDb api unchanged, with Vec<u8>s?
if we do, we don't have to expose SByteVec to the api at all actually

-> smallvec => maybe just having type SByteVec = [u8; 32]; would be better but we would have to pad the data in some cases

for background, a standard vec basically a tuple (capacity, len, ptr), a smallvec is (capacity, len, union { inline: [u8; 32], inheap: ptr }) and they know if it's one of the two variants by checking if capacity >= 32
I dont think we would have that much of a difference in perf when fixing N to 32 for every key or even part of the api

-> I have changed some stuff like

let _: Vec<u8> = [
        id.to_bytes().as_slice(),
        &[KEY_SEPARATOR],
        key.as_slice(),
        &[key.into()],
        &[OLD_VALUE],
    ]
    .concat();

to iter based methods

let _: SByteVec = id.to_bytes().into_iter()
    .chain(iter::once(KEY_SEPARATOR))
    .chain(key.as_slice().iter().copied())
    .chain(iter::once(key.into()))
    .chain(iter::once(OLD_VALUE))
    .collect()

That kind of looks bad, using iterators like this for individual bytes may mean this doesn't lower to a bunch of optimized copies
especially as we are using smallvec which is out of std
I don't see it in my profiler, so I assume this is fine and llvm inlines everything correctly but I havent checked the assembly

-> there are still lots of copies when converting bitvecs to smallvecs which could be fixed
Optimizing these is not necessarily worth the effort it just bothers me a little

Results

The benches use HashMap db and they are handcrafted, they may not accurately translate to the real world

The drop storage bench is not really relevant for real world, i've just added it because on my profiler it shows libc free was actually one of the most time consuming functions of my binary

Other

There should only be very minor conflicts between this PR and #30

cchudant · 2024-05-03T15:11:37Z

@AurelienFT

AurelienFT

If we change the APIs like this we have to edit all the examples and I think it might be less usable. Can we keep Vec and convert them directly ?
Otherwise i'm ok with the PR.

cchudant · 2024-05-07T11:18:55Z

If we change the APIs like this we have to edit all the examples and I think it might be less usable. Can we keep Vec and convert them directly ? Otherwise i'm ok with the PR.

The only API that is changed here is the BonsaiDb trait, you only implement that trait if the default Rocksdb impl isnt sufficient for your usecase. I think it's ok to have SByteVec here

To be fair, this is probably not worth making the api more complex but I kind of aspire to have absolutely no useless copies in the whole crate

That aspiration probably clashes at times with the goal of making the crate as simple as possible ahah

@AurelienFT I think I'll remove the api change for now it can always be added later

AurelienFT · 2024-05-07T12:06:08Z

The only API that is changed here is the BonsaiDb trait, you only implement that trait if the default Rocksdb impl isnt sufficient for your usecase. I think it's ok to have SByteVec here

It also change the API of the main structure for get_keys etc

AurelienFT · 2024-05-07T12:07:14Z

@AurelienFT I think I'll remove the api change for now it can always be added later

Yeah I think also

cchudant force-pushed the smallvec branch from 183dcef to 18e5cc7 Compare May 7, 2024 07:50

cchudant added 3 commits May 7, 2024 07:52

perf: use smallvec for keys

f2dea32

benches: benchmark for storage drop

afc5a7c

smallvec: export SByteVec to the api

75d0f27

cchudant force-pushed the smallvec branch from 18e5cc7 to 75d0f27 Compare May 7, 2024 07:53

AurelienFT approved these changes May 7, 2024

View reviewed changes

AurelienFT requested changes May 7, 2024

View reviewed changes

cchudant added 3 commits May 20, 2024 09:23

cleanup: SByteVec => ByteVec and remove from BonsaiStorage api

f831248

cleanup: clippy

be8e1a8

fix: ci testing & bug introduced in previous PR (sorry :p)

0128877

cchudant mentioned this pull request Sep 16, 2024

Fix inserts remove leaks #34

Merged

cchudant merged commit 0128877 into madara-alliance:oss Sep 17, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

perf: use smallvec for keys #31

perf: use smallvec for keys #31

cchudant commented May 3, 2024

cchudant commented May 3, 2024

AurelienFT left a comment

cchudant commented May 7, 2024

AurelienFT commented May 7, 2024

AurelienFT commented May 7, 2024

perf: use smallvec for keys #31

perf: use smallvec for keys #31

Conversation

cchudant commented May 3, 2024

Results

Other

cchudant commented May 3, 2024

AurelienFT left a comment

Choose a reason for hiding this comment

cchudant commented May 7, 2024

AurelienFT commented May 7, 2024

AurelienFT commented May 7, 2024