up chunk sizes #62

Merged
merged 3 commits into from
May 12, 2024

Conversation

gillins
Member

@gillins gillins commented Mar 6, 2024

@petebunting is this what you understood from that presentation?

I haven't done any testing with it yet....

cc: @neilflood

@petebunting
Collaborator

Hi @gillins

Yes, I think this is what I understood from the presentation.

Will be interesting to see what difference it makes.

@gillins
Member Author

gillins commented Apr 22, 2024

@petebunting just to close the loop on this. Turns out setting paged aggregation really increases the file size:

| Configuration | Size of file |
| --- | --- |
| Current kealib | 736m |
| These Changes | 715m |
| This plus pageSize=2048 | 952m |
| This plus pageSize=4096 | 1.7G |
| This plus pageSize=8192 | 3.9G |

So I think it's worth changing the chunk size (which is all that this PR now does), but let's not bother with the paged aggregation. It increases the file size by quite a lot (probably because we have lots of small datasets in a KEA file, i.e. each block), and it also breaks the API.

With just changing the chunk size we do get smaller files.

What do you think?
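
To put the reported sizes on a common scale, here is a minimal Python sketch that expresses each figure relative to the current kealib file. It assumes that "m" and "G" are binary units (MiB and GiB, as printed by tools like `ls -lh`), which the comment does not state; the helper names `to_mib` and `relative_to_baseline` are made up for illustration.

```python
# Sizes exactly as reported in the comment above.
# ASSUMPTION: "m" means MiB and "G" means GiB (1 G = 1024 m).
UNIT_MIB = {"m": 1, "G": 1024}

reported = {
    "Current kealib": "736m",
    "These Changes": "715m",
    "This plus pageSize=2048": "952m",
    "This plus pageSize=4096": "1.7G",
    "This plus pageSize=8192": "3.9G",
}


def to_mib(text):
    """Convert a string such as '736m' or '1.7G' to a size in MiB."""
    return float(text[:-1]) * UNIT_MIB[text[-1]]


def relative_to_baseline(sizes, baseline="Current kealib"):
    """Return each size as a multiple of the baseline entry."""
    base = to_mib(sizes[baseline])
    return {name: to_mib(value) / base for name, value in sizes.items()}


ratios = relative_to_baseline(reported)
for name, ratio in ratios.items():
    # Print the multiple of the baseline and the percentage change.
    print(f"{name:<26} {ratio:5.2f}x  ({(ratio - 1) * 100:+.0f}%)")
```

Under that unit assumption, the chunk size change alone comes out about 3% smaller than current kealib, while the three page sizes come out at roughly 1.3, 2.4 and 5.4 times the current size.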

@gillins gillins marked this pull request as ready for review April 22, 2024 03:50
@gillins gillins changed the title from "up chunk sizes and move to paged aggregation" to "up chunk sizes" Apr 22, 2024
@petebunting
Collaborator

Hi @gillins

Yes, that sounds like a good way forward. I don't think we want file sizes growing that significantly.

@gillins gillins merged commit f386837 into ubarsc:master May 12, 2024
2 checks passed
@gillins gillins deleted the paged_aggregation branch May 12, 2024 23:16