Commit

2% sample actually or 4k
Baunsgaard committed Oct 23, 2024
1 parent ecbc907 commit 097f649
Showing 1 changed file with 1 addition and 1 deletion.
@@ -533,7 +533,7 @@ private <T> void estimateRCDMapSize(ColumnEncoderComposite c) {
return;
}
// 2% sample or max 3000
-int sampleSize = Math.min(in.getNumRows() / 2, 3000);
+int sampleSize = Math.min(in.getNumRows() / 50, 4096);
// Find the frequencies of distinct values in the sample
Map<T, Integer> distinctFreq = new HashMap<>();
Array<T> col = (Array<T>) in.getColumn(c._colID-1);
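
Note on the change: dividing by 2 took a 50% sample (capped at 3000), while dividing by 50 takes the 2% sample the comment describes, now capped at 4096. The unchanged context comment still reads "max 3000". The following is a minimal sketch of the arithmetic only. The class name `SampleSizeSketch` and the two helper methods are hypothetical and are not part of the commit; they just reproduce the two expressions from the hunk.

```java
public class SampleSizeSketch {
    // Expression removed by the commit: half of the rows, capped at 3000.
    static int oldSampleSize(int numRows) {
        return Math.min(numRows / 2, 3000);
    }

    // Expression added by the commit: 1/50 (2%) of the rows, capped at 4096.
    static int newSampleSize(int numRows) {
        return Math.min(numRows / 50, 4096);
    }

    public static void main(String[] args) {
        int[] rowCounts = {1_000, 100_000, 1_000_000};
        for (int n : rowCounts) {
            System.out.printf("rows=%d old=%d new=%d%n",
                n, oldSampleSize(n), newSampleSize(n));
        }
        // prints:
        // rows=1000 old=500 new=20
        // rows=100000 old=3000 new=2000
        // rows=1000000 old=3000 new=4096
    }
}
```

Because this is integer division, an input with fewer than 50 rows yields a sample size of 0 under the new expression. Whether the surrounding method guards against that is not visible in this hunk.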

0 comments on commit 097f649