Skip to content

Conversation

@zeroshade
Copy link
Member

Rationale for this change

As with many other parquet reader/writers we should add support for bloom filters.

What changes are included in this PR?

This only adds an implementation to the metadata package to represent bloom filters and process them for metadata reading and writing. This does not yet wire it up through the actual parquet file reader and writer. That will be done in a subsequent PR.

Are these changes tested?

Yes, unit tests are included.

Are there any user-facing changes?

Only the addition of the new functions that are exposed in the metadata package.

@zeroshade zeroshade requested review from kou, lidavidm and wgtmac March 28, 2025 20:59
Copy link
Member

@lidavidm lidavidm left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Some initial comments. I have yet to look at the bloom filter itself

@zeroshade zeroshade merged commit 6576e9c into apache:main Apr 1, 2025
23 checks passed
@zeroshade zeroshade deleted the add-bloom-filters branch April 1, 2025 14:51
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants