@zeroshade
Member

Rationale for this change

Support for reading and writing bloom filters in Parquet files is important because it lets consumers skip row groups, column chunks, and other data that cannot contain the value they are looking for.
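
To make the rationale concrete, here is a minimal, self-contained sketch of how a reader might use per-row-group bloom filters to decide which row groups are worth scanning. It uses only the Go standard library. Every name in it (`bloomFilter`, `rowGroup`, `candidateRowGroups`) is hypothetical and is not the API added by this PR, and the toy bit-array filter is not the split-block format that Parquet specifies.

```go
package main

import (
	"fmt"
	"hash/fnv"
)

// bloomFilter is a toy filter for illustration only. It is NOT the
// split-block bloom filter defined by the Parquet format.
type bloomFilter struct {
	bits []bool
	k    int // number of bit positions probed per value
}

func newBloomFilter(m, k int) *bloomFilter {
	return &bloomFilter{bits: make([]bool, m), k: k}
}

// positions derives k bit positions from the two 32-bit halves of a
// single FNV-1a hash (double hashing).
func (b *bloomFilter) positions(v string) []int {
	h := fnv.New64a()
	h.Write([]byte(v))
	sum := h.Sum64()
	h1, h2 := uint32(sum), uint32(sum>>32)|1
	out := make([]int, b.k)
	for i := range out {
		out[i] = int((h1 + uint32(i)*h2) % uint32(len(b.bits)))
	}
	return out
}

func (b *bloomFilter) insert(v string) {
	for _, p := range b.positions(v) {
		b.bits[p] = true
	}
}

// mightContain returns false only if v was definitely never inserted.
// A true result can be a false positive.
func (b *bloomFilter) mightContain(v string) bool {
	for _, p := range b.positions(v) {
		if !b.bits[p] {
			return false
		}
	}
	return true
}

// rowGroup is a hypothetical stand-in for one row group of a column.
type rowGroup struct {
	values []string
	filter *bloomFilter
}

func newRowGroup(values ...string) rowGroup {
	f := newBloomFilter(1024, 4)
	for _, v := range values {
		f.insert(v)
	}
	return rowGroup{values: values, filter: f}
}

// candidateRowGroups returns the indexes of row groups that may contain v.
// Row groups whose filter rules v out can be skipped without reading them.
func candidateRowGroups(groups []rowGroup, v string) []int {
	var out []int
	for i, g := range groups {
		if g.filter.mightContain(v) {
			out = append(out, i)
		}
	}
	return out
}

func main() {
	groups := []rowGroup{
		newRowGroup("alice", "bob"),
		newRowGroup("carol", "dave"),
	}
	fmt.Println("row groups to scan for carol:", candidateRowGroups(groups, "carol"))
}
```

The property that matters is that the filter never produces a false negative: a row group that contains the value is always returned as a candidate, so skipping the others is safe. False positives only cost an unnecessary read.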

What changes are included in this PR?

Adds functionality to the Writer and Reader for writing bloom filters to, and reading them from, Parquet files directly. Also adds write properties that control whether and how bloom filters are written.

Are these changes tested?

Yes. Unit tests are added.

Are there any user-facing changes?

Only the newly available functions.

@zeroshade zeroshade merged commit 6637105 into apache:main Apr 2, 2025
23 checks passed
@zeroshade zeroshade deleted the read-write-bloomfilter branch April 2, 2025 15:33