Skip to content

Conversation

@zeroshade
Copy link
Member

Rationale for this change

As suggested by #278 (comment) allowing the RecordReader from the pqarrow package to also leverage the SeekToRow functionality to skip records from a parquet file while respected skipping particular row groups.

What changes are included in this PR?

Implementing a SeekToRow method for pqarrow.RecordReader to seek the record reader to a specific row where the next read will start from.

Are these changes tested?

Yes, unit tests are added for this.

Are there any user-facing changes?

Just the new functions.

// the ref count is 0.
Release()

SeekToRow(int64) error
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Should we add docstrings for new methods?

@zeroshade zeroshade merged commit b419b81 into apache:main Mar 21, 2025
8 checks passed
@zeroshade zeroshade deleted the seek-to-row-record-reader branch March 21, 2025 15:37
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants