Skip to content

Avoid ConfigOptions in IR nodes#19186

Merged
rapids-bot[bot] merged 12 commits intorapidsai:branch-25.08from
TomAugspurger:tom/ir-config-options
Jun 24, 2025
Merged

Avoid ConfigOptions in IR nodes#19186
rapids-bot[bot] merged 12 commits intorapidsai:branch-25.08from
TomAugspurger:tom/ir-config-options

Conversation

@TomAugspurger
Copy link
Copy Markdown
Contributor

@TomAugspurger TomAugspurger commented Jun 17, 2025

This favors putting simpler objects (e.g.
config_options.executor.max_rows_per_partition) in IR nodes rather than large, complicated objects like config_options. This makes understanding what options actually affect the IR node easier, at the cost of making constructing the objects a bit more complicated.

This favors putting simpler objects (e.g.
`config_options.executor.max_rows_per_partition`) in IR nodes rather
than large, complicated objects. This makes understanding what options
actually affect the IR node easier, at the cost of making constructing
the objects a bit more complicated.
@copy-pr-bot
Copy link
Copy Markdown

copy-pr-bot bot commented Jun 17, 2025

Auto-sync is disabled for draft pull requests in this repository. Workflows must be run manually.

Contributors can view more details about this message here.

@github-actions github-actions bot added Python Affects Python cuDF API. cudf-polars Issues specific to cudf-polars labels Jun 17, 2025
@GPUtester GPUtester moved this to In Progress in cuDF Python Jun 17, 2025
@TomAugspurger TomAugspurger added improvement Improvement / enhancement to an existing function non-breaking Non-breaking change labels Jun 17, 2025
@TomAugspurger
Copy link
Copy Markdown
Contributor Author

/ok to test

Copy link
Copy Markdown
Member

@rjzamora rjzamora left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think this makes sense. I think my only concern is that Scan.parquet_options feels a bit "too-specific" for Scan maybe?

Comment on lines +329 to +330
parquet_options: ParquetOptions | None
"""Parquet-specific options."""
Copy link
Copy Markdown
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Do we expect to eventually need json_options and csv_options options as well?

Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

If and when we expose those as ConfigOptions, then yes, I think so. I think the requirement here is that anything passed into Scan.do_evaluate needs to be present on the node.

If we add those we might want to refactor things into some kind of IOOptions.

@TomAugspurger
Copy link
Copy Markdown
Contributor Author

I think my only concern is that Scan.parquet_options feels a bit "too-specific" for Scan maybe?

Yeah, I think this is a consequence of Scan handling multiple types (Scan.type). We end up needing to give it everything needed for any of the types.

@TomAugspurger TomAugspurger marked this pull request as ready for review June 18, 2025 18:43
@TomAugspurger TomAugspurger requested a review from a team as a code owner June 18, 2025 18:43
Copy link
Copy Markdown
Member

@rjzamora rjzamora left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This is looking good - Some small comments.

@TomAugspurger
Copy link
Copy Markdown
Contributor Author

/merge

@rapids-bot rapids-bot bot merged commit 5b9d9a5 into rapidsai:branch-25.08 Jun 24, 2025
107 checks passed
@github-project-automation github-project-automation bot moved this from In Progress to Done in cuDF Python Jun 24, 2025
@TomAugspurger TomAugspurger deleted the tom/ir-config-options branch June 24, 2025 18:42
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

cudf-polars Issues specific to cudf-polars improvement Improvement / enhancement to an existing function non-breaking Non-breaking change Python Affects Python cuDF API.

Projects

Archived in project

Development

Successfully merging this pull request may close these issues.

4 participants