[Data] Fix broken `LogicalOperator` abstraction barrier in projection pushdown rule by bveeramani · Pull Request #58683 · ray-project/ray

bveeramani · 2025-11-16T21:42:55Z

Description

The projection pushdown rule was directly accessing _cached_output_metadata.schema, which breaks abstraction barriers by reaching into private implementation details. This violates encapsulation and makes the code fragile to internal changes.

This PR fixes the issue by using the proper infer_schema() method instead, which provides a clean public interface for accessing schema information. This respects the operator's abstraction and ensures we get the correct schema through the intended API.

Additional information

The change is in projection_pushdown.py:342 where we now call input_op.infer_schema() instead of directly accessing input_op._cached_output_metadata.schema.

Signed-off-by: Balaji Veeramani <bveeramani@berkeley.edu>

gemini-code-assist

Code Review

This pull request correctly refactors the projection pushdown rule to respect the LogicalOperator abstraction barrier. By replacing direct access to the private _cached_output_metadata.schema attribute with a call to the public infer_schema() method, the code becomes more robust and maintainable. This change aligns with good object-oriented design principles and reduces the risk of future breakage due to internal implementation changes. The fix is clean, targeted, and I approve of the change.

…n pushdown rule (ray-project#58683) ## Description The projection pushdown rule was directly accessing `_cached_output_metadata.schema`, which breaks abstraction barriers by reaching into private implementation details. This violates encapsulation and makes the code fragile to internal changes. This PR fixes the issue by using the proper `infer_schema()` method instead, which provides a clean public interface for accessing schema information. This respects the operator's abstraction and ensures we get the correct schema through the intended API. ## Additional information The change is in `projection_pushdown.py:342` where we now call `input_op.infer_schema()` instead of directly accessing `input_op._cached_output_metadata.schema`. Signed-off-by: Balaji Veeramani <bveeramani@berkeley.edu> Signed-off-by: Aydin Abiar <aydin@anyscale.com>

…n pushdown rule (ray-project#58683) ## Description The projection pushdown rule was directly accessing `_cached_output_metadata.schema`, which breaks abstraction barriers by reaching into private implementation details. This violates encapsulation and makes the code fragile to internal changes. This PR fixes the issue by using the proper `infer_schema()` method instead, which provides a clean public interface for accessing schema information. This respects the operator's abstraction and ensures we get the correct schema through the intended API. ## Additional information The change is in `projection_pushdown.py:342` where we now call `input_op.infer_schema()` instead of directly accessing `input_op._cached_output_metadata.schema`. Signed-off-by: Balaji Veeramani <bveeramani@berkeley.edu> Signed-off-by: xiaowen.wxw <wxw403883@alibaba-inc.com>

…n pushdown rule (ray-project#58683) ## Description The projection pushdown rule was directly accessing `_cached_output_metadata.schema`, which breaks abstraction barriers by reaching into private implementation details. This violates encapsulation and makes the code fragile to internal changes. This PR fixes the issue by using the proper `infer_schema()` method instead, which provides a clean public interface for accessing schema information. This respects the operator's abstraction and ensures we get the correct schema through the intended API. ## Additional information The change is in `projection_pushdown.py:342` where we now call `input_op.infer_schema()` instead of directly accessing `input_op._cached_output_metadata.schema`. Signed-off-by: Balaji Veeramani <bveeramani@berkeley.edu> Signed-off-by: YK <1811651+ykdojo@users.noreply.github.com>

…n pushdown rule (ray-project#58683) ## Description The projection pushdown rule was directly accessing `_cached_output_metadata.schema`, which breaks abstraction barriers by reaching into private implementation details. This violates encapsulation and makes the code fragile to internal changes. This PR fixes the issue by using the proper `infer_schema()` method instead, which provides a clean public interface for accessing schema information. This respects the operator's abstraction and ensures we get the correct schema through the intended API. ## Additional information The change is in `projection_pushdown.py:342` where we now call `input_op.infer_schema()` instead of directly accessing `input_op._cached_output_metadata.schema`. Signed-off-by: Balaji Veeramani <bveeramani@berkeley.edu>

…n pushdown rule (ray-project#58683) ## Description The projection pushdown rule was directly accessing `_cached_output_metadata.schema`, which breaks abstraction barriers by reaching into private implementation details. This violates encapsulation and makes the code fragile to internal changes. This PR fixes the issue by using the proper `infer_schema()` method instead, which provides a clean public interface for accessing schema information. This respects the operator's abstraction and ensures we get the correct schema through the intended API. ## Additional information The change is in `projection_pushdown.py:342` where we now call `input_op.infer_schema()` instead of directly accessing `input_op._cached_output_metadata.schema`. Signed-off-by: Balaji Veeramani <bveeramani@berkeley.edu> Signed-off-by: Future-Outlier <eric901201@gmail.com>

…n pushdown rule (ray-project#58683) ## Description The projection pushdown rule was directly accessing `_cached_output_metadata.schema`, which breaks abstraction barriers by reaching into private implementation details. This violates encapsulation and makes the code fragile to internal changes. This PR fixes the issue by using the proper `infer_schema()` method instead, which provides a clean public interface for accessing schema information. This respects the operator's abstraction and ensures we get the correct schema through the intended API. ## Additional information The change is in `projection_pushdown.py:342` where we now call `input_op.infer_schema()` instead of directly accessing `input_op._cached_output_metadata.schema`. Signed-off-by: Balaji Veeramani <bveeramani@berkeley.edu> Signed-off-by: peterxcli <peterxcli@gmail.com>

Initial commit

f220d6b

Signed-off-by: Balaji Veeramani <bveeramani@berkeley.edu>

bveeramani requested a review from a team as a code owner November 16, 2025 21:42

bveeramani enabled auto-merge (squash) November 16, 2025 21:43

github-actions bot added the go add ONLY when ready to merge, run all tests label Nov 16, 2025

gemini-code-assist bot reviewed Nov 16, 2025

View reviewed changes

bveeramani changed the title ~~[Data] Fix broken LogicalOperator abstraction barrier in predication pushdown rule~~ [Data] Fix broken LogicalOperator abstraction barrier in projection pushdown rule Nov 16, 2025

ray-gardener bot added the data Ray Data-related issues label Nov 17, 2025

goutamvenkat-anyscale approved these changes Nov 17, 2025

View reviewed changes

bveeramani merged commit 5d24749 into master Nov 17, 2025
7 checks passed

bveeramani deleted the data-fix-projection-abstraction branch November 17, 2025 02:36

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Data] Fix broken `LogicalOperator` abstraction barrier in projection pushdown rule#58683

[Data] Fix broken `LogicalOperator` abstraction barrier in projection pushdown rule#58683
bveeramani merged 1 commit intomasterfrom
data-fix-projection-abstraction

bveeramani commented Nov 16, 2025

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

bveeramani commented Nov 16, 2025

Description

Additional information

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants