Skip to content

[GPU] Add CDNA block intrinsics #23941

@nirvedhmeshram

Description

@nirvedhmeshram

Block intrinsics allow subgroups to work on muliple blocks (batch dimenion) in parallel. They often have smaller sizes which make them a good fit for skinny shapes. This issue will track various steps to have them added. These tasks are sharded from this test PR which verified that we have e2e correctness #23934

Metadata

Metadata

Labels

No labels
No labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions