[FEA] A flash-attention cuteDSL kernel for sm120 #2956

@zzczzc20

Description

Which component requires the feature?

CuTe DSL

Feature Request

Dear developers,
Could we offer a CuTe DSL flash-attention kernel for sm120 (Blackwell GeForce)?
I saw that a CuTe DSL flash-attention kernel for Ampere has already been developed, and I expect that extending it to support sm120 would not be very hard (mainly adding TMA).
This would be useful because there are few high-performance flash-attention kernels for sm120.
