Skip to content

[Feature] Support sliding window for triton attention backend #6161

@Fridge003

Description

@Fridge003

Checklist

Motivation

Currently in blackwell environment, only triton attention backend is stable for running. We need sliding window feature to run gemma 3 model, but triton backend doesn't support it.

Related resources

#6160

Metadata

Metadata

Assignees

No one assigned

    Labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions