
[CI] Fix slow grpo CI #3693

Merged
merged 4 commits into main from fix-ci-slow-grpo on Jul 4, 2025

Conversation

@kashif kashif (Collaborator) commented Jul 4, 2025

What does this PR do?

We now check whether flash_attn is installed and, if not, fall back to sdpa_paged in the transformers paged-attention slow test.
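
For context, a minimal sketch of the kind of check described above, assuming the transformers helper `is_flash_attn_2_available`; the `"paged_attention"` name is an assumption, while `sdpa_paged` comes from this description, and the actual code in `trl/trainer/grpo_trainer.py` may differ:

```python
from transformers.utils import is_flash_attn_2_available

# Prefer the FlashAttention-2 backed paged kernel when flash_attn is installed;
# otherwise fall back to the SDPA-based paged implementation so the slow CI job
# still runs on machines without flash_attn.
if is_flash_attn_2_available():
    attn_implementation = "paged_attention"  # assumed name for the flash_attn-backed path
else:
    attn_implementation = "sdpa_paged"       # fallback mentioned in this PR
```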

Before submitting

  • This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
  • Did you read the contributor guideline,
    Pull Request section?
  • Was this discussed/approved via a GitHub issue? Please add a link
    to it if that's the case.
  • Did you make sure to update the documentation with your changes?
  • Did you write any new necessary tests?

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@kashif kashif (Collaborator, Author) commented Jul 4, 2025

@ArthurZucker the paged attention CI was failing since it didn't find flash_attn, so I replaced the check with is_flash_attn_2_available. Do you think that is reasonable?

@kashif kashif requested a review from Copilot July 4, 2025 14:04
@Copilot Copilot AI (Contributor) left a comment

Pull Request Overview

This PR enhances CI for the GRPO trainer by preferring FlashAttention v2 when available, updates a comment for consistency in the vLLM serve script, and fixes a markdown warning in the SFT trainer docs.

  • Import and use is_flash_attn_2_available instead of solely relying on torch.cuda.is_available
  • Update a misaligned example comment in vllm_serve.py to match actual dtype usage
  • Wrap the {% generation %} tag in a raw block in the SFT trainer docs warning

Reviewed Changes

Copilot reviewed 3 out of 3 changed files in this pull request and generated 1 comment.

File                         Description
trl/trainer/grpo_trainer.py  Added import of is_flash_attn_2_available and replaced the CUDA check for attention
trl/scripts/vllm_serve.py    Corrected comment to show "torch.float32" as a string in the collective_rpc example
docs/source/sft_trainer.md   Changed warning block to use a raw tag around {% generation %}
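
As an aside, the docs fix in the last row can be illustrated with plain Jinja2: without a raw block, the templating engine would try to parse {% generation %} as a tag. A small hypothetical sketch (not the actual docs build pipeline):

```python
from jinja2 import Environment

env = Environment()

# Wrapping the tag in {% raw %} ... {% endraw %} tells Jinja2 to emit it verbatim
# instead of parsing it, which is what the sft_trainer.md warning now does.
template = env.from_string("Use {% raw %}{% generation %}{% endraw %} in your chat template.")
print(template.render())
# -> Use {% generation %} in your chat template.
```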

@kashif kashif self-assigned this Jul 4, 2025
@kashif kashif merged commit db19d79 into main Jul 4, 2025
11 checks passed
@kashif kashif deleted the fix-ci-slow-grpo branch July 4, 2025 17:46
@ArthurZucker (Contributor) commented

Yes completely! Sorry
