
Fix Flash Attention 3 interface for new FA3 return format#13173

Open
veeceey wants to merge 2 commits into huggingface:main from veeceey:fix/issue-12022-flash-attention-3-interface

Conversation


@veeceey veeceey commented Feb 23, 2026

After Dao-AILab/flash-attention@ed20940, flash_attn_3_func no longer returns (out, lse, ...) by default; it returns just out. This breaks _wrapped_flash_attn_3, which unconditionally unpacks out, lse, *_:

ValueError: not enough values to unpack (expected at least 2, got 1)

This PR:

  • Passes return_attn_probs=True to flash_attn_3_func (consistent with how _flash_attention_3_hub_forward_op already handles it)
  • Adds a fallback for robustness in case the return format still varies
  • Applies the same fix to _flash_varlen_attention_3 which had the same issue

Fixes #12022
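A minimal sketch of the unpacking pattern this PR introduces (hypothetical names; simple stubs stand in for flash_attn_3_func, since the unpack logic is what matters here, not the CUDA kernel):

```python
# Sketch of the fix: always pass return_attn_probs=True, then unpack
# defensively. fake_fa3_* are stand-ins for flash_attn_3_func; the real
# change lives in diffusers' _wrapped_flash_attn_3.

def fake_fa3_old(q, return_attn_probs=False):
    # Old FA3: always returns (out, lse, ...) regardless of the flag.
    return (q * 2, "lse", None)

def fake_fa3_new(q, return_attn_probs=False):
    # New FA3 (after ed20940): returns just `out` unless the flag is set.
    return (q * 2, "lse") if return_attn_probs else q * 2

def wrapped_flash_attn_3(fa3_func, q):
    result = fa3_func(q, return_attn_probs=True)
    if isinstance(result, tuple):
        out, lse, *_ = result      # old FA3, or new FA3 with the flag set
    else:
        out, lse = result, None    # fallback in case the format still varies
    return out, lse
```

Both stub versions flow through the same wrapper without the unpack error.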

Newer versions of flash-attn (after Dao-AILab/flash-attention@ed20940)
no longer return lse by default from flash_attn_3_func. The function
now returns just the output tensor unless return_attn_probs=True is
passed.

Updated _wrapped_flash_attn_3 and _flash_varlen_attention_3 to pass
return_attn_probs and handle both old (always tuple) and new (tensor
or tuple) return formats gracefully.

Fixes huggingface#12022
Author

veeceey commented Feb 23, 2026

Test Results

Can't run FA3 tests locally (no CUDA GPU), but verified the logic:

  1. _wrapped_flash_attn_3: Now passes return_attn_probs=True and handles both tuple (old FA3) and non-tuple (new FA3 fallback) returns
  2. _flash_varlen_attention_3: Same pattern, only requests return_attn_probs when return_lse=True is passed by the caller
  3. Consistent with how _flash_attention_3_hub_forward_op already handles this at line ~1258 (passes return_attn_probs=return_lse and conditionally unpacks)

The fix is backwards-compatible: old FA3 versions that always return tuples will still work since we check isinstance(result, tuple).
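The varlen pattern described in point 2 can be sketched similarly (illustrative names; a stub stands in for the FA3 varlen kernel, and the real function is diffusers' _flash_varlen_attention_3):

```python
# Illustrative sketch: only request the LSE when the caller asked for it.
# fake_varlen_fa3 stands in for the FA3 varlen attention function.

def fake_varlen_fa3(q, return_attn_probs=False):
    # New-style FA3: a tuple only when probs/LSE are requested.
    return (q * 2, "lse") if return_attn_probs else q * 2

def flash_varlen_attention_3(q, return_lse=False):
    result = fake_varlen_fa3(q, return_attn_probs=return_lse)
    if return_lse:
        # A tuple is expected here; fall back defensively if the format varies.
        out, lse = result if isinstance(result, tuple) else (result, None)
        return out, lse
    return result
```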

Collaborator

@DN6 DN6 left a comment


Minor comment. Looks good otherwise 👍🏽

Comment on lines 699 to 704

    if isinstance(result, tuple):
        out, lse, *_ = result
        lse = lse.permute(0, 2, 1)
    else:
        out = result
        lse = torch.empty(q.shape[0], q.shape[2], q.shape[1], device=q.device, dtype=torch.float32)
Collaborator

Don't think we need this guard. In both cases (old vs new FA3) we're always getting a tuple back since return_attn_probs=True. Why not just leave it as out, lse, *_ = flash_attn_3_func(...)?

Since return_attn_probs=True is always passed, the result is
guaranteed to be a tuple. Remove the unnecessary isinstance guard.
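The simplified form after this commit, sketched with a stub in place of flash_attn_3_func (with return_attn_probs=True the result is always a tuple, so the direct unpack is safe):

```python
def fa3_stub(q, return_attn_probs=False):
    # Stand-in for flash_attn_3_func: with return_attn_probs=True the
    # result is always a tuple, so direct unpacking needs no guard.
    return (q * 2, "lse") if return_attn_probs else q * 2

out, lse, *_ = fa3_stub(3, return_attn_probs=True)
```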
Collaborator

@DN6 DN6 left a comment


Thanks @veeceey

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.



Development

Successfully merging this pull request may close these issues.

_flash_attention_3 in dispatch_attention_fn is not compatible with the latest flash-atten interface.

3 participants