Comments

[PyTorch] Zero-initialize learnable softmax_offset in DotProductAttention #2694

Open
fjosw wants to merge 1 commit into NVIDIA:main from fjosw:fix/softmax-offset-zero-init-v2

Commits

Commits on Feb 20, 2026