Multi-Query Attention: "Fast Transformer Decoding: One Write-Head is All You Need" (Nov 2019). Faster decoding and lower GPU memory usage.
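
To make the speed and memory claim concrete, here is a minimal NumPy sketch of the idea: every query head attends over a single shared key/value head, so the key/value cache kept during decoding shrinks by a factor equal to the number of heads. This is an illustration, not the paper's code. The function names (`multi_query_attention`, `kv_cache_bytes`), the weight shapes, the causal mask, and the unbatched single-sequence layout are all assumptions made for the example.

```python
import numpy as np


def softmax(x, axis=-1):
    # Numerically stable softmax along the given axis.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def multi_query_attention(x, w_q, w_k, w_v, w_o, num_heads):
    """Causal multi-query attention for one sequence (hypothetical helper).

    x:   (seq, d_model)
    w_q: (d_model, num_heads * d_head)  one query projection per head
    w_k: (d_model, d_head)              single key head shared by all query heads
    w_v: (d_model, d_head)              single value head shared by all query heads
    w_o: (num_heads * d_head, d_model)
    """
    seq = x.shape[0]
    d_head = w_k.shape[1]

    # Queries keep a separate head dimension: (num_heads, seq, d_head).
    q = (x @ w_q).reshape(seq, num_heads, d_head).transpose(1, 0, 2)
    # Keys and values have no head dimension: (seq, d_head).
    k = x @ w_k
    v = x @ w_v

    # Broadcasting reuses the same k for every query head: (num_heads, seq, seq).
    scores = q @ k.T / np.sqrt(d_head)
    # Causal mask (an assumption of this sketch): no attention to later positions.
    future = np.triu(np.ones((seq, seq), dtype=bool), k=1)
    scores = np.where(future, -np.inf, scores)

    weights = softmax(scores, axis=-1)
    out = weights @ v  # (num_heads, seq, d_head), same v for every head
    out = out.transpose(1, 0, 2).reshape(seq, num_heads * d_head)
    return out @ w_o


def kv_cache_bytes(seq_len, num_kv_heads, d_head, bytes_per_elem=2):
    # Keys plus values cached for one layer; 2 bytes per element assumes fp16.
    return 2 * seq_len * num_kv_heads * d_head * bytes_per_elem
```

With 8 query heads, `kv_cache_bytes(1024, 8, 64) // kv_cache_bytes(1024, 1, 64)` evaluates to 8: the shared head stores one eighth of the keys and values that standard multi-head attention would cache, which is where the memory saving and the reduced memory traffic during decoding come from.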