Fix sliding window size for token attention kernel by WANDY666 · Pull Request #1312 · ModelTC/LightLLM

WANDY666 · 2026-05-18T03:19:12Z

No description provided.

gemini-code-assist

Code Review

This pull request updates the _token_attention_kernel in transformer_layer_infer.py to adjust the window_size for sliding window attention, changing the second parameter from self.sliding_window - 1 to 0. I have no feedback to provide as there were no review comments.

Fix sliding window size for token attention kernel

43ecd86

WANDY666 merged commit eaf0f42 into main May 18, 2026
1 check passed

WANDY666 deleted the WANDY666-patch-2 branch May 18, 2026 03:20

gemini-code-assist Bot reviewed May 18, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix sliding window size for token attention kernel#1312

Fix sliding window size for token attention kernel#1312
WANDY666 merged 1 commit into
mainfrom
WANDY666-patch-2

WANDY666 commented May 18, 2026

Uh oh!

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

WANDY666 commented May 18, 2026

Uh oh!

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant