[TMP] Feature/hma support by chanhopark1 · Pull Request #6 · moreh-dev/LMCache

chanhopark1 · 2026-03-11T07:06:20Z

No description provided.

…odel names - Document GatedDeltaNet + GatedAttention group layout for Qwen3.5 - Keep Full+SWA layout as a second example - Explain why group 0 delegation is correct (standard KV only) - Remove incorrect gpt-oss-20b/120b references Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

…ation Hybrid models (e.g. Qwen3.5) produce mixed kv_caches dicts where attention layers are torch.Tensor but Mamba/linear-attention layers are list[torch.Tensor]. LMCache only handles attention KV caches, so filter out non-tensor entries at both the adapter and grouping layers. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

github-actions · 2026-05-11T02:24:48Z

This pull request has been automatically marked as stale because it has not had activity within 60 days. It will be automatically closed if no further activity occurs within 30 days.

chanhopark1 and others added 2 commits March 10, 2026 01:15

Add HMA (Hybrid Memory Architecture) support to connector

020cf60

gitgod-bot assigned chanhopark1 Mar 11, 2026

github-actions Bot added the stale label May 11, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[TMP] Feature/hma support#6

[TMP] Feature/hma support#6
chanhopark1 wants to merge 3 commits into
devfrom
feature/hma-support

chanhopark1 commented Mar 11, 2026

Uh oh!

github-actions Bot commented May 11, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

chanhopark1 commented Mar 11, 2026

Uh oh!

github-actions Bot commented May 11, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant