[bugfix]: Add compatibility handling for Qwen3.5 GatedDeltaNet padding-free training and fix create_causal_mask patch when cache_positions removed in transformers >5.3.0 #202

Draft
meichangsu1 wants to merge 6 commits into modelscope:main from meichangsu1:padding_free_bufix_ljl

Commits

Commits on May 22, 2026

Commits on May 24, 2026

Commits on May 25, 2026