[bugfix]: Add compatibility handling for Qwen3.5 GatedDeltaNet padding-free training and fix create_causal_mask patch when cache_positions removed in transformers >5.3.0 #202
Draft
meichangsu1 wants to merge 6 commits into
Commits
Commits on May 22, 2026
- qq_30035749 committed
- qq_30035749 committed
- qq_30035749 committed
Commits on May 24, 2026
- qq_30035749 committed
Commits on May 25, 2026
- qq_30035749 committed
- qq_30035749 committed