Bump transformers to >=5.0.0 for GLM-4.7-Flash by tyler-griggs · Pull Request #1241 · NovaSky-AI/SkyRL

tyler-griggs · 2026-03-02T06:13:39Z

Summary

Upgrades transformers from >=4.56.1,<5 to >=5.0.0 to support GLM-4.7-Flash (Glm4MoeLiteForCausalLM), which was added in transformers 5.0.0.

Depends on: #1240 (vLLM 0.16.0 upgrade)
Merge when: vLLM officially declares transformers>=5 support

Why transformers 5.x is required

Glm4MoeLiteForCausalLM (model_type: glm4_moe_lite) only exists in transformers >=5.0.0
The HF model repo has no auto_map or custom code — trust_remote_code=True is useless
Both vLLM and megatron-bridge call AutoConfig.from_pretrained() which needs the model type registered

Changes

transformers>=5.0.0 in root pyproject.toml
transformers>=5.0.0 override-dependency (megatron-bridge declares <5)
transformers>=5.0.0 override in skyrl-train pyproject.toml
return_dict=False added to all 15 apply_chat_template calls (transformers 5.x changed default return type)
Chat templating test marked xfail (hardcoded values need regeneration)

Tested

GLM-4.7-Flash end-to-end GRPO training on 8x A100-80GB with transformers 5.2.0

🤖 Generated with Claude Code

- vllm: 0.13.0 -> 0.16.0 - torch: 2.9.0 -> 2.9.1 (required by vLLM 0.16.0) - flashinfer-python: 0.5.3 -> 0.6.3 (required by vLLM 0.16.0) - flashinfer-jit-cache: 0.5.3 -> 0.6.3 - numpy>=2.0.0 override (vLLM 0.16.0 -> opencv-python-headless>=4.13 -> numpy>=2, conflicting with megatron-core's <2 pin; tested compatible with megatron-core 0.15.0) Migrates vLLM import paths (0.13 -> 0.16): - serving_chat -> chat_completion.serving - serving_completion -> completion.serving - serving_models -> models.serving - protocol split into chat_completion/completion/engine.protocol - ErrorInfo moved to top-level import Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

transformers 5.0.0 adds Glm4MoeLiteConfig (model_type: glm4_moe_lite) required by GLM-4.7-Flash. No 4.x release has this model type and the HF repo provides no auto_map or custom code. - transformers: >=4.56.1,<5 -> >=5.0.0 - Add transformers>=5.0.0 override-dependency (megatron-bridge declares <5) - Add return_dict=False to all apply_chat_template calls (transformers 5.x changed the default return type from list to BatchEncoding) - Mark chat templating test as xfail (hardcoded expected values need regeneration for transformers 5.x tokenizer changes) Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

gemini-code-assist

Code Review

This pull request correctly upgrades the transformers library to version 5.0.0 or higher to support GLM-4.7-Flash. The changes include updating the main dependencies and adding return_dict=False to all apply_chat_template calls to adapt to the API change.

However, I've identified a couple of issues:

(High Severity) Inconsistent Dependency Versions: The transformers dependency in the skyrl-train extra in the root pyproject.toml (line 75) and in skyrl-train/pyproject.toml (line 27) has not been updated to >=5.0.0. This could lead to dependency resolution issues or the installation of an older transformers version. These should be updated for consistency.
(Medium Severity) Code Duplication: There is significant code duplication between the skyrl and skyrl-train packages (e.g., dataset.py, skyrl_gym_generator.py). This increases maintenance overhead. A specific comment has been added to highlight this.

Addressing these points will improve the maintainability and robustness of the codebase.

gemini-code-assist · 2026-03-02T06:16:23Z

skyrl-train/skyrl_train/dataset/dataset.py

+            lambda doc: len(
+                tokenizer.apply_chat_template(doc[prompt_key], add_generation_prompt=True, return_dict=False)
+            )
            <= self.max_prompt_length,


While the change to add return_dict=False is correct for transformers>=5.0.0, I've noticed that this file seems to be an exact duplicate of skyrl/train/dataset/dataset.py. There also appear to be other duplicated or near-duplicated files like skyrl-train/skyrl_train/generators/skyrl_gym_generator.py and skyrl-train/skyrl_train/generators/utils.py.

This code duplication increases maintenance overhead, as changes need to be applied in multiple places, which is error-prone. It would be beneficial to refactor this to eliminate the duplication. Perhaps these modules could be shared in a common library.

devin-ai-integration

✅ Devin Review: No Issues Found

Devin Review analyzed this PR and found no potential bugs to report.

View in Devin Review to see 4 additional findings.

Ubuntu and others added 2 commits March 2, 2026 06:11

gemini-code-assist bot reviewed Mar 2, 2026

View reviewed changes

devin-ai-integration bot reviewed Mar 2, 2026

View reviewed changes

vercel bot deployed to Preview March 2, 2026 06:22 View deployment

Base automatically changed from tgriggs/vllm-0.16-upgrade to main March 2, 2026 20:19

erictang000 mentioned this pull request Mar 3, 2026

Support Qwen3.5 with FSDP + Megatron Backends #1254

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Bump transformers to >=5.0.0 for GLM-4.7-Flash#1241

Bump transformers to >=5.0.0 for GLM-4.7-Flash#1241
tyler-griggs wants to merge 2 commits intomainfrom
tgriggs/transformers-5x

tyler-griggs commented Mar 2, 2026 •

edited by devin-ai-integration bot

Loading

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

gemini-code-assist bot Mar 2, 2026

Uh oh!

devin-ai-integration bot left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

tyler-griggs commented Mar 2, 2026 • edited by devin-ai-integration bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Why transformers 5.x is required

Changes

Tested

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist bot Mar 2, 2026

Choose a reason for hiding this comment

Uh oh!

devin-ai-integration bot left a comment

Choose a reason for hiding this comment

✅ Devin Review: No Issues Found

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

tyler-griggs commented Mar 2, 2026 •

edited by devin-ai-integration bot

Loading