Clarify rotary_pct reset behavior in GPTNeoXConfig (configuration_gpt_neox.py) #45025

Open
layla1824 wants to merge 1 commit into huggingface:main from layla1824:patch-1

Conversation

@layla1824

This PR adds a clarification comment regarding the behavior of rotary_pct.

Currently, rotary_pct may reset to its default value (0.25) after the config is reloaded, because it is read via kwargs.pop.

This note helps developers better understand this behavior.
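
To illustrate the kind of reset described above, here is a minimal sketch of one way it could arise. It assumes a hypothetical `ToyConfig` class (not the actual GPTNeoXConfig code) that reads `rotary_pct` with `kwargs.pop` and a fallback of 0.25, and whose serializer does not write the value back under the same key. The class name, the private attribute, and the `to_dict`/`from_dict` helpers are all illustrative assumptions.

```python
import json

DEFAULT_ROTARY_PCT = 0.25  # default value quoted in the PR description


class ToyConfig:
    """Hypothetical stand-in for illustration; NOT the real GPTNeoXConfig."""

    def __init__(self, hidden_size=64, **kwargs):
        self.hidden_size = hidden_size
        # The value is taken from kwargs; if the key is absent, the default wins.
        self._rotary_pct = kwargs.pop("rotary_pct", DEFAULT_ROTARY_PCT)

    @property
    def rotary_pct(self):
        return self._rotary_pct

    def to_dict(self):
        # Assumption for this sketch: only public attributes are serialized,
        # so the "rotary_pct" key never reaches the saved dictionary.
        return {k: v for k, v in vars(self).items() if not k.startswith("_")}

    @classmethod
    def from_dict(cls, data):
        return cls(**data)


original = ToyConfig(rotary_pct=0.5)
saved = json.dumps(original.to_dict())
reloaded = ToyConfig.from_dict(json.loads(saved))

print(original.rotary_pct, reloaded.rotary_pct)  # prints "0.5 0.25"
```

In this sketch the custom value is lost on the round trip because the key that `kwargs.pop` looks for is missing from the saved data, so the fallback default is used silently. Whether the real config follows this exact path is not confirmed here.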

Add note about rotary_pct reset behavior
@github-actions
Contributor

[For maintainers] Suggested jobs to run (before merge)

run-slow: gpt_neox
