Commit bbb9159
feat: disable Qwen3 thinking mode by default (/no_think)
Qwen3-4B defaults to thinking mode ("Okay, the user asked..."),
wasting tokens on reasoning chains. Adding /no_think to the system
prompt produces direct answers.
Before: "Okay, the user asked... Let me recall... Gravity is a fu"
After: "Gravity is the force that attracts any object with mass..."
Speed: 4.3 tok/s (unchanged)
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>1 parent e273f2b commit bbb9159
1 file changed
+1
-1
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
209 | 209 | | |
210 | 210 | | |
211 | 211 | | |
212 | | - | |
| 212 | + | |
213 | 213 | | |
214 | 214 | | |
215 | 215 | | |
| |||
0 commit comments