Skip to content

[quantization][draft] [NO_MERGE] Fix instability#670

Draft
stamalakhov wants to merge 1 commit into
Samsung:mainfrom
stamalakhov:fix_stability
Draft

[quantization][draft] [NO_MERGE] Fix instability#670
stamalakhov wants to merge 1 commit into
Samsung:mainfrom
stamalakhov:fix_stability

Conversation

@stamalakhov
Copy link
Copy Markdown
Contributor

@stamalakhov stamalakhov commented Apr 30, 2026

This PR tries to fix instability in llama quantization.

results of ppl evaluation for LLama3.2-3B-Instruct quantized using GPTQ only (smse) on 128 samples

torch and transformers packages were the same on all GPUs:


torch 2.10.0
transformers 5.8.0

environment original fix_stability==1 fix_stability==2 fix_stability==3
GPU_0 12.023604393 11.953644753 12.002959251 12.002959251
GPU_1 11.913434029 12.059534073 11.875122070 11.875122070
GPU_2 12.054678917 12.145004272 11.989699364 11.989699364

with gptq_use_orig_model enabled:

environment fix_stability==0 fix_stability==1 fix_stability==2
GPU_0 11.892453194 11.894100189 11.898873329
GPU_1 11.902789116 11.898837090 11.893599510
GPU_2 11.905962944 11.892909050 11.891006470

Relative error:

parameters fix_stability*==0 (%) fix_stability==1 fix_stability==2
NO_gptq_use_orig_model 1.27 1.73 1.16
gptq_use_orig_model 0.12 0.05 0.07

Related: #656

TICO-DCO-1.0-Signed-off-by: s.malakhov s.malakhov@partner.samsung.com

@stamalakhov stamalakhov self-assigned this Apr 30, 2026
@stamalakhov stamalakhov changed the title [quantization][draft] Fix instability [quantization][draft] [NO_MERGE] Fix instability Apr 30, 2026
@stamalakhov stamalakhov force-pushed the fix_stability branch 2 times, most recently from 6cb7426 to f883c88 Compare May 12, 2026 06:48
This PR tries to fix instability in llama quantization.

TICO-DCO-1.0-Signed-off-by: s.malakhov <s.malakhov@partner.samsung.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant