[quantization][draft] [NO_MERGE] Fix instability by stamalakhov · Pull Request #670 · Samsung/TICO

stamalakhov · 2026-04-30T12:43:10Z

This PR tries to fix instability in llama quantization.

results of ppl evaluation for LLama3.2-3B-Instruct quantized using GPTQ only (smse) on 128 samples

torch and transformers packages were the same on all GPUs:


torch 2.10.0
transformers 5.8.0

environment	original	fix_stability==1	fix_stability==2	fix_stability==3
GPU_0	12.023604393	11.953644753	12.002959251	12.002959251
GPU_1	11.913434029	12.059534073	11.875122070	11.875122070
GPU_2	12.054678917	12.145004272	11.989699364	11.989699364

with gptq_use_orig_model enabled:

environment	fix_stability==0	fix_stability==1	fix_stability==2
GPU_0	11.892453194	11.894100189	11.898873329
GPU_1	11.902789116	11.898837090	11.893599510
GPU_2	11.905962944	11.892909050	11.891006470

Relative error:

parameters	fix_stability*==0 (%)	fix_stability==1	fix_stability==2
NO_gptq_use_orig_model	1.27	1.73	1.16
gptq_use_orig_model	0.12	0.05	0.07

Related: #656

TICO-DCO-1.0-Signed-off-by: s.malakhov s.malakhov@partner.samsung.com

This PR tries to fix instability in llama quantization. TICO-DCO-1.0-Signed-off-by: s.malakhov <s.malakhov@partner.samsung.com>

stamalakhov self-assigned this Apr 30, 2026

stamalakhov changed the title ~~[quantization][draft] Fix instability~~ [quantization][draft] [NO_MERGE] Fix instability Apr 30, 2026

stamalakhov force-pushed the fix_stability branch from 46797c8 to 05869c9 Compare May 7, 2026 09:44

stamalakhov mentioned this pull request May 8, 2026

[quantization] Instability on llama quantization #656

Open

stamalakhov force-pushed the fix_stability branch 2 times, most recently from 6cb7426 to f883c88 Compare May 12, 2026 06:48

stamalakhov mentioned this pull request May 13, 2026

[quantization] Add gptq_use_orig_model_inference #702

Merged

[quantization][draft] Fix instability

07189f8

This PR tries to fix instability in llama quantization. TICO-DCO-1.0-Signed-off-by: s.malakhov <s.malakhov@partner.samsung.com>

stamalakhov force-pushed the fix_stability branch from f883c88 to 07189f8 Compare May 13, 2026 13:10

This was referenced May 13, 2026

[quantization] Tune torch #703

Open

[quantization] Increase precision #704

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[quantization][draft] [NO_MERGE] Fix instability#670

[quantization][draft] [NO_MERGE] Fix instability#670
stamalakhov wants to merge 1 commit into
Samsung:mainfrom
stamalakhov:fix_stability

stamalakhov commented Apr 30, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

stamalakhov commented Apr 30, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

stamalakhov commented Apr 30, 2026 •

edited

Loading