[quantization] Add `gptq_use_orig_model_inference` by stamalakhov · Pull Request #702 · Samsung/TICO

stamalakhov · 2026-05-13T11:53:17Z

This PR adds gptq_use_orig_model_inference to stabilize GPTQ for deep models.

Compare GPTQ quantization stability with different torch versions for HuggingFaceTB/SmolLM2-135M-Instruct:

torch	original_PPL	gptq_use_orig_model_inference_PPL
torch_2_6	22.76	22.95
torch_2_9	22.83	22.95
torch_2_10	22.81	22.95

please see additional experiments in the draft #670

Draft: #670
Related: #656

TICO-DCO-1.0-Signed-off-by: s.malakhov s.malakhov@partner.samsung.com

This PR adds `gptq_use_orig_model_inference` to stabilize GPTQ for deep models. TICO-DCO-1.0-Signed-off-by: s.malakhov <s.malakhov@partner.samsung.com>

mhs4670go

LGTM

stamalakhov requested a review from mhs4670go May 13, 2026 11:53

stamalakhov self-assigned this May 13, 2026

[quantization] Add gptq_use_orig_model_inference

d28dae4

This PR adds `gptq_use_orig_model_inference` to stabilize GPTQ for deep models. TICO-DCO-1.0-Signed-off-by: s.malakhov <s.malakhov@partner.samsung.com>

stamalakhov force-pushed the gptq_use_orig_model branch from 6e91b79 to d28dae4 Compare May 13, 2026 12:30

mhs4670go approved these changes May 13, 2026

View reviewed changes

mhs4670go merged commit 7e9b3e5 into Samsung:main May 13, 2026
7 checks passed

stamalakhov deleted the gptq_use_orig_model branch May 13, 2026 12:46

stamalakhov mentioned this pull request May 13, 2026

[quantization] Increase precision #704

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[quantization] Add `gptq_use_orig_model_inference`#702

[quantization] Add `gptq_use_orig_model_inference`#702
mhs4670go merged 1 commit into
Samsung:mainfrom
stamalakhov:gptq_use_orig_model

stamalakhov commented May 13, 2026

Uh oh!

mhs4670go left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

stamalakhov commented May 13, 2026

Uh oh!

mhs4670go left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants