Quartet II: Accurate LLM Pre-Training in NVFP4 by Improved Unbiased Gradient Estimation

This is the official code for the Quartet II NVFP4 training paper.

Quickstart

Create a conda environment and install dependencies (we recommend Python 3.11):

conda create -n env python=3.11
conda activate env

pip install -r requirements.txt

Reproduce the Quartet II sweeps on a SLURM cluster:

cd scripts
sbatch quartetv2_sweep.sh

Inspect the scheme implementation at:

[quartet_2.py](./src/models/quantization/schemes/quartet_2.py)
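For readers new to the "unbiased" part of the title, the following is a minimal pure-Python sketch of stochastic rounding onto the FP4 E2M1 grid, the element format used by NVFP4. It is a generic illustration only and is not the scheme implemented in quartet_2.py. The function names are hypothetical, and the block scale is kept in full precision here, whereas NVFP4 stores block scales in FP8.

```python
import random

# Non-negative magnitudes representable in FP4 E2M1.
E2M1_GRID = [0.0, 0.5, 1.0, 1.5, 2.0, 3.0, 4.0, 6.0]


def stochastic_round_e2m1(x: float, rng: random.Random) -> float:
    """Round x to a neighbouring grid value so that E[result] == x for |x| <= 6."""
    sign = -1.0 if x < 0 else 1.0
    # Saturate at the largest magnitude; clipping is the only source of bias here.
    mag = min(abs(x), E2M1_GRID[-1])
    # Find the grid interval [lo, hi] that contains mag.
    for lo, hi in zip(E2M1_GRID, E2M1_GRID[1:]):
        if mag <= hi:
            break
    # Round up with probability proportional to the distance from lo.
    p_up = (mag - lo) / (hi - lo)
    return sign * (hi if rng.random() < p_up else lo)


def quantize_block(block, rng):
    """Scale a block so its largest magnitude maps to 6.0, round, then rescale."""
    amax = max(abs(v) for v in block)
    if amax == 0.0:
        return [0.0] * len(block)
    scale = amax / E2M1_GRID[-1]  # illustrative: left unquantized in this sketch
    return [stochastic_round_e2m1(v / scale, rng) * scale for v in block]
```

Averaging many calls of stochastic_round_e2m1(2.4, rng) should approach 2.4, since the value rounds to 3.0 with probability 0.4 and to 2.0 otherwise. Round-to-nearest would always return 2.0 for the same input.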

NVFP4 Kernels

We provide kernels tuned for the RTX 5090 (sm120a) in ./kernels. They require CUDA 12.8 or newer and a recent PyTorch release (around 2.9.0). Install them with:

cd kernels
pip install --no-build-isolation .
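If the build fails, it can help to confirm the requirements stated above first. The sketch below is one way to do that, not part of this repository: parse_version, cuda_is_supported, and describe_environment are hypothetical helper names, and the only PyTorch calls used are torch.__version__, torch.version.cuda, torch.cuda.is_available(), and torch.cuda.get_device_capability().

```python
import re


def parse_version(text: str) -> tuple:
    """Extract the leading numeric components, e.g. '2.9.0+cu128' -> (2, 9, 0)."""
    match = re.match(r"\d+(?:\.\d+)*", text)
    if match is None:
        raise ValueError(f"no version number found in {text!r}")
    return tuple(int(part) for part in match.group(0).split("."))


def cuda_is_supported(cuda_version: str, minimum=(12, 8)) -> bool:
    """Return True if the CUDA version is at least the stated minimum (12.8)."""
    return parse_version(cuda_version)[:2] >= minimum


def describe_environment() -> dict:
    """Collect the PyTorch version, its CUDA version, and the GPU capability."""
    import torch  # imported lazily so the helpers above work without PyTorch

    info = {
        "torch": torch.__version__,
        "cuda": torch.version.cuda,  # None on CPU-only builds
        "capability": None,
    }
    if torch.cuda.is_available():
        # Returns (major, minor); an RTX 5090 should report (12, 0).
        info["capability"] = torch.cuda.get_device_capability()
    return info
```

Calling describe_environment() in the activated environment and passing its "cuda" entry to cuda_is_supported gives a quick check before running pip install.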

You can then use the provided drop-in NVFP4 nn.Linear replacement as follows:

import torch

from quartet2.linear import Quartet_II_linear

# in_dim and out_dim are the input and output feature sizes, as for nn.Linear.
linear = Quartet_II_linear(
    in_dim,
    out_dim,
    device="cuda",
    dtype=torch.bfloat16,
)
...

You can further benchmark the kernels against BF16, FP8 and Quartet with:

cd test
python bench_linear.py

Cite This Work

@misc{panferov2026quartetiiaccuratellm,
      title={Quartet II: Accurate LLM Pre-Training in NVFP4 by Improved Unbiased Gradient Estimation}, 
      author={Andrei Panferov and Erik Schultheis and Soroush Tabesh and Dan Alistarh},
      year={2026},
      eprint={2601.22813},
      archivePrefix={arXiv},
      primaryClass={cs.LG},
      url={https://arxiv.org/abs/2601.22813}, 
}
