Skip to content

fix #66/docs: add Windows & Blackwell troubleshooting#67

Merged
bmhowe23 merged 2 commits into
NVIDIA:mainfrom
kvmto:docs/windows-blackwell-workarounds-66
Apr 17, 2026
Merged

fix #66/docs: add Windows & Blackwell troubleshooting#67
bmhowe23 merged 2 commits into
NVIDIA:mainfrom
kvmto:docs/windows-blackwell-workarounds-66

Conversation

@kvmto
Copy link
Copy Markdown
Collaborator

@kvmto kvmto commented Apr 15, 2026

Summary

Addresses the workarounds reported in #66 (RTX 5080 + 9800X3D on Windows — distance 13 success):

  • README — Blackwell GPU note: stable PyTorch cu124 wheels lack SM 12.0 kernels; documents the cu128 nightly install command
  • README — Windows section: Triton is unsupported on native Windows, so torch.compile fails; documents TORCH_COMPILE_DISABLE=1 / PREDECODER_TORCH_COMPILE=0 and the PYTHONPATH=code requirement when running outside the launch scripts
  • README — model-not-found guidance: explains the find_best_model search order (best_model/models/) and shows how to point to an explicit .pt path so users don't have to manually rename weights
  • run.py — better error message: find_best_model now prints which file patterns it expects and a hint to set model_checkpoint_file, instead of a bare FileNotFoundError

Test plan

  • Verify README renders correctly on GitHub (code blocks, bullet nesting)
  • Confirm find_best_model raises the improved error when no .pt files are present
  • Existing CI tests unaffected (no logic changes outside the error message)

Closes Issue #66

…d error

Address workarounds reported in NVIDIA#66 (RTX 5080 + Windows success report):

- README: add Blackwell GPU note (cu128 nightly required for SM 12.0)
- README: add Windows section (Triton unsupported, TORCH_COMPILE_DISABLE,
  PYTHONPATH)
- README: add pre-trained model not found guidance with explicit path option
- run.py: improve find_best_model FileNotFoundError with actionable hint

Closes NVIDIA#66

Signed-off-by: kvmto <kmato@nvidia.com>
@kvmto kvmto requested review from bmhowe23 and ivanbasov April 15, 2026 11:13
Comment thread README.md Outdated
@bmhowe23 bmhowe23 merged commit 07606b3 into NVIDIA:main Apr 17, 2026
15 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants