fix: resolve crashes in fine-tuning notebooks (half_precision, cuda:0, torch.load)#169

Open
haoyu-haoyu wants to merge 1 commit into agemagician:master from haoyu-haoyu:fix/notebook-crashes

Conversation

@haoyu-haoyu

Summary

This PR fixes three bugs that prevent the LoRA fine-tuning notebooks from running:

  • TypeError crash: PT5_classification_model(num_labels, half_precision) has no default for half_precision, but train_per_protein() / train_per_residue() call it as PT5_classification_model(num_labels=num_labels) without passing the argument. Added half_precision=False as the default, matching the training function's mixed=False default. (All 3 notebooks)

  • Hardcoded to('cuda:0') crash: In the per-residue classification notebook, valid_labels is unconditionally moved to cuda:0, crashing on CPU-only systems. Changed to to(logits.device) to match whatever device the model is on. The regression notebook already handles this correctly. (per_residue_class only)

  • torch.load deprecation: Added explicit weights_only=False to suppress the FutureWarning in PyTorch 2.6+ (weights_only will default to True). These checkpoints contain LoRA parameter dicts that require full unpickling. (All 3 notebooks)
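The first fix boils down to giving the model factory a default that mirrors the training function's `mixed=False` default. A minimal sketch (the function bodies are stand-ins for the notebooks' real ProtT5/LoRA construction):

```python
# Sketch of the default-argument fix; the real PT5_classification_model
# builds a ProtT5 model with LoRA adapters, stubbed out here.
def PT5_classification_model(num_labels, half_precision=False):
    # Before the fix, half_precision had no default, so calling the
    # factory as PT5_classification_model(num_labels=...) raised a TypeError.
    return {"num_labels": num_labels,
            "precision": "fp16" if half_precision else "fp32"}

def train_per_protein(num_labels, mixed=False):
    # The training functions call the factory without half_precision,
    # relying on the new default, which mirrors mixed=False.
    return PT5_classification_model(num_labels=num_labels)

model = train_per_protein(num_labels=2)
```

With the default in place, the existing call sites work unchanged; passing `half_precision=True` explicitly still enables the half-precision path.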

Files changed

| Notebook | Fix |
| --- | --- |
| PT5_LoRA_Finetuning_per_prot.ipynb | `half_precision` default, `torch.load` |
| PT5_LoRA_Finetuning_per_residue_class.ipynb | `half_precision` default, `cuda:0`, `torch.load` |
| PT5_LoRA_Finetuning_per_residue_reg.ipynb | `half_precision` default, `torch.load` |

Test plan

  • Verified the half_precision=False default matches the behavior expected by the training function
  • to(logits.device) automatically follows whatever device the model is on
  • weights_only=False preserves existing behavior while suppressing warnings
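The device fix works because `.to(logits.device)` always follows wherever the model placed its outputs. A CPU-only sketch with hypothetical stand-in tensors (the same line runs unchanged on GPU):

```python
import torch

# Stand-ins for the notebook's tensors; on a GPU machine, logits would
# live on a CUDA device and valid_labels would follow it automatically.
logits = torch.randn(4, 3)
active_labels = torch.tensor([0, -100, 2, 1])

valid_labels = active_labels[active_labels != -100]
# Fixed line: follow the model's device instead of hardcoding 'cuda:0'.
valid_labels = valid_labels.type(torch.LongTensor).to(logits.device)
```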

- Add default `half_precision=False` to `PT5_classification_model()`.
  The training function calls it as
  `PT5_classification_model(num_labels=num_labels)` without passing
  `half_precision`, causing a TypeError crash. Affects all 3 notebooks.

- Fix hardcoded `to('cuda:0')` in per-residue classification notebook.
  Changed to `to(logits.device)` so CPU users are not forced to have
  a CUDA device. The per-residue regression notebook already handles
  this correctly.

- Add explicit `weights_only=False` to `torch.load(filepath)` in
  `load_model()` across all 3 notebooks. PyTorch 2.6+ defaults to
  `weights_only=True` which would break loading LoRA parameters.
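The `torch.load` change can be sketched with a small in-memory round trip (the checkpoint dict here is a stand-in for the notebooks' saved LoRA parameters):

```python
import io
import torch

# Stand-in checkpoint: a dict mixing tensors and plain Python objects,
# like the LoRA parameter dicts the notebooks save.
checkpoint = {"lora_A": torch.zeros(2, 2), "config": {"r": 4}}
buffer = io.BytesIO()
torch.save(checkpoint, buffer)
buffer.seek(0)

# Explicit weights_only=False preserves full-unpickling behavior and
# silences the FutureWarning ahead of PyTorch 2.6's weights_only=True default.
loaded = torch.load(buffer, weights_only=False)
```

Note that `weights_only=False` requires trusting the checkpoint source, since full unpickling can execute arbitrary code; here the checkpoints are produced by the notebooks themselves.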

@gemini-code-assist (bot) left a comment


Code Review

This pull request modifies several PT5 fine-tuning notebooks to include default arguments for model initialization, enable non-restrictive model loading, and implement dynamic device placement for tensors. The review feedback suggests adopting a more idiomatic PyTorch pattern by combining device and type casting into a single call.

```diff
 valid_labels=active_labels[active_labels!=-100]

-valid_labels=valid_labels.type(torch.LongTensor).to('cuda:0')
+valid_labels=valid_labels.type(torch.LongTensor).to(logits.device)
```


Severity: medium

While your change correctly addresses the hardcoded device issue, it's more idiomatic in modern PyTorch to use the .to() method for both device and type casting. The .type() method is considered legacy. Combining these into a single .to() call is cleaner and more efficient. Also, torch.long is preferred over torch.LongTensor when specifying a dtype.

```python
valid_labels = valid_labels.to(device=logits.device, dtype=torch.long)
```
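On CPU the two spellings produce identical results; a quick sanity check of the suggested single-call form against the legacy cast (`"cpu"` stands in for `logits.device` here):

```python
import torch

valid_labels = torch.tensor([0.0, 2.0, 1.0])
legacy = valid_labels.type(torch.LongTensor)              # legacy cast
modern = valid_labels.to(device="cpu", dtype=torch.long)  # single .to() call
assert torch.equal(legacy, modern)
```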
