
WIP: Treeformer #1371

Closed

jveitchmichaelis wants to merge 101 commits into weecology:main from jveitchmichaelis:treeformer

WIP: Treeformer#1371
jveitchmichaelis wants to merge 101 commits into
weecology:mainfrom
jveitchmichaelis:treeformer

Conversation

Collaborator

@jveitchmichaelis commented Apr 14, 2026

Description

This is marked as WIP as there's a lot to clean up, but the model architecture and training loop seem to be OK. The changes are mostly additive cruft rather than things that will require extensive rebasing.

Adds a point-detection model based on the TreeFormer architecture (https://arxiv.org/abs/2307.06118), which is in turn based on DM-Count (https://arxiv.org/abs/2009.13077).

This contribution is probably closer in spirit to DM-Count than TreeFormer because it only implements the supervised path, but the code was heavily adapted from the TreeFormer repository.

The model is a segmentation backbone (here, PvTv2) that feeds into two heads: a Global Average Pool (GAP) head that predicts a scalar tree count, and a density regression head (a multi-scale decoder).
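As a rough illustration of that two-headed layout, here is a minimal PyTorch sketch. A tiny conv stack stands in for the PvTv2 backbone, and a couple of conv layers stand in for the multi-scale decoder; all layer names and sizes are illustrative, not the PR's actual values.

```python
import torch
import torch.nn as nn


class TwoHeadDensityModel(nn.Module):
    """Sketch: shared backbone -> (scalar count head, density map head)."""

    def __init__(self, feat_dim: int = 32):
        super().__init__()
        # Placeholder backbone (the PR uses PvTv2 from `transformers`)
        self.backbone = nn.Sequential(
            nn.Conv2d(3, feat_dim, 3, stride=2, padding=1), nn.ReLU(),
            nn.Conv2d(feat_dim, feat_dim, 3, stride=2, padding=1), nn.ReLU(),
        )
        # Head 1: Global Average Pool -> scalar tree count
        self.count_head = nn.Sequential(
            nn.AdaptiveAvgPool2d(1), nn.Flatten(), nn.Linear(feat_dim, 1)
        )
        # Head 2: per-pixel density map (stand-in for the multi-scale decoder)
        self.density_head = nn.Sequential(
            nn.Conv2d(feat_dim, feat_dim, 3, padding=1), nn.ReLU(),
            nn.Conv2d(feat_dim, 1, 1), nn.ReLU(),  # densities are non-negative
        )

    def forward(self, x):
        feats = self.backbone(x)
        return self.count_head(feats).squeeze(-1), self.density_head(feats)


model = TwoHeadDensityModel()
count, density = model(torch.randn(2, 3, 256, 256))
# count: one scalar per image; density: one map per image at 1/4 resolution
```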

TODO: add model checkpoints to huggingface and default to them when user picks the treeformer config.
TODO: big clean up of commits, obviously.
TODO: remove slurm scripts, etc.

Replication

Sample predictions trained and tested on the paper's KCL dataset (using Google Earth images):

[image: IMG_158 density plot-6500]

Note: this dataset is probably overfit; the paper reports 500 epochs of training, but the dataset contains only 400 images. Nevertheless, on MAE the implementation here beats the paper's supervised benchmark and also its unsupervised benchmark (likely due to backbone pretraining): paper 16.7/18.5, ours 18.24 using the density sum and below 15 with peak extraction. We get a peak F1 of around 0.68. Convergence is seen after around 200 epochs, but we train for 500 to match the original hyperparameters.
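The "peak extraction" counting mentioned above can be sketched as local-maximum detection on the predicted density map. This is a generic non-maximum-suppression approach, not necessarily the PR's exact method, and the `size` and `threshold` values are illustrative hyperparameters:

```python
import numpy as np
from scipy.ndimage import maximum_filter


def extract_peaks(density: np.ndarray, size: int = 3, threshold: float = 0.5):
    """Return (row, col) positions of local maxima above a threshold."""
    # A pixel is a peak if it equals the max of its neighbourhood
    # and its value clears the detection threshold.
    is_peak = (density == maximum_filter(density, size=size)) & (density > threshold)
    ys, xs = np.nonzero(is_peak)
    return list(zip(ys.tolist(), xs.tolist()))


# Toy density map with two clear "trees"
d = np.zeros((8, 8))
d[2, 2] = 1.0
d[6, 5] = 0.9
peaks = extract_peaks(d)  # -> [(2, 2), (6, 5)]
```

The count is then `len(peaks)`, and peak F1 can be computed by matching predicted peaks to ground-truth points within a distance tolerance.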


A single epoch on NEON LIDAR (purple are preds):

[images: prediction sample-825, prediction sample-825_2]

Model differences

  • TreeFormer is, as far as I can tell, not invariant to input image shape. The vanilla implementation trains on 256-pixel crops and then predicts over 256-pixel tiles; if you change the input shape at test time, the model will under- or over-predict tree density. This implementation changes the output to regress density instead of absolute count, and learns a scale factor that is applied to the input image. This only affects the scale factor, but it's nice to be consistent here.
  • In the original paper the model learns spatial structure and count separately. In this version, we couple the two by forcing the sum of the density map to match the object count for consistency. Typically during training, the model quickly learns how to place tree "points" and then slowly learns to correct the predicted count output.
  • Support for DDP is provided via DeepForest, which involved a few tweaks to various parts of the codebase.
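The count/density coupling described above can be sketched as a combined loss: the GAP head's count is supervised directly, while the density map's total mass is pulled toward the same count. The L1 terms and the `lam` weight are illustrative; the actual loss (following DM-Count) also includes terms not shown here.

```python
import torch
import torch.nn.functional as F


def coupled_loss(pred_count, pred_density, gt_count, lam: float = 1.0):
    """Sketch of coupling the scalar count to the density map's sum."""
    density_sum = pred_density.sum(dim=(1, 2, 3))  # total mass per image
    count_loss = F.l1_loss(pred_count, gt_count)   # supervise the GAP count
    consistency = F.l1_loss(density_sum, gt_count) # tie the map to the count
    return count_loss + lam * consistency


# Toy example: predicted count is right (3 trees), but the density
# map only sums to 2.0, so the consistency term contributes 1.0.
pred_count = torch.tensor([3.0])
pred_density = torch.full((1, 1, 2, 2), 0.5)  # sums to 2.0
loss = coupled_loss(pred_count, pred_density, torch.tensor([3.0]))
# loss.item() -> 1.0
```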

PRs to be split - will update

Licensing

TreeFormer does not list a license, and the ancestral code from DM-Count (which is mostly copied verbatim in TreeFormer) is MIT-licensed. Here, the backbone model is taken directly from transformers, and the various heads are re-implemented in PyTorch with some optimizations over the original.

Related Issue(s)

#809

AI-Assisted Development

  • I used AI tools (e.g., GitHub Copilot, ChatGPT, etc.) in developing this PR
  • I understand all the code I'm submitting
  • I have reviewed and validated all AI-generated code

AI tools used (if applicable):

Claude (Opus/Sonnet 4.6) and Codex 5.3/5.4

@jveitchmichaelis
Collaborator Author

Closing in favour of simpler branch

@jveitchmichaelis jveitchmichaelis mentioned this pull request Apr 22, 2026