Add generic observation processes which combine the convolution with the noise model. #644

cdc-mitzimorris · 2025-12-23T16:23:17Z

This PR adds work that was done in https://github.com/cdcent/cfa-pyrenew-hierarchical/pull/4 to PyRenew.

It adds the base observation process class, concrete implementations for Count processes and the abstract base class for Measurement processes, together with unit tests and two new tutorials for count and measurement observation processes respectively.

Once this PR and the work done in https://github.com/cdcent/cfa-pyrenew-hierarchical/pull/5 have been added to PyRenew, subsequent PRs will deprecate unused features and harmonize the documentation and tutorials.

codecov · 2025-12-23T16:27:21Z

Codecov Report

❌ Patch coverage is 98.40426% with 3 lines in your changes missing coverage. Please review.
✅ Project coverage is 97.26%. Comparing base (02446c5) to head (f932712).
⚠️ Report is 3 commits behind head on main.

Files with missing lines	Patch %	Lines
pyrenew/observation/count_observations.py	96.66%	2 Missing ⚠️
pyrenew/observation/noise.py	98.38%	1 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #644      +/-   ##
==========================================
+ Coverage   96.98%   97.26%   +0.28%     
==========================================
  Files          42       47       +5     
  Lines        1094     1282     +188     
==========================================
+ Hits         1061     1247     +186     
- Misses         33       35       +2

Flag	Coverage Δ
unittests	`97.26% <98.40%> (+0.28%)`	⬆️

Flags with carried forward coverage won't be shown. Click here to find out more.

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

github-actions · 2025-12-23T16:27:38Z

Thank you for your contribution @cdc-mitzimorris 🚀! Your github-pages is ready for download 👉 here 👈!
_{(The artifact expires on 2026-02-02T15:33:42Z. You can re-generate it by re-running the workflow here.)}

…ents, 'aggregate' instead of 'jurisdiction'

for more information, see https://pre-commit.ci

… into mem_generic_observations

for more information, see https://pre-commit.ci

docs/tutorials/observation_processes_counts.qmd

… into mem_generic_observations

cdc-mitzimorris · 2026-01-13T23:42:59Z

In preparing to add the model_builder class for component composition, it became clear that the noise model shouldn't manage the plates directly - this is the responsibility of the RV.
The utility class VectorizedRV wraps simple distributions in a plate, while hierarchical priors (e.g., HierarchicalNormalPrior) use internal plates with partial pooling. The noise model works with either because it only depends on the interface, not the implementation.
Updated file observation/noise.py and the corresponding notebook.

docs/tutorials/observation_processes_counts.qmd

docs/tutorials/observation_processes_measurements.qmd

dylanhmorris · 2026-01-15T15:12:19Z

pyrenew/observation/count_observations.py

-        expected_counts_safe = jnp.nan_to_num(expected_counts, nan=0.0)
+        predicted_counts = self._predicted_obs(infections)
+        self._deterministic("predicted_counts", predicted_counts)
+        predicted_counts_safe = jnp.nan_to_num(predicted_counts, nan=0.0)


^ This feels dangerous to do automatically and without warning. I think I would rather force the user to do it themself in _predicted_obs, where appropriate. Is there a good reason not to?

@cdc-mitzimorris flagging this

The nan_to_num is applied to predicted_counts from the latent infection process, not to user-provided observations. The NaN values are a structural artifact of the convolution burn-in (first len(delay_pmf)-1 days).
cf

PyRenew/pyrenew/observation/count_observations.py

Lines 145 to 148 in d103d0f

Notes

-----

Output preserves input timeline. First len(delay_pmf)-1 days return

-1 or ~0 (depending on noise model) due to NaN padding.

The nan_to_num is a pragmatic choice: the model doesn't crash, and the behavior is noted in the docstring. It's not a safeguard against misuse — it's just handling a structural artifact so the code runs.

pyrenew/observation/measurements.py

dylanhmorris

Thanks @cdc-mitzimorris! A few things to address and then I can re-review.

Co-authored-by: Dylan H. Morris <dylanhmorris@users.noreply.github.com>

… into mem_generic_observations

for more information, see https://pre-commit.ci

… into mem_generic_observations

cdc-mitzimorris · 2026-01-15T18:40:05Z

@dylanhmorris - ready for re-review - made all suggested changes to the tutorials and added arg "name" to observation processes so that user specifies the signal name.

pyrenew/observation/base.py

test/test_observation_measurements.py

dylanhmorris

Thanks @cdc-mitzimorris! Just a couple remaining questions.

cdc-mitzimorris · 2026-01-26T15:52:28Z

changes made.

cdc-mitzimorris · 2026-01-26T16:51:57Z

@dylanhmorris - conversation resolved.

dylanhmorris

Still need the separate noise module.

dylanhmorris · 2026-01-26T18:39:39Z

pyrenew/observation/noise.py

@cdc-mitzimorris this still needs to be implemented.

damonbayer · 2026-01-26T18:44:24Z

pyrenew/observation/count_observations.py

+    Output preserves input timeline. First len(delay_pmf)-1 days return
+    -1 or ~0 (depending on noise model) due to NaN padding.


What is meant by ~0? and why does the padding change depending on the noise model?

I think we should discuss this f2f in our upcoming meeting.

damonbayer · 2026-01-26T19:41:10Z

pyrenew/observation/count_observations.py

+        times : ArrayLike | None
+            Day indices for sparse observations. None for dense observations.


index relative to what vector?

cdc-mitzimorris added 11 commits September 15, 2025 18:24

Merge branch 'main' of https://github.com/CDCgov/PyRenew

680bb1e

update

2cb876b

Merge branch 'main' of github-bf06:CDCgov/PyRenew

60db8df

Merge branch 'main' of github-bf06:CDCgov/PyRenew

32a5314

Merge branch 'main' of github-bf06:CDCgov/PyRenew

d6213f2

Merge branch 'main' of github-bf06:CDCgov/PyRenew

96f27c9

Merge branch 'main' of github-bf06:CDCgov/PyRenew

1cb6fa2

Merge branch 'main' of github-bf06:CDCgov/PyRenew

f62e1e4

Merge branch 'main' of github-bf06:CDCgov/PyRenew

0c6785d

added generic observation processes and unit tests

35bba26

adding tutorials

89d7aab

cdc-mitzimorris requested review from SamuelBrand1, damonbayer and dylanhmorris as code owners December 23, 2025 16:23

cdc-mitzimorris and others added 8 commits December 23, 2025 12:07

improve test coverage

a2e4630

improve unit test coverage

24096bc

consistent names: 'subpop' (not site), 'sensor' for site/lab measurem…

671d9d0

…ents, 'aggregate' instead of 'jurisdiction'

consistent names: 'subpop' (not site), 'sensor' for site/lab measurem…

57d2fba

…ents, 'aggregate' instead of 'jurisdiction'

[pre-commit.ci] auto fixes from pre-commit.com hooks

7efb524

for more information, see https://pre-commit.ci

add observation types

8a7947f

Merge branch 'mem_generic_observations' of github-bf06:CDCgov/PyRenew…

571ada3

… into mem_generic_observations

[pre-commit.ci] auto fixes from pre-commit.com hooks

6dff8cd

for more information, see https://pre-commit.ci

dylanhmorris reviewed Dec 31, 2025

View reviewed changes

docs/tutorials/observation_processes_counts.qmd Outdated Show resolved Hide resolved