Load long-term US projected datasets#362

Merged: MaxGhenis merged 3 commits into main from codex/us-long-term-datasets on May 16, 2026

@MaxGhenis
Contributor

Summary

  • add load_long_term_datasets for pre-built US projected H5 datasets with sidecar metadata validation
  • support both the existing policyengine.py entity-table H5 layout and the PolicyEngine Core variable/period H5 layout
  • export the long-term dataset loader and metadata validator from policyengine.us
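
To illustrate what "sidecar metadata validation" could look like, here is a minimal sketch using only the Python standard library. It is not the code from this PR: the sidecar naming convention (`<dataset>.metadata.json`), the required keys, the layout labels, and the helper names `sidecar_path` and `validate_sidecar` are all assumptions made for illustration. The actual validator exported from policyengine.us may use a different schema and signature.

```python
import json
from pathlib import Path

# Hypothetical schema: the real sidecar format is not shown in this PR.
REQUIRED_KEYS = {"year", "layout"}
# Hypothetical labels for the two H5 layouts named in the summary.
KNOWN_LAYOUTS = {"entity_tables", "variable_period"}


def sidecar_path(dataset_path: Path) -> Path:
    """Return the assumed sidecar location: foo.h5 -> foo.metadata.json."""
    return dataset_path.with_suffix(".metadata.json")


def validate_sidecar(dataset_path: Path, expected_year: int) -> dict:
    """Load the sidecar JSON and raise ValueError if it does not match the request."""
    path = sidecar_path(dataset_path)
    if not path.exists():
        raise ValueError(f"missing sidecar metadata: {path}")

    metadata = json.loads(path.read_text())

    # Fail early on an incomplete sidecar rather than loading a large H5 file.
    missing = REQUIRED_KEYS - metadata.keys()
    if missing:
        raise ValueError(f"sidecar is missing keys: {sorted(missing)}")

    if metadata["layout"] not in KNOWN_LAYOUTS:
        raise ValueError(f"unknown layout: {metadata['layout']!r}")

    if metadata["year"] != expected_year:
        raise ValueError(
            f"sidecar year {metadata['year']} does not match requested {expected_year}"
        )
    return metadata
```

Checking the sidecar before opening the H5 file keeps failures cheap: a mismatched year or an unrecognized layout is reported without reading the projected dataset itself.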

Why

Long-run analyses can produce expensive projected datasets outside the normal bundled release path. This gives policyengine.py a small, metadata-aware loader for those artifacts without making the package own the projection/calibration build itself.

Validation

  • uv run --extra dev ruff check src/policyengine/tax_benefit_models/us/__init__.py src/policyengine/tax_benefit_models/us/datasets.py tests/test_us_long_term_datasets.py
  • uv run --extra dev python -m pytest tests/test_us_long_term_datasets.py -q

@MaxGhenis force-pushed the codex/us-long-term-datasets branch from 4f37e4f to 32699e5 on May 16, 2026 00:15
@MaxGhenis marked this pull request as ready for review on May 16, 2026 04:06
@MaxGhenis merged commit ab38116 into main on May 16, 2026 (11 checks passed)
@MaxGhenis deleted the codex/us-long-term-datasets branch on May 16, 2026 04:06