hotdata-dlt-destination

hotdata-dlt-destination is a Python package that implements a custom dlt destination for loading data into Hotdata managed databases with deterministic idempotency keys and explicit write semantics.

What this repo includes

- Custom destination via @dlt.destination in src/hotdata_dlt_destination/destination.py
- Managed-database ingestion through hotdata-runtime (upload_parquet, load_managed_table, SELECT)
- Read-modify-write append/merge using only supported API operations
- Deterministic batch and row idempotency keys
- Example pipelines:
  - hotdata-dlt-basic-pipeline (append)
  - hotdata-dlt-incremental-pipeline (upsert/merge)
  - hotdata-dlt-linear-pipeline (Linear issues → Hotdata)
- Unit tests in tests/
- Architecture and runbook docs in docs/

Data contract defaults

- Managed database: database_name (default dlt, created on first load when missing)
- Schema: public
- Table name: normalized lowercase dlt table identifier
- Nested table names: {parent}__{child}
- Write semantics (all use load_managed_table(replace) under the hood):
  - replace: upload batch parquet and replace the target table
  - append: read existing target rows, append batch in Python, replace target
  - upsert/merge: read existing rows, upsert by dlt primary_key (or _hotdata_row_key), replace target
- Idempotency:
  - Batch key _hotdata_batch_key = hash(table + full batch payload)
  - Row key _hotdata_row_key = hash(table + canonical row payload)
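
The write semantics and key derivation above can be sketched in plain Python. This is a minimal illustration, not the package's implementation: the SHA-256 hash, the JSON canonicalization, and the helper names (row_key, batch_key, append_rows, upsert_rows) are assumptions. Only the ideas of hashing table plus payload and of read-modify-write come from the contract above.

```python
import hashlib
import json


def _digest(table, payload):
    # Assumption: canonical JSON (sorted keys, fixed separators) stands in for
    # whatever canonicalization the package actually uses.
    canonical = json.dumps(payload, sort_keys=True, separators=(",", ":"), default=str)
    return hashlib.sha256(f"{table}:{canonical}".encode("utf-8")).hexdigest()


def row_key(table, row):
    # Row key: hash of the table name plus the canonical row payload.
    return _digest(table, row)


def batch_key(table, rows):
    # Batch key: hash of the table name plus the full batch payload.
    return _digest(table, rows)


def append_rows(existing, batch):
    # Append: existing rows followed by the batch; the result replaces the target.
    return list(existing) + list(batch)


def upsert_rows(existing, batch, table, primary_key=None):
    # Upsert: match on the primary key columns when given, otherwise on the row key.
    def key_of(row):
        if primary_key:
            return tuple(row[column] for column in primary_key)
        return row_key(table, row)

    merged = {key_of(row): row for row in existing}
    for row in batch:
        merged[key_of(row)] = row  # later rows overwrite earlier ones
    return list(merged.values())
```

Because the keys depend only on the table name and the payload, replaying the same batch yields the same keys, which is what makes a retried load detectable.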

Configure

Set environment variables (or pass destination kwargs / dlt secrets):

- HOTDATA_API_KEY
- HOTDATA_WORKSPACE
- HOTDATA_DATABASE (managed database name, default dlt)
- Optional: HOTDATA_SCHEMA, HOTDATA_WRITE_DISPOSITION, HOTDATA_DECLARED_TABLES, and retry tuning
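
As a rough sketch of how these variables might be resolved, the snippet below reads them with the defaults listed in this README. The HotdataSettings class and its from_env method are hypothetical names used for illustration, and treating the API key and workspace as required is an assumption; the package's actual configuration handling may differ.

```python
import os
from dataclasses import dataclass


@dataclass(frozen=True)
class HotdataSettings:
    # Hypothetical container; not part of the package's public API.
    api_key: str
    workspace: str
    database_name: str = "dlt"
    schema: str = "public"

    @classmethod
    def from_env(cls, env=None):
        env = os.environ if env is None else env
        # Assumption: the API key and workspace have no usable default.
        required = ("HOTDATA_API_KEY", "HOTDATA_WORKSPACE")
        missing = [name for name in required if not env.get(name)]
        if missing:
            raise ValueError("missing required settings: " + ", ".join(missing))
        return cls(
            api_key=env["HOTDATA_API_KEY"],
            workspace=env["HOTDATA_WORKSPACE"],
            database_name=env.get("HOTDATA_DATABASE") or "dlt",
            schema=env.get("HOTDATA_SCHEMA") or "public",
        )
```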

For pipelines with multiple tables, declare every target table when the managed database is first created:

hotdata_destination(
    database_name="analytics",
    declared_tables=["customers", "orders", "orders__items"],
)
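
To build the declared_tables list for resources with nested data, the {parent}__{child} convention can be applied mechanically. The helper names below are hypothetical, and plain lowercasing is a simplification of dlt's identifier normalization, so treat this as a sketch rather than the exact naming logic.

```python
def nested_table_name(parent, child):
    # Nested tables follow the {parent}__{child} convention.
    return f"{parent.lower()}__{child.lower()}"


def declared_table_names(tables):
    # tables maps each parent table to the list of its nested child tables.
    names = []
    for parent, children in tables.items():
        names.append(parent.lower())
        names.extend(nested_table_name(parent, child) for child in children)
    return names


tables = declared_table_names({"customers": [], "orders": ["items"]})
print(tables)
```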

Usage

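import dlt
from hotdata_dlt_destination import hotdata_destination


@dlt.resource(name="customers")
def my_resource():
    # Placeholder rows for illustration; yield your own records here.
    yield {"id": 1, "name": "Ada"}


pipeline = dlt.pipeline(
    pipeline_name="my_pipeline",
    destination=hotdata_destination(
        database_name="analytics",
        write_disposition="append",
        declared_tables=["customers"],
    ),
)
pipeline.run(my_resource())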

Per-resource write_disposition and primary_key from dlt take precedence over the destination default.
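
This precedence rule can be pictured as a simple fallback. The function below is a hypothetical illustration (the name effective_write_settings and the dictionary shape are assumptions, not part of dlt or this package): hints set on a resource win, and the destination default applies only when a hint is absent.

```python
def effective_write_settings(resource_hints, destination_default):
    # Resource-level write_disposition wins; otherwise use the destination default.
    disposition = resource_hints.get("write_disposition") or destination_default
    # Accept a single column name or a list of column names.
    primary_key = resource_hints.get("primary_key")
    if isinstance(primary_key, str):
        primary_key = [primary_key]
    return {
        "write_disposition": disposition,
        "primary_key": list(primary_key or []),
    }
```

For example, a resource declared with write_disposition="merge" and primary_key="id" would be merged on id even if the destination was created with write_disposition="append".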

Developer workflow

uv sync
uv run ruff check .
uv run pytest
uv run hotdata-dlt-destination

Run pipelines:

uv run hotdata-dlt-basic-pipeline
uv run hotdata-dlt-incremental-pipeline
uv run hotdata-dlt-linear-pipeline

Run the live end-to-end integration test (requires the Hotdata and Linear environment variables to be set):

uv run pytest tests/test_e2e_linear_hotdata.py -m integration
