feat: Multi instance metrics #31

Open
CasperTeirlinck wants to merge 14 commits into main from feat/metric-multi-instance

Conversation

Contributor

@CasperTeirlinck commented Apr 24, 2026

This PR adds the ability to configure multiple instances for the same Metric class.

Context

#29 made it possible to configure metrics using the constructor, which is more intuitive than subclassing.
However, it is not yet possible to have more than one instance of the same metric. This is a common use case, for example for the GitTrackedFileCountMetric:

Before, you would define multiple metric classes that track certain files like this:

class CruftFileExistsMetric(GitTrackedFileCountMetric):
    name: ClassVar[str] = "cruft_file_exists"
    description: ClassVar[str] = "Whether .cruft.json exists"
    pattern: str = ".cruft.json"

class SomeConfigExistsMetric(GitTrackedFileCountMetric):
    name: ClassVar[str] = "config_exists"
    description: ClassVar[str] = "Whether config.yml exists"
    pattern: str = "config.yml"

class DagCountMetric(GitTrackedFileCountMetric):
    name: ClassVar[str] = "airflow_dag_count"
    description: ClassVar[str] = "Number of DAG files"
    pattern: str = "dags/*.py"

Because of #29, this is now possible by simply creating multiple instances of the GitTrackedFileCountMetric:

GitTrackedFileCountMetric(
    name="cruft_file_exists",
    description="Whether .cruft.json exists",
    pattern=".cruft.json",
)
GitTrackedFileCountMetric(
    name="config_exists",
    description="Whether config.yml exists",
    pattern="config.yml",
)
GitTrackedFileCountMetric(
    name="airflow_dag_count",
    description="Number of DAG files",
    pattern="dags/*.py",
)

except that the metric calculation still assumed one instance per metric class, so it would keep only a single Measurement out of the three metrics configured above.

Summary

  • Instance-based metric configuration: Changed name, description, and unit from ClassVar to instance fields on the Metric class. This allows creating multiple instances of the same metric class with different configurations.
  • Multiple metric instances support: Changed the measurements signature from dict[type[Metric], Measurement] to dict[type[Metric], list[Measurement]] to support multiple instances of the same metric class. It is the responsibility of a metric's calculate method to handle multiple Measurements of a dependent metric. Metric dependencies are still defined using Metric classes.
  • Refactored the executor module because it had grown too large (~540 lines).
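A minimal sketch of the instance-based shape described above. The class names `Metric` and `Measurement` and the instance fields follow the PR description, but the field details and the `record` helper here are assumptions for illustration, not the actual checkup code:

```python
from dataclasses import dataclass


@dataclass
class Measurement:
    # Hypothetical shape; the real fields live in the checkup codebase.
    metric_name: str
    value: float


@dataclass
class Metric:
    # name, description, and unit are now instance fields rather than
    # ClassVar, so several differently-configured instances of the same
    # metric class can coexist.
    name: str
    description: str
    unit: str = ""


# Measurements keyed by metric class, each mapping to a *list* of
# Measurements, as in the new dict[type[Metric], list[Measurement]] signature.
measurements: dict[type, list[Measurement]] = {}


def record(metric: Metric, value: float) -> None:
    # Append instead of overwrite, so multiple instances of the same
    # class no longer clobber each other's results.
    measurements.setdefault(type(metric), []).append(
        Measurement(metric_name=metric.name, value=value)
    )


record(Metric(name="cruft_file_exists", description="Whether .cruft.json exists"), 1.0)
record(Metric(name="config_exists", description="Whether config.yml exists"), 0.0)
```

Both measurements end up under the same `Metric` class key, which is exactly the case the old one-measurement-per-class assumption could not represent.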

@CasperTeirlinck marked this pull request as ready for review April 24, 2026 13:52
Member

@jvanbuel left a comment


To be quite honest, I don't find the list of measurements that elegant (especially that you now always need to index with zero to get the default behaviour of one measurement per metric), but I do understand the reasoning. The requirement that downstream metrics need to decide what to do with multiple measurements also makes sense. The utility function to assert that there is only one measurement seems like a good way to make dealing with this easier.

Maybe one idea: what if, instead of a list of measurements, the signature of measure used an iterator of measurements? Then the default behaviour becomes calling next(). I don't know why, but to me that feels a bit better. But that's a feeling, which is not really a sound basis for making decisions.
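For illustration, the two access patterns being compared (list indexing versus pulling from an iterator), plus the single-measurement utility mentioned above. `single_measurement` is a hypothetical name; the actual utility in the PR may differ:

```python
def single_measurement(measurements: list):
    # Hypothetical helper, assuming the utility asserts the common case
    # of exactly one measurement per metric class and then unwraps it.
    if len(measurements) != 1:
        raise ValueError(f"expected exactly one measurement, got {len(measurements)}")
    return measurements[0]


ms = ["m1"]

# With a list signature, the one-measurement default means indexing with zero:
value_from_list = ms[0]

# With an iterator signature, the default becomes calling next():
value_from_iter = next(iter(ms))
```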

logger.debug("Executing provider: %s", provider.name)
data = provider.provide()
context[provider.name] = data
if provider.is_tag_provider():
Member


The tag provider is treated as a special provider, but I'm not sure that's absolutely necessary. What prevents the tag provider from just enriching the context, and the metrics from fetching data from the context to populate their tags?

On the other hand, we do have a tag attribute for each metric, so tags are kind of special. Not blocking, just wanted to hear your thoughts on this.

Contributor Author

@CasperTeirlinck commented Apr 29, 2026


Yeah, I agree. I see no real reason to treat it as special when executing the providers; using the context is much more consistent and simplifies the code a bit too. In the MetricCalculator we can then just access the tags from the context and add them to the tags attribute of the measurements.
I refactored it here: #33
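As a sketch of the non-special-cased approach discussed above: every provider, the tag provider included, simply writes into the shared context, and tags are read back out on the metric side. The provider names and the `run_providers` function are hypothetical, not the refactored checkup code:

```python
from typing import Callable


def run_providers(providers: dict[str, Callable[[], dict]]) -> dict:
    # No special-casing: every provider, the tag provider included,
    # simply enriches the shared context under its own name.
    context: dict = {}
    for name, provide in providers.items():
        context[name] = provide()
    return context


providers = {
    "git": lambda: {"tracked_files": ["dags/a.py", "dags/b.py"]},
    "tags": lambda: {"team": "data-platform"},
}
context = run_providers(providers)

# A MetricCalculator could then read the tags back out of the context
# and attach them to each measurement's tags attribute.
measurement_tags = context.get("tags", {})
```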

Comment thread: src/checkup/validators.py (Outdated)
if len(instances) > 1
}
if duplicates:
# Report the first duplicate found
Member


Why only report the first duplicate?

Contributor Author


No reason, I think. I changed it so it reports all duplicates.
d22049a

Comment thread: src/checkup/validators.py (Outdated)
name, classes = next(iter(duplicates.items()))
raise DuplicateMetricNameError(name, classes)
name, instances = next(iter(duplicates.items()))
raise DuplicateMetricNameError(name, [type(m) for m in instances])
Member


Doesn't the iterator return a list of the same types? Or am I misinterpreting this?

Contributor Author


It gives a list of the Metric classes that share the same name, so they could be different types if you gave the same name to two different metrics, I think.
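A sketch of the duplicate-name check being discussed, matching the diff fragments above (group instances by name, keep every name claimed more than once). The `find_duplicate_names` function and the toy metric classes are assumptions for illustration, not the actual validator:

```python
from collections import defaultdict


class Metric:
    def __init__(self, name: str):
        self.name = name


class GitTrackedFileCountMetric(Metric):
    pass


class OtherMetric(Metric):
    pass


def find_duplicate_names(metrics: list) -> dict:
    # Group metric instances by their configured name.
    by_name: dict = defaultdict(list)
    for metric in metrics:
        by_name[metric.name].append(metric)
    # A name is duplicated when more than one instance claims it. The
    # classes in the reported list can differ, since two different
    # Metric subclasses may be given the same name.
    return {
        name: [type(m) for m in instances]
        for name, instances in by_name.items()
        if len(instances) > 1
    }


duplicates = find_duplicate_names([
    GitTrackedFileCountMetric("dag_count"),
    OtherMetric("dag_count"),
    GitTrackedFileCountMetric("config_exists"),
])
```

This is why the duplicates mapping reports classes rather than a single type: the clash can span different subclasses.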

@CasperTeirlinck
Contributor Author

To be quite honest, I don't find the list of measurements that elegant (especially that you now always need to index with zero to get the default behaviour of one measurement per metric), but I do understand the reasoning. The requirement that downstream metrics need to decide what to do with multiple measurements also makes sense. The utility function to assert that there is only one measurement seems like a good way to make dealing with this easier.

Maybe one idea: what if, instead of a list of measurements, the signature of measure used an iterator of measurements? Then the default behaviour becomes calling next(). I don't know why, but to me that feels a bit better. But that's a feeling, which is not really a sound basis for making decisions.

@jvanbuel I agree, it does not feel super intuitive and the list is a bit ugly, especially the long type hint. I propose an alternative using a Measurements class to hopefully provide a cleaner interface for the measurements here: #34
Internally it still uses a list because we need to append, but it is type-hinted as a Sequence to the outside.
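The list-inside, Sequence-outside idea could look roughly like this. This is a sketch of the approach, not the actual `Measurements` class from #34; the method names (`_append`, `single`) are assumptions:

```python
from collections.abc import Sequence


class Measurements(Sequence):
    """Hypothetical wrapper: backed by a list internally (so the
    executor can append while collecting), but exposed to callers as a
    read-only Sequence."""

    def __init__(self) -> None:
        self._items: list = []

    def _append(self, item) -> None:
        # Internal: used while collecting measurements.
        self._items.append(item)

    # Implementing __getitem__ and __len__ is enough for the Sequence
    # ABC to supply iteration, containment checks, etc.
    def __getitem__(self, index):
        return self._items[index]

    def __len__(self) -> int:
        return len(self._items)

    def single(self):
        # Convenience for the common one-measurement-per-metric case,
        # replacing the awkward [0] indexing.
        if len(self) != 1:
            raise ValueError(f"expected exactly one measurement, got {len(self)}")
        return self[0]


ms = Measurements()
ms._append("m1")
```

Subclassing `collections.abc.Sequence` keeps the read path list-like (`ms[0]`, `for m in ms`, `len(ms)`) without exposing mutation in the public type hint.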

