Skip to content

Vocabulary/data drift for OpenContext material URIs used in facets (missing labels downstream) #207

@rdhyee

Description

@rdhyee

Context

Downstream facet UIs use vocab_labels.parquet (built in isamplesorg.github.io) to render friendly labels for material URIs. Two OpenContext material URIs appear in facet data but do not resolve to labels:

  • https://w3id.org/isample/opencontext/material/0.1/organicanimalproduct (count 261)
  • https://w3id.org/isample/opencontext/material/0.1/plantmaterial (count 1)

These URIs are present in facet_summaries but absent from vocab_labels because they appear not to be in the current OpenContext material extension TTL used for label generation (which has organicanimalmaterial and organicplantmaterial).

Why filing in metadata

This seems like a metadata/vocabulary governance question: are these legacy/invalid terms, aliases, or missing concepts that should be represented explicitly?

Repro SQL

SELECT facet_value, count
FROM read_parquet('https://data.isamples.org/isamples_202601_facet_summaries.parquet')
WHERE facet_type='material'
  AND (facet_value ILIKE '%organicanimalproduct%'
       OR facet_value ILIKE '%plantmaterial%');
WITH facet AS (
  SELECT DISTINCT facet_value AS uri
  FROM read_parquet('https://data.isamples.org/isamples_202601_facet_summaries.parquet')
  WHERE facet_type='material'
), labels AS (
  SELECT DISTINCT uri
  FROM read_parquet('https://data.isamples.org/vocab_labels.parquet')
  WHERE lang='en'
)
SELECT f.uri
FROM facet f
LEFT JOIN labels l USING (uri)
WHERE l.uri IS NULL
ORDER BY f.uri;

Requested decision

Please define canonical handling for these two URIs:

  1. Add/restore terms (or deprecated aliases) in the appropriate vocabulary repo/profile, OR
  2. Declare them legacy/invalid and provide canonical replacements + mapping guidance for exporters.

Cross-repo linkage

Related integration issue (consumer/build side):

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions