Vocabulary/data drift for OpenContext material URIs used in facets (missing labels downstream) #207

@rdhyee

Context

Downstream facet UIs use vocab_labels.parquet (built in isamplesorg.github.io) to render friendly labels for material URIs. Two OpenContext material URIs appear in facet data but do not resolve to labels:

  • https://w3id.org/isample/opencontext/material/0.1/organicanimalproduct (count 261)
  • https://w3id.org/isample/opencontext/material/0.1/plantmaterial (count 1)

These URIs are present in facet_summaries but absent from vocab_labels, apparently because they are not defined in the current OpenContext material extension TTL used for label generation. That TTL defines organicanimalmaterial and organicplantmaterial instead.

Why this is filed in metadata

This seems like a metadata/vocabulary governance question: are these legacy/invalid terms, aliases, or missing concepts that should be represented explicitly?

Repro SQL

-- Query 1: show the unresolved material URIs and their counts.
-- Note: the '%plantmaterial%' pattern would also match organicplantmaterial if it is present.
SELECT facet_value, count
FROM read_parquet('https://data.isamples.org/isamples_202601_facet_summaries.parquet')
WHERE facet_type='material'
  AND (facet_value ILIKE '%organicanimalproduct%'
       OR facet_value ILIKE '%plantmaterial%');

-- Query 2: list every material facet URI that has no English label in vocab_labels.
WITH facet AS (
  SELECT DISTINCT facet_value AS uri
  FROM read_parquet('https://data.isamples.org/isamples_202601_facet_summaries.parquet')
  WHERE facet_type='material'
), labels AS (
  SELECT DISTINCT uri
  FROM read_parquet('https://data.isamples.org/vocab_labels.parquet')
  WHERE lang='en'
)
SELECT f.uri
FROM facet f
LEFT JOIN labels l USING (uri)  -- anti-join: keep facet URIs with no matching label row
WHERE l.uri IS NULL
ORDER BY f.uri;

Requested decision

Please define canonical handling for these two URIs:

  1. Add/restore terms (or deprecated aliases) in the appropriate vocabulary repo/profile, OR
  2. Declare them legacy/invalid and provide canonical replacements + mapping guidance for exporters.
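
If option 2 is chosen, exporter-side mapping guidance could be as small as an alias table applied before facet counts are written. The sketch below is only an illustration: the function names are hypothetical, and the replacement targets are assumptions, not decisions. They simply pair each unresolved URI with the similarly named term that the current TTL does define.

```python
# Minimal sketch of exporter-side alias normalization (hypothetical helper names).
BASE = "https://w3id.org/isample/opencontext/material/0.1/"

# ASSUMPTION: these pairings are placeholders pending the decision requested
# in this issue; they are not confirmed canonical replacements.
LEGACY_TO_CANONICAL = {
    BASE + "organicanimalproduct": BASE + "organicanimalmaterial",
    BASE + "plantmaterial": BASE + "organicplantmaterial",
}


def normalize_material_uri(uri: str) -> str:
    """Return the canonical URI for a known legacy alias, else the input unchanged."""
    return LEGACY_TO_CANONICAL.get(uri, uri)


def normalize_counts(rows):
    """Merge (uri, count) facet rows after normalization, summing counts per URI."""
    merged = {}
    for uri, count in rows:
        key = normalize_material_uri(uri)
        merged[key] = merged.get(key, 0) + count
    return merged
```

Summing counts after normalization matters because a legacy URI and its replacement may both occur in the same export; without the merge they would show up as two facet entries.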

Cross-repo linkage

Related integration issue (consumer/build side):
