Data Provenance (DPS) TC

 View Only

Use cases and data source mapping

  • 1.  Use cases and data source mapping

    Posted 08-26-2025 12:56

    Hello DPS TC Members,

    As mentioned on this morning's call, the use cases are now available here:
    Use Cases Document

    These demonstrate (with a previous version of the standards and metadata) where the standards can be applied and where they cannot. This could be useful for the mapping work that Stefan highlighted.

    In addition, the following data sources were mapped for values:

    • data.gov – Metadata fields: title, description, publisher, issued date, spatial/temporal coverage, license
    • data.europa.eu – Uses DCAT-AP (EU's profile of DCAT), includes dataset provenance, spatial coverage, issuance dates, licensing
    • UK Data Service and data.gov.uk – Specifically for public sector datasets
    • Zenodo (operated by CERN, using DataCite schema), and Figshare – Methods, licensing, keywords
    • ICPSR – Provenance, methodology, confidentiality
    • PANGAEA – ISO 19115 lineage & geospatial metadata
    • GA4GH Data Use Ontology (DUO) – Intended use metadata; dbGaP (NIH) for consent, use restrictions, provenance
    • LOV (Linked Open Vocabularies) – Vocabularies such as Dublin Core, PROV, DCAT, DPV
    • SPDX License List – Standardized license metadata
    • W3C Data Privacy Vocabulary (DPV) – Consent, PETs, processing location
    • GA4GH DUO – Standardized restrictions on biomedical datasets

    Please note that the data values we have from our Alliance members are subject to an NDA. They would require significant scrubbing before being shareable. However, if we think that would accelerate progress, I can work on preparing as much as possible before the end of the week.

    Thank you,

    Kristina



    ------------------------------
    Kristina Podnar
    Data & Trust Alliance
    ------------------------------