language en

GOLDCASE: A Generic Ontology Layer for Data Catalog Semantics

Release April 4th, 2022

This version:
http://dqm.faw.jku.at/ontologies/GOLDCASE/1.0.0
Latest version:
http://dqm.faw.jku.at/ontologies/GOLDCASE
Authors:
Johannes Schrott, Johannes Kepler University Linz
Lisa Ehrlinger, Johannes Kepler University Linz and Software Competence Center Hagenberg GmbH
Sabine Weidinger, Software Competence Center Hagenberg GmbH
Imported Ontologies:
dcat
dsd
Download serialization:
JSON-LD RDF/XML N-Triples TTL
License:
MIT License
Provenance of this page
Ontology Specification Draft

Abstract

Data catalogs automatically collect metadata from distributed data sources and provide a unified and easily accessible view on the data. Many existing data catalog tools focus on the automatic collection of technical metadata (e.g., from a data dictionary) into a central repository. The functionality of annotating data with semantics (i.e., its meaning) is often too less expressive in these tools for modelling complex real-world scenarios. We propose a generic ontology layer (GOLDCASE), which maps the semantics of the data in the form of a (business) ontology to the technical metadata provided by a data catalog. In order to apply GOLDCASE onto real-world use cases, it needs to be customized. For details on this we refer to our publication which is accepted at the MTSR Conference 2022.

GOLDCASE: Overview back to ToC

GOLDCASE re-uses the existing vocabularies DCAT and DSD. To connect the both, dsd:Datasource is set to be equivalent to dcat:Dataset. The following classes and properties are used:

Classes

Object Properties

GOLDCASE: Description back to ToC

Structure of GOLDCASE

The image shows how the classes used in GOLDCASE relate to each other.

Cross-reference for GOLDCASE classes, object properties and data properties back to ToC

This section provides details for each class and property used in GOLDCASE.

Classes

Attributec back to ToC or Class ToC

IRI: http://dqm.faw.jku.at/ontologies/DSD#Attribute

An Attribute describes a property of a Concept.
Is defined by
http://dqm.faw.jku.at/ontologies/DSD#Attribute
is in domain of
isAttributeOf op
is in range of
hasAttribute op

Catalogc back to ToC or Class ToC

IRI: http://www.w3.org/ns/dcat#Catalog

A curated collection of metadata about resources (e.g., datasets and data services in the context of a data catalog).
Is defined by
http://www.w3.org/TR/vocab-dcat/
has super-classes
Dataset c
is in domain of
hasPart op

Conceptc back to ToC or Class ToC

IRI: http://dqm.faw.jku.at/ontologies/DSD#Concept

A Concept defines a part of a data source that can be e.g. a class in an object oriented source or a table in a relational database.
Is defined by
http://dqm.faw.jku.at/ontologies/DSD#Concept
has keys
has primary key dp

Data servicec back to ToC or Class ToC

IRI: http://www.w3.org/ns/dcat#DataService

A site or end-point providing operations related to the discovery of, access to, or processing functions on, data or related resources.
Is defined by
http://www.w3.org/TR/vocab-dcat/
has super-classes
resource
is in domain of
serves dataset op
is in range of
hasPart op

Datasetc back to ToC or Class ToC

IRI: http://www.w3.org/ns/dcat#Dataset

A collection of data, published or curated by a single source, and available for access or download in one or more representations.
Is defined by
http://www.w3.org/TR/vocab-dcat/
has super-classes
resource
has sub-classes
Catalog c
is in range of
serves dataset op

Datasourcec back to ToC or Class ToC

IRI: http://dqm.faw.jku.at/ontologies/DSD#Datasource

A generic class for a data source. Could be anything from RDF-Triple Store, relational Datasource, comma separated list or just semi-structured text files.
Is defined by
http://dqm.faw.jku.at/ontologies/DSD#Datasource
is equivalent to
Dataset c
is in domain of
hasSchema op
is in range of
isSchemaOf op

Schemac back to ToC or Class ToC

IRI: http://dqm.faw.jku.at/ontologies/DSD#Schema

Is defined by
http://dqm.faw.jku.at/ontologies/DSD#Schema
is in domain of
isSchemaOf op
is in range of
hasSchema op

Object Properties

hasAttributeop back to ToC or Object Property ToC

IRI: http://dqm.faw.jku.at/ontologies/DSD#hasAttribute

Describes the number of Attributes a Concept or Association comprises.
Is defined by
http://dqm.faw.jku.at/ontologies/DSD#hasAttribute
has domain
association c or Concept c
has range
Attribute c
is inverse of
isAttributeOf op

hasComponentop back to ToC or Object Property ToC

IRI: http://dqm.faw.jku.at/ontologies/DSD#hasComponent

Describes the number of components a Datasource comprises.
Is defined by
http://dqm.faw.jku.at/ontologies/DSD#hasComponent
has domain
Datasource c or Schema c
has range
association c or Attribute c or Concept c or foreign key c or primary key c
is inverse of
isComponentOf op

hasSchemaop back to ToC or Object Property ToC

IRI: http://dqm.faw.jku.at/ontologies/DSD#hasSchema

Is defined by
http://dqm.faw.jku.at/ontologies/DSD#hasSchema

has characteristics: inverse functional

has domain
Datasource c
has range
Schema c
is inverse of
isSchemaOf op

isAttributeOfop back to ToC or Object Property ToC

IRI: http://dqm.faw.jku.at/ontologies/DSD#isAttributeOf

Describes an Attributes's connection to its Concept or Association.
Is defined by
http://dqm.faw.jku.at/ontologies/DSD#isAttributeOf
has domain
Attribute c
has range
association c or Concept c
is inverse of
hasAttribute op

isComponentOfop back to ToC or Object Property ToC

IRI: http://dqm.faw.jku.at/ontologies/DSD#isComponentOf

Describes a components connection to its Datasource.
Is defined by
http://dqm.faw.jku.at/ontologies/DSD#isComponentOf
has domain
association c or Attribute c or Concept c or foreign key c or primary key c
has range
Datasource c or Schema c
is inverse of
hasComponent op

isSchemaOfop back to ToC or Object Property ToC

IRI: http://dqm.faw.jku.at/ontologies/DSD#isSchemaOf

Is defined by
http://dqm.faw.jku.at/ontologies/DSD#isSchemaOf

has characteristics: functional

has domain
Schema c
has range
Datasource c
is inverse of
hasSchema op

serves datasetop back to ToC or Object Property ToC

IRI: http://www.w3.org/ns/dcat#servesDataset

A collection of data that this DataService can distribute.
Is defined by
http://www.w3.org/TR/vocab-dcat/
has domain
Data service c
has range
Dataset c

hasPartop back to ToC or Object Property ToC

IRI: http://www.w3.org/ns/dcat#hasPart

A site or endpoint that is listed in the catalog.
Is defined by
http://www.w3.org/TR/vocab-dcat/
has domain
Catalog c
has range
Data service c

Legend back to ToC

c: Classes
op: Object Properties

References back to ToC

The research reported on this webpage has been funded by BMK, BMDW, and the State of Upper Austria in the frame of the COMET Programme managed by FFG.

Acknowledgments back to ToC

The authors would like to thank Silvio Peroni for developing LODE, a Live OWL Documentation Environment, which is used for representing the Cross Referencing Section of this document and Daniel Garijo for developing Widoco, the program used to create the template used in this documentation.