Skip to content Learn about the access keys available for Education, Skills and Employment metadata registry

Concept help - Data Set

A Data Set describes a record of data, including any location or time boundaries for the data, that has been captured and is available for use under a specific licence. A Data Set may be included in a Data Catalog, and can reference multiple Distributions that record different parts or formats of the data that are available to download.

A a dataset in DCAT is defined as a "collection of data, published or curated by a single agent, and available for access or download in one or more formats". A dataset does not have to be available as a downloadable file. For example, a dataset that is available via an API can be defined as an instance of dcat:Dataset and the API can be defined as an instance of dcat:Distribution. DCAT itself does not define properties specific to APIs description. These are considered out of the scope of this version of the vocabulary. Nevertheless, this can be defined as a profile of the DCAT vocabulary.

Fields available on this metadata type

Field ISO definition
Name The primary name used for human identification purposes.
Definition Representation of a concept by a descriptive statement which serves to differentiate it from related concepts. (3.2.39)
Is Federated
Is Not Federable
Version Unique version identifier of this metadata item.
References Significant documents that contributed to the development of the metadata item which were not the direct source for the metadata content.
Origin The source (e.g. document, project, discipline or model) for the item (8.1.2.2.3.5)
Comments Descriptive comments about the metadata item (8.1.2.2.3.4)
Deleted The date after which the item has been soft deleted and is no longer visible in the registry
License Information about the license document under which the dataset is made available.
Rights Information about rights held in and over the dataset.
Release Date Date of formal publication of the dataset.
Modification Date Most recent date on which the dataset was changed, updated or modified.
Frequency The frequency at which dataset is published.
Spatial Coverage Spatial or geographic coverage of the dataset.
Temporal Coverage The temporal or time period that the dataset covers.
Catalog An entity responsible for making the dataset available.
Landing Page A Web page that can be navigated to in a Web browser to gain access to the dataset, its distributions and/or additional information
Contact Point Relevant contact information for the Dataset.
Conforming Specification An established standard to which the described resource conforms.
Item Base

Custom Fields

Field Short definition Long definition
Purpose The primary business purpose of the data asset. “This is collected/used for…”
This field was added for the National Skills Commission and is not mandatory for other stewardship organisations.
Information Type
This field is not mandatory.
Technology
This field is not mandatory.
Status Whether the asset is being actively updated or not. * Active – currently being modified and added to * Inactive – no longer being kept up to date * Archive – inactive and removed * In development – data is currently being developed
This field was added for the National Skills Commission and is not mandatory for other stewardship organisations.
Type Type: Source, Product, Other
This field was added for the National Skills Commission and is not mandatory for other stewardship organisations.
Rights and Permissions The ability of users to use or enhance this data asset as determined by legislation, legal authority, data sharing agreements or guidelines.
This field was added for the National Skills Commission and is not mandatory for other stewardship organisations.
Security Classification The security classification of the data set which is applied to ensure the confidentiality, integrity and availability of all official information.
This field was added for the National Skills Commission and is not mandatory for other stewardship organisations.
Sensitivity Describe any personally identifying information (e.g., name, mobile number, email), commercial (e.g., unpublished) or sensitive data (e.g., information or option about racial, ethnic origin, political opinions, sexual orientation, health, biometric information etc.) in this data asset.
This field was added for the National Skills Commission and is not mandatory for other stewardship organisations.
Data Custodian Organisation that controls data lawfully created, collected, held by or on behalf of that Commonwealth entity.
This field was added for the National Skills Commission and is not mandatory for other stewardship organisations.
Senior Data Steward Position Job title of SES (generally Band 1) with delegated decision-making powers and oversight responsibilities for specific departmental and NSC data assets, or data we use as a third-party.
This field was added for the National Skills Commission and is not mandatory for other stewardship organisations.
Senior Data Steward Branch Branch name of SES (generally Band 1) with delegated decision-making powers and oversight responsibilities for specific departmental and NSC data assets, or data we use as a third-party.
This field was added for the National Skills Commission and is not mandatory for other stewardship organisations.
Data Steward Position Job title of delegate (generally EL2) responsible for managing this data asset to ensure that it is secure, high quality and fit for purpose.
This field was added for the National Skills Commission and is not mandatory for other stewardship organisations.
Data Steward Section Section name of delegate (generally EL2) responsible for managing this data asset to ensure that it is secure, high quality and fit for purpose.
This field was added for the National Skills Commission and is not mandatory for other stewardship organisations.
Contacts Name and email of people who can be contacted for further details about this data asset.
This is a free-text field that allows more detail (e.g. links) to be included than the standard Point of Contact field in Aristotle. This field was added for the National Skills Commission and is not mandatory for other stewardship organisations.
Date Last Updated The date when this data asset was last modified or updated.
This field was added for the National Skills Commission and is not mandatory for other stewardship organisations.
Next Review Date The date when this data asset entry is scheduled to be reviewed by the Data Steward. Set for every 12 months.
This field was added for the National Skills Commission and is not mandatory for other stewardship organisations.
Access Details Describe how consumers can locate, access, use and verify (e.g., versioning, hash code) this data asset.
This field was added for the National Skills Commission and is not mandatory for other stewardship organisations.
Format Description of the format of the data as stored in the asset.
This field was added for the National Skills Commission and is not mandatory for other stewardship organisations.
Keywords Words, phrases and tags to help discovery.
This field was added for the National Skills Commission and is not mandatory for other stewardship organisations.
Granularity Provides an indicator if the data is in aggregate form, unit record level or both. * Unit – data asset is recorded at the unit level. * Aggregate – data asset is grouped. * Both – data asset contains data that is at the unit record level and aggregated.
This field was added for the National Skills Commission and is not mandatory for other stewardship organisations.
Key Attributes Key columns, characteristics or data elements that would be of interest to consumers, including non-standard formats (e.g., date), granularity.
This field was added for the National Skills Commission and is not mandatory for other stewardship organisations.
Consumers Internal NSC teams, projects, data products or known external users that utilise this data asset.
This field was added for the National Skills Commission and is not mandatory for other stewardship organisations.

Official Definition

A representation of a dataset in a catalog. Data Catalog Vocabulary (DCAT): 5.3 Class: Dataset