Concept help - Data Set

A Data Set describes a record of data, including any location or time boundaries for the data, that has been captured and is available for use under a specific licence. A Data Set may be included in a Data Catalog, and can reference multiple Distributions that record different parts or formats of the data that are available to download.

In DCAT, a dataset is defined as a "collection of data, published or curated by a single agent, and available for access or download in one or more formats". A dataset does not have to be available as a downloadable file. For example, a dataset that is available via an API can be defined as an instance of dcat:Dataset, and the API can be defined as an instance of dcat:Distribution. DCAT itself does not define properties specific to API descriptions; these are considered out of scope for this version of the vocabulary. Nevertheless, such properties can be defined in a profile of the DCAT vocabulary.
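As an illustration of the API case described above, the following is a minimal sketch of a dataset record written as a JSON-LD style Python dictionary. The identifiers, titles and URLs are hypothetical placeholders, and the helper `has_downloadable_file` is introduced here only for illustration; the property names (`dcat:distribution`, `dcat:accessURL`, `dcat:downloadURL`) come from the DCAT vocabulary.

```python
import json

# All identifiers, titles and URLs below are hypothetical placeholders.
dataset = {
    "@context": {
        "dcat": "http://www.w3.org/ns/dcat#",
        "dct": "http://purl.org/dc/terms/",
    },
    "@id": "https://example.org/dataset/job-vacancies",
    "@type": "dcat:Dataset",
    "dct:title": "Example job vacancies data",
    "dcat:distribution": [
        {
            # The API is described as a distribution of the dataset.
            "@id": "https://example.org/dataset/job-vacancies/api",
            "@type": "dcat:Distribution",
            "dct:title": "Query API",
            "dcat:accessURL": {"@id": "https://example.org/api/job-vacancies"},
        }
    ],
}


def has_downloadable_file(ds):
    """Return True if any distribution declares a dcat:downloadURL."""
    return any("dcat:downloadURL" in d for d in ds.get("dcat:distribution", []))


print(json.dumps(dataset, indent=2))
print(has_downloadable_file(dataset))
```

The record is still a valid dataset description even though none of its distributions offers a file download, which is the point made in the paragraph above.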

Fields available on this metadata type

Field ISO definition
Name The primary name used for human identification purposes.
Definition Representation of a concept by a descriptive statement which serves to differentiate it from related concepts. (3.2.39)
Is Federated
Is Not Federable
Version Unique version identifier of this metadata item.
References Significant documents that contributed to the development of the metadata item which were not the direct source for the metadata content.
Origin The source (e.g. document, project, discipline or model) for the item (8.1.2.2.3.5)
Comments Descriptive comments about the metadata item (8.1.2.2.3.4)
Deleted The date after which the item has been soft deleted and is no longer visible in the registry
License Information about the license document under which the dataset is made available.
Rights Information about rights held in and over the dataset.
Release Date Date of formal publication of the dataset.
Modification Date Most recent date on which the dataset was changed, updated or modified.
Frequency The frequency at which the dataset is published.
Spatial Coverage Spatial or geographic coverage of the dataset.
Temporal Coverage The temporal or time period that the dataset covers.
Catalog An entity responsible for making the dataset available.
Landing Page A Web page that can be navigated to in a Web browser to gain access to the dataset, its distributions and/or additional information
Contact Point Relevant contact information for the Dataset.
Conforming Specification An established standard to which the described resource conforms.
Item Base
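
Several of the fields above have definitions that closely match terms in DCAT and Dublin Core. The sketch below shows one way such a correspondence could be recorded and applied. The mapping is an assumption made for illustration, not something the registry states, and `FIELD_TO_TERM` and `to_dcat_properties` are hypothetical names.

```python
# Assumed correspondence between registry field labels and DCAT / Dublin Core
# terms. The registry does not publish this mapping; it is illustrative only.
FIELD_TO_TERM = {
    "License": "dct:license",
    "Rights": "dct:rights",
    "Release Date": "dct:issued",
    "Modification Date": "dct:modified",
    "Frequency": "dct:accrualPeriodicity",
    "Spatial Coverage": "dct:spatial",
    "Temporal Coverage": "dct:temporal",
    "Landing Page": "dcat:landingPage",
    "Contact Point": "dcat:contactPoint",
    "Conforming Specification": "dct:conformsTo",
}


def to_dcat_properties(record):
    """Rename known registry fields to their assumed terms; drop the rest."""
    return {FIELD_TO_TERM[k]: v for k, v in record.items() if k in FIELD_TO_TERM}


# Hypothetical record using the field labels from the table above.
example = {"Name": "Example data set", "License": "CC BY 4.0", "Release Date": "2023-01-31"}
print(to_dcat_properties(example))
```

Fields without an assumed equivalent, such as Name in this example, are left out of the result rather than guessed at.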

Custom Fields

Field Short definition Long definition
Purpose The primary business purpose of the data asset. “This is collected/used for…”
This field was added for the National Skills Commission and is not mandatory for other stewardship organisations.
Information Type
This field is not mandatory.
Technology
This field is not mandatory.
Status Whether the asset is being actively updated or not. * Active – currently being modified and added to * Inactive – no longer being kept up to date * Archive – inactive and removed * In development – data is currently being developed
This field was added for the National Skills Commission and is not mandatory for other stewardship organisations.
Type Type: Source, Product, Other
This field was added for the National Skills Commission and is not mandatory for other stewardship organisations.
Rights and Permissions The ability of users to use or enhance this data asset as determined by legislation, legal authority, data sharing agreements or guidelines.
This field was added for the National Skills Commission and is not mandatory for other stewardship organisations.
Security Classification The security classification of the data set which is applied to ensure the confidentiality, integrity and availability of all official information.
This field was added for the National Skills Commission and is not mandatory for other stewardship organisations.
Sensitivity Describe any personally identifying information (e.g., name, mobile number, email), commercial (e.g., unpublished) or sensitive data (e.g., information or opinion about racial or ethnic origin, political opinions, sexual orientation, health, biometric information etc.) in this data asset.
This field was added for the National Skills Commission and is not mandatory for other stewardship organisations.
Data Custodian Organisation that controls data lawfully created, collected, held by or on behalf of that Commonwealth entity.
This field was added for the National Skills Commission and is not mandatory for other stewardship organisations.
Senior Data Steward Position Job title of SES (generally Band 1) with delegated decision-making powers and oversight responsibilities for specific departmental and NSC data assets, or data we use as a third party.
This field was added for the National Skills Commission and is not mandatory for other stewardship organisations.
Senior Data Steward Branch Branch name of SES (generally Band 1) with delegated decision-making powers and oversight responsibilities for specific departmental and NSC data assets, or data we use as a third party.
This field was added for the National Skills Commission and is not mandatory for other stewardship organisations.
Data Steward Position Job title of delegate (generally EL2) responsible for managing this data asset to ensure that it is secure, high quality and fit for purpose.
This field was added for the National Skills Commission and is not mandatory for other stewardship organisations.
Data Steward Section Section name of delegate (generally EL2) responsible for managing this data asset to ensure that it is secure, high quality and fit for purpose.
This field was added for the National Skills Commission and is not mandatory for other stewardship organisations.
Contacts Name and email of people who can be contacted for further details about this data asset.
This is a free-text field that allows more detail (e.g. links) to be included than the standard Point of Contact field in Aristotle. This field was added for the National Skills Commission and is not mandatory for other stewardship organisations.
Date Last Updated The date when this data asset was last modified or updated.
This field was added for the National Skills Commission and is not mandatory for other stewardship organisations.
Next Review Date The date when this data asset entry is scheduled to be reviewed by the Data Steward. Set for every 12 months.
This field was added for the National Skills Commission and is not mandatory for other stewardship organisations.
Access Details Describe how consumers can locate, access, use and verify (e.g., versioning, hash code) this data asset.
This field was added for the National Skills Commission and is not mandatory for other stewardship organisations.
Format Description of the format of the data as stored in the asset.
This field was added for the National Skills Commission and is not mandatory for other stewardship organisations.
Keywords Words, phrases and tags to help discovery.
This field was added for the National Skills Commission and is not mandatory for other stewardship organisations.
Granularity Provides an indicator if the data is in aggregate form, unit record level or both. * Unit – data asset is recorded at the unit level. * Aggregate – data asset is grouped. * Both – data asset contains data that is at the unit record level and aggregated.
This field was added for the National Skills Commission and is not mandatory for other stewardship organisations.
Key Attributes Key columns, characteristics or data elements that would be of interest to consumers, including non-standard formats (e.g., date), granularity.
This field was added for the National Skills Commission and is not mandatory for other stewardship organisations.
Consumers Internal NSC teams, projects, data products or known external users that utilise this data asset.
This field was added for the National Skills Commission and is not mandatory for other stewardship organisations.
Legal Authority The legal mandate under which the asset was collected, created, received, used or disclosed.
The legal authority or legislation for the existence of the data. This could include MOUs, legislation, Machinery of Government, policy or other authority. E.g. (Australian Government) Federal Register of Legislation. This field was added to meet the Office of the National Data Commissioner (ONDC) requirements.
Disposal Information about current records authorities and the disposal actions that relate to the data asset.
A statement on the disposal information of the data asset according to the National Archives Act. Could include any records authority, disposal action, and disposal trigger date. This field was added to meet the Office of the National Data Commissioner (ONDC) requirements.
Resource Type The category of asset being described.
Should be a centrally managed list of recognised Australian Government types, e.g. the ISO 19115 scopeCode list: dataset, series, software, service (as in API), model, document, repository, product, application etc. This field was added to meet the Office of the National Data Commissioner (ONDC) requirements.
Data Status A status that describes the state of progression or registration of the data asset.
completed, historicalArchive, obsolete, onGoing, planned, required, underDevelopment, final, pending, retired, superseded, tentative, valid, accepted, notAccepted, withdrawn, proposed, deprecated. This field was added to meet the Office of the National Data Commissioner (ONDC) requirements.
File Size The size of the asset in bytes.
This field was added to meet the Office of the National Data Commissioner (ONDC) requirements.
Language Language of the asset.
A language of the item. This refers to the natural language used for textual metadata (i.e. titles, descriptions, etc.) of a catalogued asset (i.e. dataset or service). This field was added to meet the Office of the National Data Commissioner (ONDC) requirements.
Publisher The name of an entity responsible for making the asset available.
The specification of the individuals or organisations responsible for the publication of the data set. Free text OR picklist OR standard statement. e.g. Geoscience Australia. This field was added to meet the Office of the National Data Commissioner (ONDC) requirements.
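
Some of the custom fields above (Status, Type and Granularity) list a fixed set of values. The following is a minimal sketch of how an entry could be checked against those values before it is submitted. The function name `check_custom_fields` and the dictionary layout are assumptions for illustration; the registry's own validation, if any, is not described on this page.

```python
# Allowed values copied from the field definitions above.
ALLOWED_VALUES = {
    "Status": {"Active", "Inactive", "Archive", "In development"},
    "Type": {"Source", "Product", "Other"},
    "Granularity": {"Unit", "Aggregate", "Both"},
}


def check_custom_fields(record):
    """Return a list of problems found in the controlled-value fields.

    The fields are not mandatory, so a missing field is not an error.
    """
    problems = []
    for field, allowed in ALLOWED_VALUES.items():
        if field in record and record[field] not in allowed:
            problems.append(
                f"{field}: {record[field]!r} is not one of {sorted(allowed)}"
            )
    return problems


# Hypothetical entry with one invalid value.
entry = {"Status": "Active", "Granularity": "Monthly"}
for problem in check_custom_fields(entry):
    print(problem)
```

Free-text fields such as Purpose or Keywords are left unchecked here, since the page gives no fixed value list for them.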

Official Definition

A representation of a dataset in a catalog. Data Catalog Vocabulary (DCAT): 5.3 Class: Dataset