Data Catalog Vocabulary Explained

Data Catalog Vocabulary (DCAT) is an RDF vocabulary designed to facilitate interoperability between data catalogs published on the Web. By using DCAT to describe datasets in catalogs, publishers increase discoverability and enable applications to consume metadata from multiple catalogs. It enables decentralized publishing of catalogs and facilitates federated dataset search across catalogs. Aggregated DCAT metadata can serve as a manifest file to facilitate digital preservation.[1]

The original DCAT vocabulary was developed at DERI, as an idea from Vassilios Peristeras and his master student Fadi Maali together also with Richard Cyganiak. The vocabulary was further developed by W3C's eGov Interest Group, then brought onto the Recommendation Track by W3C's "Government Linked Data" Working Group. DCAT is the foundation for open dataset descriptions in the European Union public sector and was adapted by the ISA programme of the European Commission.[2] A2022 report reviews DCATAP compliance on national data portals.[3]

DCATv2 was published as a W3C Recommendation 2020-02-04.[4] Version2 adds support for cataloguing data services or APIs, and has stronger support for expressing relationships between datasets. An alignment to Schema.org is included.

As DCAT is extensible, more specific extensions have been created in the statistical and geodata domains.[5] [6]

An open-source licensed porting of the version DCAT-AP 2.0.1 compatible with NGSI-LD API standard is available in the DCAT-AP subject at Smart Data Models program.

Notes and References

  1. Web site: Fadi Maali. John Erickson. Phil Archer. Data Catalog Vocabulary (DCAT). The World Wide Web Consortium (W3C). 23 August 2017. 16 January 2014.
  2. Web site: DCAT application profile for data portals in Europe. 23 August 2017.
  3. Book: Carsaniga . Giulia . Lincklaen Arriƫns . Eline N . Dogger . Jochem . van Assen . Mariska . Cecconi . Gianfranco . Open data maturity report 2022 . December 2022 . Publications Office of the European Union . Luxembourg, Belgium . 10.2830/70973 . 978-92-78-43386-4 . 2023-07-03.
  4. Web site: Riccardo Albertoni. David Browning. Simon Cox. Alejandra Gonzalez Beltran. Andrea Perego. Peter Winstanley. Data Catalog Vocabulary (DCAT) - Version 2. The World Wide Web Consortium (W3C). 5 February 2020. 4 February 2020.
  5. Web site: StatDCAT application profile for data portals in Europe. ISA Programme, European Commission. 23 August 2017.
  6. Web site: GeoDCAT-AP v1.0. ISA Programme, European Commission. 23 August 2017. 23 August 2017. https://web.archive.org/web/20170823205840/https://joinup.ec.europa.eu/asset/dcat_application_profile/asset_release/geodcat-ap-v10. dead.