Linked data explained

Linked data should not be confused with Linked data structure.

In computing, linked data is structured data which is interlinked with other data so it becomes more useful through semantic queries. It builds upon standard Web technologies such as HTTP, RDF and URIs, but rather than using them to serve web pages only for human readers, it extends them to share information in a way that can be read automatically by computers. Part of the vision of linked data is for the Internet to become a global database.[1]

Tim Berners-Lee, director of the World Wide Web Consortium (W3C), coined the term in a 2006 design note about the Semantic Web project.[2]

Linked data may also be open data, in which case it is usually described as Linked Open Data.[3]

Principles

In his 2006 "Linked Data" note, Tim Berners-Lee outlined four principles of linked data, paraphrased along the following lines:

  1. Uniform Resource Identifiers (URIs) should be used to name and identify individual things.
  2. HTTP URIs should be used to allow these things to be looked up, interpreted, and subsequently "dereferenced".
  3. Useful information about what a name identifies should be provided through open standards such as RDF, SPARQL, etc.
  4. When publishing data on the Web, other things should be referred to using their HTTP URI-based names.

Tim Berners-Lee later restated these principles at a 2009 TED conference, again paraphrased along the following lines:[4]

  1. All conceptual things should have a name starting with HTTP.
  2. Looking up an HTTP name should return useful data about the thing in question in a standard format.
  3. Anything else that that same thing has a relationship with through its data should also be given a name beginning with HTTP.

Components

Thus, we can identify the following components as essential to a global Linked Data system as envisioned, and to any actual Linked Data subset within it:

Linked open data

Linked open data are linked data that are open data.[5] [6] [7] Tim Berners-Lee gives the clearest definition of linked open data as differentiated from linked data.

Large linked open data sets include DBpedia, Wikibase, Wikidata and .

5-star linked open data

In 2010, Tim Berners-Lee suggested a 5-star scheme for grading the quality of open data on the web, for which the highest ranking is Linked Open Data:[8]

History

The term "linked open data" has been in use since at least February 2007, when the "Linking Open Data" mailing list[9] was created.[10] The mailing list was initially hosted by the SIMILE project[11] at the Massachusetts Institute of Technology.

Linking Open Data community project

The goal of the W3C Semantic Web Education and Outreach group's Linking Open Data community project is to extend the Web with a data commons by publishing various open datasets as RDF on the Web and by setting RDF links between data items from different data sources. In October 2007, datasets consisted of over two billion RDF triples, which were interlinked by over two million RDF links.[12] [13] By September 2011 this had grown to 31 billion RDF triples, interlinked by around 504 million RDF links. A detailed statistical breakdown was published in 2014.[14]

European Union projects

There are a number of European Union projects involving linked data. These include the linked open data around the clock (LATC) project,[15] the AKN4EU project for machine-readable legislative data, the PlanetData project,[16] the DaPaaS (Data-and-Platform-as-a-Service) project,[17] and the Linked Open Data 2 (LOD2) project.[18] [19] [20] Data linking is one of the main goals of the EU Open Data Portal, which makes available thousands of datasets for anyone to reuse and link.

Ontologies

Ontologies are formal descriptions of data structures. Some of the better known ontologies are:

Datasets

Dataset instance and class relationships

Clickable diagrams that show the individual datasets and their relationships within the DBpedia-spawned LOD cloud (as by the figures to the right) are available.[25] [26]

See also

Further reading

External links

Notes and References

  1. Web site: Linked Data as JSON. 2020-12-04. Linked Data as JSON. en.
  2. Web site: Linked Data . Design Issues . Tim Berners-Lee . Tim Berners-Lee . 2006-07-27 . . 2010-12-18.
  3. Web site: What are Linked Data and Linked Open Data?. Ontotext. en-US. 2019-05-08.
  4. Web site: Tim Berners-Lee on the next Web. 2009-03-15. 2011-04-10. https://web.archive.org/web/20110410204952/http://www.ted.com/talks/tim_berners_lee_on_the_next_web.html. dead.
  5. Web site: Frequently Asked Questions (FAQs) - Linked Data - Connect Distributed Data across the Web. 2014-12-29. 2015-11-18. https://web.archive.org/web/20151118060145/http://linkeddata.org/faq. dead.
  6. Web site: COAR » 7 things you should know about…Linked Data. 2015-12-29. https://web.archive.org/web/20151118085816/https://www.coar-repositories.org/activities/repository-observatory/second-edition-linked-open-data/7-things-you-should-know-about-open-data/. 2015-11-18. dead.
  7. Web site: Linked Data Basics for Techies. 2015-12-29. 2021-05-05. https://web.archive.org/web/20210505205603/http://openorg.ecs.soton.ac.uk/wiki/Linked_Data_Basics_for_Techies#Open_Linked_Data. dead.
  8. Web site: What is 5 Star Linked Data? Webize Everything Community Group. 2021-03-07. www.w3.org. en-US.
  9. Web site: public-lod@w3.org Mail Archives.
  10. Web site: SweoIG/TaskForces/CommunityProjects/LinkingOpenData/NewsArchive.
  11. Web site: SIMILE Project - Mailing Lists.
  12. Web site: SweoIG/TaskForces/CommunityProjects/LinkingOpenData - W3C Wiki. esw.w3.org. 22 March 2018.
  13. Book: Fensel . Dieter . Facca . Federico Michele . Simperl . Elena . Ioan . Toma . Semantic Web Services . 2011 . Springer. 978-3642191923 . 99.
  14. Web site: State of the LOD Cloud. Max. linkeddatacatalog.dws.informatik.uni-mannheim.de. 22 March 2018.
  15. Web site: Linked open data around the clock (LATC). latc-project.eu. 22 March 2018. https://web.archive.org/web/20180919095411/https://latc-project.eu/. 19 September 2018. dead.
  16. Web site: Welcome to PlanetData! - PlanetData. planet-data.eu. 22 March 2018. 21 April 2021. https://web.archive.org/web/20210421082019/http://www.planet-data.eu/. dead.
  17. Web site: DaPaaS. project.dapaas.eu. 22 March 2018. 18 December 2020. https://web.archive.org/web/20201218070059/http://project.dapaas.eu/. dead.
  18. https://web.archive.org/web/20180929075540/http://lod2.eu/ Linking Open Data 2 (LOD2)
  19. Web site: CORDIS FP7 ICT Projects – LOD2 . European Commission . 2010-04-20.
  20. Web site: LOD2 Project Fact Sheet – Project Summary . 2010-09-01 . 2010-12-18 . dead . https://web.archive.org/web/20110720164405/http://static.lod2.eu/Deliverables/LOD2_D12.5.1_Project_Fact_Sheet_Version.pdf . 2011-07-20 .
  21. Web site: GRID Statistics. grid.ac/stats. en-GB. 2018-10-26.
  22. Web site: GRID Policies. grid.ac. en-GB. 2018-10-26.
  23. Web site: KnowWhereGraph. knowwheregraph.org. en-US. 2022-05-16.
  24. Know, Know Where, Knowwheregraph: A Densely Connected, Cross-Domain Knowledge Graph and Geo-Enrichment Service Stack for Applications in Environmental Intelligence. 2022. 10.1609/aimag.v43i1.19120. Krzysztof Janowicz. Pascal Hitzler. Wenwen Li. Dean Rehberger. Mark Schildhauer. Rui Zhu. Cogan Shimizu. Colby K. Fisher. Ling Cai. Gengchen Mai. Joseph Zalewski. Lu Zhou. Shirly Stephen. Seila Gonzalez Estrecha. Bryce D. Mecum. Anna Lopez-Carr. Andrew Schroeder. Dave Smith. Dawn J. Wright. Sizhe Wang. Yuanyuan Tian. Zilong Liu. Meilin Shi. Anthony D'Onofrio. Zhining G. Kitty Currier . AI Magazine. 43. 1. 30–39. free. 1983/be176aba-9dec-456c-9615-01a0e8556b7b. free.
  25. Web site: Instance relationships amongst datasets. fu-berlin.de. 22 March 2018. 2012-10-17. https://web.archive.org/web/20121017231016/http://www4.wiwiss.fu-berlin.de/bizer/pub/lod-datasets_2009-07-14.html. dead.
  26. Web site: Class relationships amongst datasets. https://web.archive.org/web/20110828103804/http://umbel.org/sites/umbel.org/lod/lod_constellation.html. dead. 28 August 2011. 22 March 2018.