UK Web Archive explained

The UK Web Archive is a consortium of the six UK legal deposit libraries which aims to collect all UK websites at least once each year.[1]

UK Web Archive
Established:2005
Ref Legal Mandate:Yes, provided in law by:
Parent Organisation:-->

History

In 2005, the British Library, The National Archives, Wellcome Trust, National Library of Scotland, National Library of Wales and JISC formed the UK Web Archiving Consortium, a project to archive websites.[3]

UKWAC archived selected websites by licence or permission, using PANDAS software developed by the National Library of Australia. During the project its members collected sites relevant to their interest; the Wellcome Library collected medical sites, the national libraries sites that reflect life in contemporary Wales or Scotland. The British Library worked with a broad policy of collecting sites of cultural, historical and political importance to the UK.[4]

The Consortium wound up in 2010. The Archiving and Preservation Working Group took over UKWAC's co-ordinating role web archiving in the UK. The Digital Preservation Coalition hosts the working group.[5]

Web Archiving

The archive undertakes an annual crawl of .uk and other UK geographic Top Level Domains such as .scot, .cymru or .london. The crawl is archived in a shared infrastructure called the Digital Library System. Members of the public can nominate sites for preservation there through the UKWA website. The whole web archive is available to registered readers on library premises; and where permission has been given, or license conditions can be met, copies are also accessible through the website.[6]

The archive gathers sites in response to events, building collections - these have preserved writing and imagery recording natural disasters, election campaigns since 2005 and the UK's blogosphere for research, among more than a hundred more.[7]

SHINE

The UK Web Archive holds a collection of all the .uk websites that were archived by the Internet Archive until the end of March in 2013.[8] SHINE is a web interface which can be used to create repeatable lists of results of historical .uk pages. Trends, or occurrences of keywords in the data set on .uk pages over that time, use concordance to show keywords in context.[9]

Mementos

Memento is a name for prior versions of web pages coined by the Memento Project. The UK Web Archive Memento interface allows Mementos to be found across web archives.[10] The interface can be used to find a Memento by its date in a snapshot table, or see how often a site appears across public web archives.

Researching the archive

Research into the web as a reflection of society has helped develop access to the archive.[11] Libraries have developed guides to research skills needed to use web archives. These include using big data to see patterns or trends,[12] or writing citations for archived copies of websites.[13]

GLAM Workbench

GLAM Workbench is a project which looks at how researchers can use data preserved by galleries, libraries, archives and museums.[14] It includes a collection of Jupyter notebooks which draw on Mementos and index data.[15] The notebooks mix description and editable code to help researchers find evidence in web archives.

Where the whole archive can be accessed, by Library

See also

External links

Notes and References

  1. Web site: UKWA Home. 2020-10-13. www.webarchive.org.uk.
  2. Web site: The Legal Deposit Libraries (Non-Print Works) Regulations 2013 . legislation.gov.uk . February 21, 2022.
  3. Web site: 15 Years of the UK Web Archive - The Early Years - UK Web Archive blog. live. https://www.webarchive.org.uk/wayback/archive/20200308114459/https://blogs.bl.uk/webarchive/2020/03/15-years-of-the-uk-web-archive.html. 8 March 2020. 2020-10-13. blogs.bl.uk.
  4. Web site: April 2006. UK Web Archiving Consortium: Evaluation Report. dead. https://www.webarchive.org.uk/wayback/archive/20170109124742/http://www.dpconline.org/advice/web-archiving. 9 January 2017. 17 March 2014. Digital Preservation Coalition. April 2006.
  5. Web site: Web Archiving & Preservation Working Group - Digital Preservation Coalition. live. https://www.webarchive.org.uk/wayback/archive/20200731103406/https://www.dpconline.org/digipres/waptf. 31 July 2020. 2020-10-13. www.dpconline.org.
  6. Web site: What is the UK Web Archive?. live. https://www.webarchive.org.uk/wayback/archive/20191205103036/https://www.webarchive.org.uk/ukwa/info/about. 5 December 2019. 17 March 2014. UK Web Archive.
  7. Web site: 15 Years of UKWA - Looking back at our first collections - UK Web Archive blog. live. https://www.webarchive.org.uk/wayback/archive/20200729095939/https://blogs.bl.uk/webarchive/2020/07/15-years-of-ukwa-looking-back-at-our-first-collections.html. 29 July 2020. 2020-10-19. blogs.bl.uk.
  8. Web site: www.webarchive.org.uk. JISC UK Web Domain Dataset (1996-2013). 2020-10-16. data.webarchive.org.uk. en.
  9. Web site: Trend results 1996-2013 for "big data" :: SHINE. 2020-10-13. www.webarchive.org.uk.
  10. Web site: Mementos - Archived history of www.webarchive.org.uk. 2020-10-09. Mementos - Finding historical archives across the world wide web..
  11. Web site: Blaney. Jonathan. More project case studies available. live. https://www.webarchive.org.uk/wayback/en/archive/20170216053730/https://buddah.projects.history.ac.uk/blog/. 16 February 2017. 2020-10-09. Big UK Domain Data for the Arts and Humanities. 19 April 2016 . en-GB.
  12. Web site: McNally. Anna. LibGuides: Finding and Using Digital Archives during COVID-19: Web archives. 2020-10-14. libguides.westminster.ac.uk. en.
  13. Web site: Thomas. Susan. Oxford LibGuides: Web Archives: Home. 2020-10-14. ox.libguides.com. en.
  14. Web site: Welcome to the GLAM Workbench - GLAM Workbench. 2020-10-13. glam-workbench.github.io.
  15. Sherratt. Tim. Jackson. Andrew. 2020-06-15. GLAM-Workbench/web-archives. Zenodo. en. 10.5281/zenodo.3894079. 2020zndo...3894079S.
  16. Web site: Team. National Records of Scotland Web. 2013-05-31. NRS Web Continuity Service. live. https://www.webarchive.org.uk/wayback/archive/20200118234351/https://www.nrscotland.gov.uk/research/researching-online/the-nrs-web-continuity-service/web-archiving-and-web-continuity. 18 January 2020. 2020-10-13. National Records of Scotland. en.
  17. Web site: 2015-12-09. Search the PRONI Web Archive. live. https://www.webarchive.org.uk/wayback/archive/20200827111518/https://www.nidirect.gov.uk/services/search-proni-web-archive. 27 Aug 2020. 2020-10-13. nidirect. en.
  18. Web site: MirrorWeb - UK Parliament Web Archive. 2020-10-13. webarchive.parliament.uk.