CORE (research service) explained

CORE (Connecting Repositories)
Commercial:	No
Type:	Open Access, Repositories, Harvesting
Location:	Open University
Country:	United Kingdom
Key People:	Petr Knoth

CORE (Connecting Repositories) is a service provided by the based at The Open University, United Kingdom. The goal of the project is to aggregate all open access content distributed across different systems, such as repositories and open access journals, enrich this content using text mining and data mining, and provide free access to it through a set of services.^[1] The CORE project also aims to promote open access to scholarly outputs. CORE works closely with digital libraries and institutional repositories.^[2]

Service description

There are existing commercial academic search systems, such as Google Scholar, which provide search and access level services, but do not support programmable machine access to the content. This is seen with the use of an API or data dumps, and limits the further reuse of the open access content (e.g., text and data mining). There are three access levels to content:

access at the granularity of papers
analytical access and granularity of collections
programmable machine access to data

The programmable machine access is the main feature that distinguishes CORE from Google Scholar and Microsoft Academic Search.

History

The first version of CORE was created in 2011 by Petr Knoth with the aim to make it easier to access and text mine very large amounts of research publications.^[3] The value of the aggregation was first demonstrated by developing a content recommendation system for research papers, following the ideas of literature-based discovery introduced by Don R. Swanson. Since its start, CORE has received financial support from a range of funders including Jisc and the European Commission. CORE aggregates from across the world; in 2017, it was calculated that it reached documents from 102 countries in 52 languages.^[4] It has the status of the UK's national aggregator of open access content, aggregating metadata and full-text outputs from both UK publishers' databases as well as institutional and subject repositories.^[5] ^[6]

CORE operates as a one step search tool for UK's open access research outputs, facilitating discoverability, use and reuse. The importance of the service has been widely recognised by Jisc, which suggested that CORE should preserve the required resources to sustain its operation and explore an international sustainability model. CORE is now one of the Repository Shared Services projects, along with Sherpa Services,^[7] IRUS,^[8] Jisc Publications Router^[9] and OpenDOAR.

In 2018, CORE said it was the world's largest aggregator of open access research papers.^[10] Based on the open access fundamental principles, as they were described in the Budapest Open Access Initiative, its open access content not only must be openly available to download and read, but it must also allow its reuse, both by humans and machines. As a result, there was a need to exploit the content reuse, which could be made possible with the implementation of a technical infrastructure. The CORE project started with the goal of connecting metadata and full-text outputs offering, through content aggregation, value-added services, and by opening new opportunities in the research process.

CORE later changed the license of its datasets to "all rights reserved" and was overtaken by Internet Archive Scholar, which in 2022 had over 25 million full-text articles vs. less than 10 million on CORE.^[11]

Programmable access to CORE data

CORE data can be accessed through an API or downloaded as a pre-processed and semantically enriched data dump.^[12]

Searching CORE

CORE provides searchable access to a collection of over 125 million open access harvested research outputs. All outputs can be accessed and downloaded free of cost and have limited re-use restrictions. One can search the CORE content using a faceted search. CORE also provides a cross-repository content recommendation system based on full-texts. The collection of the harvested outputs is available either by looking at the latest additions^[13] or by browsing^[14] the collection at the date of harvesting.The CORE search engine was selected by an author on Jisc in 2013 as one of the top 10 search engines^[15] for open access research, facilitating access to academic papers.^[16] ^[17]

Analytical use of CORE data

The availability of data aggregated and enriched by CORE provides opportunities for the development of new analytical services for research literature. These can be used, for example, to monitor growth and trends in research, validate compliance with open access mandates and to develop new automatic metrics for evaluating research excellence.

According to the Registry of Open Access Repositories, the number of funders increased from 22 units in 2007 to 34 in 2010 and then to 67 in 2015, while the number of institutional full-text and open access mandates picked up from 137 units in 2007 to 430 in 2015.^[18]

Applications

CORE offers eight applications:

CORE API, provides an access point to develop applications making use of CORE's collection of Open Access content.^[19]
CORE Dataset, provides access to the data aggregated from repositories by CORE and allows their further manipulation.^[20]
CORE Recommender, can link an institutional repository with the CORE service and recommends semantically related resources.^[21]
CORE Repository Dashboard, is a tool for repository managers or research output administrators. The aim of the Repository Dashboard is to provide control over the aggregated content and help in the management and validation of the repository collections and services.^[22] It is integrated in the Institutional Repository Usage Statistics (IRUS-UK), a Jisc-funded project that serves as a national repository usage statistics aggregation servire.^[23]
CORE Analytics Dashboard, helps institutions to understand and monitor the impact of their research.^[24]
CORE Search, enables users to search and access research papers.^[25]
CORE Publisher Connector, provides access to Gold and Hybrid Gold Open Access articles aggregated from non-standard systems of major publishers. Data is exposed via the ResourceSync protocol.^[26]
CORE SDKs, provide access to content for programs. The CORE SDK R is freely available and it is mainly community led. The aim is to maximise the productivity and data analysis, prototyping and migration.

Notes and References

Web site: OU's full text search system makes huge leaps in widening access to academic papers. 24 October 2012. 19 December 2014 .
Web site: Enhancing the visibility of Maltese research. 25 December 2016 .
Web site: OUs full text search system makes huge leaps in widening access to academic papers. 24 October 2012. 19 December 2014.
Web site: CORE franchit le cap des 5 millions de documents en texte intégral indexés et en libre accès. fr.
Web site: CORE melds UK repositories. Times of Higher Education . 13 October 2011. 11 November 2014.
Web site: UK's first open access full-text search engine to aid research. The Research Centre. 3 October 2011. 19 December 2014. https://web.archive.org/web/20150109130034/http://www.theresearchcentre.co.uk/feedstory/uk-s-open-access-full-text-search-engine-aid-research. 9 January 2015. dead.
Web site: SHERPA Services . 20 January 2015.
Web site: IRUS UK. 20 January 2015.
Web site: Jisc Publications Router. 20 January 2015.
Web site: CORE becomes the world's largest aggregator . 1 June 2018 . Jisc scholarly communications blog . Balviar . Notay . Petr . Knoth . Nancy . Pontika.
Web site: CORE Dataset .
Web site: CORE Services . core.ac.uk . 2021-06-20.
Web site: CORE Latest Additions. 20 January 2015. 15 May 2013. https://web.archive.org/web/20130515181017/http://core.kmi.open.ac.uk/browse/latest. dead.
Web site: CORE Browsing. 20 January 2015. 12 January 2015. https://web.archive.org/web/20150112125757/http://core.kmi.open.ac.uk/browse. dead.
Web site: Ten Search Engines for researchers that go beyond Google. Jisc Inform. Summer 2013. 17 November 2014. dead. 24 December 2014. https://web.archive.org/web/20141224190720/http://www.jisc.ac.uk/inform/inform37/SearchingBeyondGoogle.html.
Web site: OU widens access to academic papers. 17 November 2014. https://web.archive.org/web/20150109134206/http://www.openuniversity.edu/news-blog/news/ou-widens-access-to-academic-papers. 9 January 2015. dead.
News: Else . Holly . 'Dismal' start for Access to Research initiative . Times Higher Education . 14 August 2014 . 15 November 2014.
Nancy . Pontika . Petr . Knoth . Matteo . Cancellieri . Samuel . Pearce . Developing Infrastructure to Support Closer Collaboration of Aggregators with Open Repositories . LIBER Quarterly . 2016 . 25 . 4 . 172–188 . 1005985574 . 10.18352/lq.10138 . 1435-5205 . free .
Web site: CORE API. 6 March 2018.
Web site: CORE Dataset. 6 March 2018.
Web site: CORE Recommender. 6 March 2018.
Web site: CORE Repository Dashboard. 6 March 2018.
Nancy. Pontika. Petr. Knoth. Matteo. Cancellieri. Samuel. Pearce. Developing Infrastructure to Support Closer Collaboration of Aggregators with Open Repositories. LIBER Quarterly. 25. 4. 1435-5205. 183. March 8, 2016. 10.18352/lq.10138.
Web site: CORE Analytics Dashboard. 6 March 2018.
Web site: CORE Search. 6 March 2018.
Web site: CORE Publisher Connector. 6 March 2018.