AMiner (database) explained

Aminer
Type:Bibliographic database
Registration:Optional
Owner:Tsinghua University
Current Status:Active

AMiner (formerly ArnetMiner) is a free online service used to index, search, and mine big scientific data.

Overview

AMiner (ArnetMiner) is designed to search and perform data mining operations against academic publications on the Internet, using social network analysis to identify connections between researchers, conferences, and publications.[1] This allows it to provide services such as expert finding, geographic search, trend analysis, reviewer recommendation, association search, course search, academic performance evaluation, and topic modeling.

AMiner was created as a research project in social influence analysis, social network ranking, and social network extraction. A number of peer-reviewed papers have been published arising from the development of the system. It has been in operation for more than three years, and has indexed 130,000,000 researchers and more than 265 million publications.[2] The research was funded by the Chinese National High-tech R&D Program and the National Science Foundation of China.

AMiner is commonly used in academia to identify relationships between and draw statistical correlations about research and researchers. It has attracted more than 10 million independent IP accesses from 220 countries and regions. The product has been used in Elsevier's SciVerse platform,[3] and academic conferences such as SIGKDD, ICDM, PKDD, WSDM.

Operation

AMiner automatically extracts the researcher profile from the web. It collects and identifies the relevant pages, then uses a unified approach to extract data from the identified documents. It also extracts publications from online digital libraries using heuristic rules.

It integrates the extracted researchers’ profiles and the extracted publications. It employs the researcher name as the identifier. A probabilistic framework has been proposed to deal with the name ambiguity problem in the integration. The integrated data is stored into a researcher network knowledge base (RNKB).

The principal other product in the area are Google Scholar, Elsevier's Scirus, and the open source project CiteSeer.

History

It was initiated and created by professor Jie Tang from Tsinghua University, China. It was first launched in March 2006. The following provide a list of updates in the past years:

Resources

AMiner published several datasets for academic research purpose, including Open Academic Graph,[6] DBLP+citation[7] (a data set augmenting citations into the DBLP data from Digital Bibliography & Library Project), Name Disambiguation,[8] Social Tie Analysis.[9] For more available datasets and source codes for research, please refer to.[10]

See also

References

  1. Book: Jie Tang. Jing Zhang. Limin Yao. Juanzi Li. Li Zhang. Zhong Su. Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining . ArnetMiner . 2008. 990–998 . ACM. New York. 10.1145/1401890.1402008 . 9781605581934 . 3348552 . http://portal.acm.org/citation.cfm?id=1401890.1402008.
  2. Web site: Arnetminer: introduction. 17 Dec 2020.
  3. Web site: SciVerse - HUB - Home. https://archive.today/20120909214954/http://www.hub.sciverse.com/. dead. 9 September 2012. 24 April 2012.
  4. Web site: Trend Analysis. 24 December 2018.
  5. Book: Yutao Zhang. Fanjin Zhang. Peiran Yao. Jie Tang. Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining . Name Disambiguation in AMiner . 2018. 1002–1011 . ACM. London. 10.1145/3219819.3219859 . 9781450355520 . 207579405 . https://dl.acm.org/citation.cfm?id=3219859.
  6. Web site: Open Academic Graph. 24 December 2018.
  7. Web site: DBLP Papers + Citation Relationship. 24 December 2018.
  8. Web site: Name Disambiguation. 24 April 2012.
  9. Web site: Inferring Social Ties in Large Networks. 24 April 2012.
  10. Web site: Open Data and Codes by ArnetMiner. 24 April 2012.

External links

Further reading