DisGeNET explained

Scope:Gene-disease associations
Organism:Homo sapiens
Center:GRIB
Laboratory:IBI Group
Author:Laura I. Furlong and Ferran Sanz, Team Leaders
Released:2010
Url:DisGeNET
Pmid:25877637
Download:Downloads
Sparql:DisGeNET-RDF
License:The DisGeNET data is made available on a Open Database License
Versioning:DisGeNET v5.0

DisGeNET is a discovery platform designed to address a variety of questions concerning the genetic underpinning of human diseases. DisGeNET is one of the largest and comprehensive repositories of human gene-disease associations (GDAs) currently available.[1] It also offers a set of bioinformatic tools to facilitate the analysis of these data by different user profiles. It is maintained by the Integrative Biomedical Informatics (IBI) Group, of the (GRIB)-IMIM/UPF, based at the Barcelona Biomedical Research Park (PRBB), Barcelona, Spain.

Scope and access

In the pursuit to gather different aspects of the current knowledge on the genetic basis of human diseases, DisGeNET covers information on all disease areas (Mendelian, complex and environmental diseases). With more than 400 000 genotype-phenotype relationships from different origins integrated and annotated with explicit provenance and evidence information, DisGeNET is a valuable knowledge and evidence-based discovery resource for Translational Research.DisGeNET is an open access resource that makes available a comprehensive knowledge base on disease genes and different tools for their exploitation and analysis. DisGeNET is available through a Web interface, a Cytoscape plugin,[2] as linked data for the Semantic Web, and supports programmatic access to its data. These valuable set of tools allows investigating the molecular mechanisms underlying diseases of genetic origin,[3] and are designed to support the data exploitation from different perspectives and to fulfill the needs of different types of users, including bioinformaticians, biologists and healthcare practitioners.

Integrated data

The DisGeNET database integrates over 400 000 associations between > 17 000 genes and > 14 000 diseases from human to animal model expert curated databases with text mined GDAs from MEDLINE using a NLP-based approach.[4] The highlights of DisGeNET are the data integration, standardisation and a fine-grained tracking of the provenance information. The integration is performed by means of gene and disease vocabulary mapping and by using the DisGeNET association type ontology. Furthermore, GDAs are organised according to their type and level of evidence as CURATED, PREDICTED and LITERATURE, and they are also scored based on the supporting evidence to prioritise and ease their exploration.

The DisGeNET Association Type Ontology

For a seamless integration of gene-disease association data, we developed the DisGeNET association type ontology. All association types as found in the original source databases are formally structured from a parent GeneDiseaseAssociation class if there is a relationship between the gene/protein and the disease, and represented as ontological classes. It is an OWL ontology that is integrated into the Sematicscience Integrated Ontology (SIO), which provides essential types and relations for the rich description of objects, processes and their attributes.[5] You can check SIO gene-disease association classes from this URL.

Cytoscape plugin

The DisGeNET Cytoscape plugin offers a network representation of the gene-disease associations. It represents gene-disease associations in terms of bipartite graphs and additionally provides gene centric and disease centric views of the data. It assists the user in the interpretation and exploration of human complex diseases with respect to their genetic origin by a variety of built-in functions. Using the DisGeNET Cytoscape plugin you can perform queries restricted to (i) the original data source, (ii) the association type, (iii) the disorder class of interest and (iv) specific diseases or genes.

Linked Data

The information contained in DisGeNET can also be expanded and complemented using Semantic Web technologies and linked to a variety of resources already present in the Linked Open Data cloud. DisGeNET is distributed as RDF and Nanopublications linked datasets. The DisGeNET-RDF linked dataset is an alternative way to access the DisGeNET data and provides new opportunities for data integration, querying and integrating DisGeNET data to other external RDF datasets. The RDF and Nanopublication distributions of DisGeNET have been developed in the context of the Open PHACTS project to provide disease relevant information to the knowledge base on pharmacological data.

European projects

See also

Notes and References

  1. Piñero. J.. Queralt-Rosinach. N.. Bravo. A.. Deu-Pons. J.. Bauer-Mehren. A.. Baron. M.. Sanz. F.. Furlong. L. I.. DisGeNET: a discovery platform for the dynamical exploration of human diseases and their genes. Database. 15 April 2015. 2015. bav028. 10.1093/database/bav028. 25877637. 4397996.
  2. Bauer-Mehren. A. Rautschka. M. Sanz. F. Furlong. LI. DisGeNET: a Cytoscape plugin to visualize, integrate, search and analyze gene-disease networks.. Bioinformatics. 15 November 2010. 26. 22. 2924–6. 10.1093/bioinformatics/btq538. 20861032. free.
  3. Bauer-Mehren. A. Bundschus. M. Rautschka. M. Mayer. MA. Sanz. F. Furlong. LI. Gene-disease network analysis reveals functional modules in mendelian, complex and environmental diseases.. PLOS ONE. 14 June 2011. 6. 6. e20284. 10.1371/journal.pone.0020284. 21695124. 3114846. 2011PLoSO...620284B. free.
  4. Bravo. À. Piñero. J. Queralt-Rosinach. N. Rautschka. M. Furlong. LI. Extraction of relations between genes and diseases from text and large-scale data analysis: implications for translational research.. BMC Bioinformatics. 21 February 2015. 16. 55. 55. 10.1186/s12859-015-0472-9. 25886734. 4466840. free.
  5. Dumontier. Michel. Baker. Christopher JO. Baran. Joachim. Callahan. Alison. Chepelev. Leonid. Cruz-Toledo. José. Del Rio. Nicholas R. Duck. Geraint. Furlong. Laura I. Keath. Nichealla. Klassen. Dana. McCusker. James P. Queralt-Rosinach. Núria. Samwald. Matthias. Villanueva-Rosales. Natalia. Wilkinson. Mark D. Hoehndorf. Robert. The Semanticscience Integrated Ontology (SIO) for biomedical research and knowledge discovery. Journal of Biomedical Semantics. 2014. 5. 1. 14. 10.1186/2041-1480-5-14. 4015691. 24602174 . free .