GenoCAD explained

Latest Release Version:2.3.1
Programming Language:PHP JavaScript C++ MySQL
Genre:Computer-Aided Design Bioinformatics
License:Apache v2.0

GenoCAD is one of the earliest computer assisted design tools for synthetic biology. The software is a bioinformatics tool developed and maintained by GenoFAB, Inc.. GenoCAD facilitates the design of protein expression vectors, artificial gene networks and other genetic constructs for genetic engineering and is based on the theory of formal languages.

History

GenoCAD originated as an offshoot of an attempt to formalize functional constraints of genetic constructs using the theory of formal languages. In 2007, the website genocad.org (now retired) was set up as a proof of concept by researchers at Virginia Bioinformatics Institute, Virginia Tech. Using the website, users could design genes by repeatedly replacing high-level genetic constructs with lower level genetic constructs, and eventually with actual DNA sequences.

On August 31, 2009, the National Science Foundation granted a three-year $1,421,725 grant to Dr. Jean Peccoud, an associate professor at the Virginia Bioinformatics Institute at Virginia Tech, for the development of GenoCAD.[1] GenoCAD was and continues to be developed by GenoFAB, Inc., a company founded by Peccoud (currently CSO and acting CEO), who was also one of the authors of the originating study.

Source code for GenoCAD was originally released on SourceForge in December 2009.[2]

GenoCAD version 2.0 was released in November 2011 and included the ability to simulate the behavior of the designed genetic code. This feature was a result of a collaboration with the team behind COPASI.[3]

In April, 2015, Peccoud and colleagues published a library of biological parts, called GenoLIB,[4] that can be incorporated into the GenoCAD platform.[5]

Goals

The four aims of the project are to develop a:[6]

  1. computer language to represent the structure of synthetic DNA molecules used in E.coli, yeast, mice, and Arabidopsis thaliana cells
  2. compiler capable of translating DNA sequences into mathematical models in order to predict the encoded phenotype
  3. collaborative workflow environment which allow to share parts, designs, fabrication resource
  4. means to forward the results to the user community through an external advisory board, an annual user conference, and outreach to industry

Features

The main features of GenoCAD can be organized into three main categories.[7]

Theoretical foundation

GenoCAD is rooted in the theory of formal languages; in particular, the design rules describing how to combine different kinds of parts and form context-free grammars.[9]

A context free grammar can be defined by its terminals, variables, start variable and substitution rules.[10] In GenoCAD, the terminals of the grammar are sequences of DNA that perform a particular biological purpose (e.g. a promoter). The variables are less homogeneous: they can represent longer sequences that have multiple functions or can represent a section of DNA that can contain one of multiple different sequences of DNA but perform the same function (e.g. a variable represents the set of promoters). GenoCAD includes built in substitution rules to ensure that the DNA sequence is biologically viable. Users can also define their own sets of rules for other purposes.

Designing a sequence of DNA in GenoCAD is much like creating a derivation in a context free grammar. The user starts with the start variable and repeatedly selects a variable and a substitution for it until only terminals are left.[9]

Alternatives

The most common alternatives to GenoCAD are Proto, GEC and EuGene

Tool Advantages Disadvantages
GEC
  • Designer only needs to know basic part types and determine constraints
EuGene
  • Interfacing with other simulation and assembly tools[12]
Proto
  • Choice of molecules and sequences can be made by other programs
  • Integration capability with some other languages
  • Relatively hard to learn
  • Results are less efficient [13]

External links

Notes and References

  1. Web site: National Science Foundation awards $1.4 million for GenoCAD development . Jodi Lewis . September 14, 2009 . October 7, 2013 . dead . https://web.archive.org/web/20150611111849/http://peccoud.vbi.vt.edu/national-science-foundation-awards-1-4-million-for-genocad-development/ . June 11, 2015 .
  2. Web site: GenoCAD Code. Sourceforge. 8 October 2013.
  3. Web site: Wilson . Mandy . GenoCAD Release Notes . Peccoud Lab . 8 October 2013 . dead . https://web.archive.org/web/20131013161353/http://peccoud.vbi.vt.edu/genocad-release-notes/ . 13 October 2013 .
  4. Adames . Neil . Wilson . Mandy . Fang . Gang . Lux . Matthew . Glick . Benjamin . Peccoud . Jean . GenoLIB: a database of biological parts derived from a library of common plasmid features . Nucleic Acids Research . April 29, 2016 . 43 . 10 . 4823–32 . 10.1093/nar/gkv272 . 25925571 . 4446419.
  5. Adames N, Wilson M, Fang G, Lux M, Glick B, Peccoud J . GenoLIB: a database of biological parts derived from a library of common plasmid features.. Nucleic Acids Research. 2015 . 25925571 . 10.1093/nar/gkv272. 4446419. 43. 10. 4823–32.
  6. Web site: GenoCAD: Computer Assisted Design of Synthetic DNA . Jean Peccoud . June 21, 2013 . October 7, 2013 . dead . https://web.archive.org/web/20130707072240/http://peccoud.vbi.vt.edu/genocad-computer-assisted-design-of-synthetic-dna/ . July 7, 2013.
  7. Book: Wilson ML . Hertzberg R . Adam L . Peccoud J . A Step-by-Step Introduction to Rule-Based Design of Synthetic Genetic Constructs Using GenoCAD . Synthetic Biology, Part B - Computer Aided Design and DNA Assembly. 498. 173–88. 2011 . 21601678 . 10.1016/B978-0-12-385120-8.00008-5. Methods in Enzymology . 9780123851208 .
  8. Cai . Y. . Lux . M. W. . Adam . L. . Peccoud . J. . Sauro . Herbert M . Modeling Structure-Function Relationships in Synthetic DNA Sequences using Attribute Grammars . 10.1371/journal.pcbi.1000529 . PLOS Computational Biology . 5 . 10 . e1000529 . 2009 . 19816554. 2748682 . 2009PLSCB...5E0529C . free .
  9. Cai Y . Hartnett B . Gustafsson C . Peccoud J . A syntactic model to design and verify synthetic genetic constructs derived from standard biological parts.. Bioinformatics. 23. 20. 2760–7. 2007 . 17804435 . 10.1093/bioinformatics/btm446.
  10. Book: Sipser, Michael. Introduction to the Theory of Computation, Third edition. 2013. Cengage Learning. Boston, MA, USA. 978-1-133-18779-0. 104.
  11. Pedersen, M. (2010). Modular languages for systems and synthetic biology.
  12. Habibi, N., Mohd Hashim, S. Z., Rodriguez, C. A., & Samian, M. R. (2013). A Review of CADs, Languages and Data Models for Synthetic Biology. Jurnal Teknologi, 63(1).
  13. Encyclopedia: Beal. Jacob. Phillips. Andrew. Densmore. Douglas. Cai. Yizhi. Heinz. Koeppl. Douglas. Densmore. Gianluca. Setti. Mario. di Bernardo. Design and Analysis of Biomolecular Circuits. High-Level Programming Languages for Biomolecular Systems. 2011. Springer. New York Dordrecht Heidelberg London. 978-1-4419-6765-7. 10.1007/978-1-4419-6766-4. 241.