Molecular genetics is a branch of biology that addresses how differences in the structures or expression of DNA molecules manifests as variation among organisms. Molecular genetics often applies an "investigative approach" to determine the structure and/or function of genes in an organism's genome using genetic screens.[1]
The field of study is based on the merging of several sub-fields in biology: classical Mendelian inheritance, cellular biology, molecular biology, biochemistry, and biotechnology. It integrates these disciplines to explore things like genetic inheritance, gene regulation and expression, and the molecular mechanism behind various life processes.
A key goal of molecular genetics is to identify and study genetic mutations. Researchers search for mutations in a gene or induce mutations in a gene to link a gene sequence to a specific phenotype. Therefore molecular genetics is a powerful methodology for linking mutations to genetic conditions that may aid the search for treatments of various genetics diseases.
The discovery of DNA as the blueprint for life and breakthroughs in molecular genetics research came from the combined works of many scientists. In 1869, chemist Johann Friedrich Miescher, who was researching the composition of white blood cells, discovered and isolated a new molecule that he named nuclein from the cell nucleus, which would ultimately be the first discovery of the molecule DNA that was later determined to be the molecular basis of life. He determined it was composed of hydrogen, oxygen, nitrogen and phosphorus.[2] Biochemist Albrecht Kosell identified nuclein as a nucleic acid and provided its name deoxyribonucleic acid (DNA). He continued to build on that by isolating the basic building blocks of DNA and RNA; made up of the nucleotides: adenine, guanine, thymine, cytosine. and uracil. His work on nucleotides earned him a Nobel Prize in Physiology.[3]
In the early 1900s, Gregor Mendel, who became known as one of the fathers of genetics, made great contributions to the field of genetics through his various experiments with pea plants where he was able to discover the principles of inheritance such as recessive and dominant traits, without knowing what genes where composed of.[4] In the mid 19th century, anatomist Walther Flemming, discovered what we now know as chromosomes and the separation process they undergo through mitosis. His work along with Theodor Boveri first came up with the Chromosomal Theory of Inheritance, which helped explain some of the patterns Mendel had observed much earlier.[5]
For molecular genetics to develop as a discipline, several scientific discoveries were necessary. The discovery of DNA as a means to transfer the genetic code of life from one cell to another and between generations was essential for identifying the molecule responsible for heredity. Molecular genetics arose initially from studies involving genetic transformation in bacteria. In 1944 Avery, McLeod and McCarthy[6] isolated DNA from a virulent strain of S. pneumoniae, and using just this DNA were able to convert a harmless strain to virulence. They called the uptake, incorporation and expression of DNA by bacteria "transformation". This finding suggested that DNA is the genetic material of bacteria. [7] Bacterial transformation is often induced by conditions of stress, and the function of transformation appears to be repair of genomic damage.
In 1950, Erwin Chargaff derived rules that offered evidence of DNA being the genetic material of life. These were "1) that the base composition of DNA varies between species and 2) in natural DNA molecules, the amount of adenine (A) is equal to the amount of thymine (T), and the amount of guanine (G) is equal to the amount of cytosine (C)."[8] These rules, known as Chargaff's rules, helped to understand of molecular genetics. In 1953 Francis Crick and James Watson, building upon the X-ray crystallography work done by Rosalind Franklin and Maurice Wilkins, were able to derive the 3-D double helix structure of DNA.[9]
The phage group was an informal network of biologists centered on Max Delbrück that contributed substantially to molecular genetics and the origins of molecular biology during the period from about 1945 to 1970.[10] The phage group took its name from bacteriophages, the bacteria-infecting viruses that the group used as experimental model organisms. Studies by molecular geneticists affiliated with this group contributed to understanding how gene-encoded proteins function in DNA replication, DNA repair and DNA recombination, and on how viruses are assembled from protein and nucleic acid components (molecular morphogenesis). Furthermore, the role of chain terminating codons was elucidated. One noteworthy study was performed by Sydney Brenner and collaborators using "amber" mutants defective in the gene encoding the major head protein of bacteriophage T4.[11] This study demonstrated the co-linearity of the gene with its encoded polypeptide, thus providing strong evidence for the "sequence hypothesis" that the amino acid sequence of a protein is specified by the nucleotide sequence of the gene determining the protein.
The isolation of a restriction endonuclease in E. coli by Arber and Linn in 1969 opened the field of genetic engineering.[12] Restriction enzymes were used to linearize DNA for separation by electrophoresis and Southern blotting allowed for the identification of specific DNA segments via hybridization probes.[13] [14] In 1971, Berg utilized restriction enzymes to create the first recombinant DNA molecule and first recombinant DNA plasmid.[15] In 1972, Cohen and Boyer created the first recombinant DNA organism by inserting recombinant DNA plasmids into E. coli, now known as bacterial transformation, and paved the way for molecular cloning.[16] The development of DNA sequencing techniques in the late 1970s, first by Maxam and Gilbert, and then by Frederick Sanger, was pivotal to molecular genetic research and enabled scientists to begin conducting genetic screens to relate genotypic sequences to phenotypes.[17] Polymerase chain reaction (PCR) using Taq polymerase, invented by Mullis in 1985, enabled scientists to create millions of copies of a specific DNA sequence that could be used for transformation or manipulated using agarose gel separation.[18] A decade later, the first whole genome was sequenced (Haemophilus influenzae), followed by the eventual sequencing of the human genome via the Human Genome Project in 2001.[19] The culmination of all of those discoveries was a new field called genomics that links the molecular structure of a gene to the protein or RNA encoded by that segment of DNA and the functional expression of that protein within an organism.[20] Today, through the application of molecular genetic techniques, genomics is being studied in many model organisms and data is being collected in computer databases like NCBI and Ensembl. The computer analysis and comparison of genes within and between different species is called bioinformatics, and links genetic mutations on an evolutionary scale.[21]
This image shows an example of the central dogma using a DNA strand being transcribed then translated and showing important enzymes used in the processesThe Central Dogma plays a key role in the study of molecular genetics. The Central Dogma states that DNA replicates itself, DNA is transcribed into RNA, and RNA is translated into proteins.[22] Along with the Central Dogma, the genetic code is used in understanding how RNA is translated into proteins. Replication of DNA and transcription from DNA to mRNA occurs in the nucleus while translation from RNA to proteins occurs in the ribosome.[23] The genetic code is made of four interchangeable parts othe DNA molecules, called "bases": adenine, cytosine, uracil (in RNA; thymine in DNA), and guanine and is redundant, meaning multiple combinations of these base pairs (which are read in triplicate) produce the same amino acid.[24] Proteomics and genomics are fields in biology that come out of the study of molecular genetics and the Central Dogma.[25]
An organism's genome is made up by its entire set of DNA and is responsible for its genetic traits, function and development. The composition of DNA itself is an essential component to the field of molecular genetics; it is the basis of how DNA is able to store genetic information, pass it on, and be in a format that can be read and translated.[26]
DNA is a double stranded molecule, with each strand oriented in an antiparallel fashion. Nucleotides are the building blocks of DNA, each composed of a sugar molecule, a phosphate group and one of four nitrogenous bases: adenine, guanine, cytosine, and thymine. A single strand of DNA is held together by covalent bonds, while the two antiparallel strands are held together by hydrogen bonds between the nucleotide bases. Adenine binds with thymine and cytosine binds with guanine. It is these four base sequences that form the genetic code for all biological life and contains the information for all the proteins the organism will be able to synthesize.
Its unique structure allows DNA to store and pass on biological information across generations during cell division. At cell division, cells must be able to copy its genome and pass it on to daughter cells. This is possible due to the double-stranded structure of DNA because one strand is complementary to its partner strand, and therefore each of these strands can act as a template strand for the formation of a new complementary strand. This is why the process of DNA replication is known as a semiconservative process.[27]
Forward genetics is a molecular genetics technique used to identify genes or genetic mutations that produce a certain phenotype. In a genetic screen, random mutations are generated with mutagens (chemicals or radiation) or transposons and individuals are screened for the specific phenotype. Often, a secondary assay in the form of a selection may follow mutagenesis where the desired phenotype is difficult to observe, for example in bacteria or cell cultures. The cells may be transformed using a gene for antibiotic resistance or a fluorescent reporter so that the mutants with the desired phenotype are selected from the non-mutants.
Mutants exhibiting the phenotype of interest are isolated and a complementation test may be performed to determine if the phenotype results from more than one gene. The mutant genes are then characterized as dominant (resulting in a gain of function), recessive (showing a loss of function), or epistatic (the mutant gene masks the phenotype of another gene). Finally, the location and specific nature of the mutation is mapped via sequencing.[28] Forward genetics is an unbiased approach and often leads to many unanticipated discoveries, but may be costly and time consuming. Model organisms like the nematode worm Caenorhabditis elegans, the fruit fly Drosophila melanogaster, and the zebrafish Danio rerio have been used successfully to study phenotypes resulting from gene mutations.[29]
Reverse genetics is the term for molecular genetics techniques used to determine the phenotype resulting from an intentional mutation in a gene of interest. The phenotype is used to deduce the function of the un-mutated version of the gene. Mutations may be random or intentional changes to the gene of interest. Mutations may be a mis-sense mutation caused by nucleotide substitution, a nucleotide addition or deletion to induce a frameshift mutation, or a complete addition/deletion of a gene or gene segment. The deletion of a particular gene creates a gene knockout where the gene is not expressed and a loss of function results (e.g. knockout mice). Mis-sense mutations may cause total loss of function or result in partial loss of function, known as a knockdown. Knockdown may also be achieved by RNA interference (RNAi).[30] Alternatively, genes may be substituted into an organism's genome (also known as a transgene) to create a gene knock-in and result in a gain of function by the host.[31] Although these techniques have some inherent bias regarding the decision to link a phenotype to a particular function, it is much faster in terms of production than forward genetics because the gene of interest is already known.
Molecular genetics is a scientific approach that utilizes the fundamentals of genetics as a tool to better understand the molecular basis of a disease and biological processes in organisms. Below are some tools readily employed by researchers in the field.
Microsatellites or single sequence repeats (SSRS) are short repeating segment of DNA composed to 6 nucleotides at a particular location on the genome that are used as genetic marker. Researchers can analyze these microsatellites in techniques such DNA fingerprinting and paternity testing since these repeats are highly unique to individuals/families. a can also be used in constructing genetic maps and to studying genetic linkage to locate the gene or mutation responsible for specific trait or disease. Microsatellites can also be applied to population genetics to study comparisons between groups.[32]
Genome wide association studies (GWAS) are a technique that relies on single nucleotide polymorphisms (SNPs) to study genetic variations in populations that can be associated with a particular disease. The Human Genome Project mapped the entire human genome and has made this approach more readily available and cost effective for researchers to implement. In order to conduct a GWAS researchers use two groups, one group that has the disease researchers are studying and another that acts as the control that does not have that particular disease. DNA samples are obtained from participants and their genome can then be derived through lab machinery and quickly surveyed to compare participants and look for SNPs that can potentially be associated with the disease. This technique allows researchers to pinpoint genes and locations of interest in the human genome that they can then further study to identify that cause of the disease.[33]
Karyotyping allows researchers to analyze chromosomes during metaphase of mitosis, when they are in a condensed state. Chromosomes are stained and visualized through a microscope to look for any chromosomal abnormalities. This technique can be used to detect congenital genetic disorder such as down syndrome, identify gender in embryos, and diagnose some cancers that are caused by chromosome mutations such as translocations. [34]
Genetic engineering is an emerging field of science, and researcher are able to leverage molecular genetic technology to modify the DNA of organisms and create genetically modified and enhanced organisms for industrial, agricultural and medical purposes. This can be done through genome editing techniques, which can involve modifying base pairings in a DNA sequence, or adding and deleting certain regions of DNA.[35]
Gene editing allows scientists to alter/edit an organism's DNA. One way to due this is through the technique Crispr/Cas9, which was adapted from the genome immune defense that is naturally occurring in bacteria. This technique relies on the protein Cas9 which allows scientists to make a cut in strands of DNA at a specific location, and it uses a specialized RNA guide sequence to ensure the cut is made in the proper location in the genome. Then scientists use DNAs repair pathways to induce changes in the genome; this technique has wide implications for disease treatment.[36]
Molecular genetics has wide implications in medical advancement and understanding the molecular basis of a disease allows the opportunity for more effective diagnostic and therapies. One of the goals of the field is personalized medicine, where an individual's genetics can help determine the cause and tailor the cure for a disease they are afflicted with and potentially allow for more individualized treatment approaches which could be more effective. For example, certain genetic variations in individuals could make them more receptive to a particular drug while other could have a higher risk of adverse reaction to treatments. So this information would allow researchers and clinicals to make the most informed decisions about treatment efficacy for patients rather than the standard trial and error approach.[37]
Forensic genetics plays an essential role for criminal investigations through that use of various molecular genetic techniques. One common technique is DNA fingerprinting which is done using a combination of molecular genetic techniques like Polymerase Chain reaction (PCR) and gel electrophoresis. PCR is a technique that allows a target DNA sequence to be amplified, meaning even a tiny quantity of DNA from a crime scene can be extracted and replicated many times to provide a sufficient amount of material for analysis. Gel electrophoresis allows the DNA sequence to be separated based on size, and the pattern that is derived is known as DNA fingerprinting and is unique to each individual. This combination of molecular genetic techniques allows a simple DNA sequence to be extracted, amplified, analyzed and compared with others and is a standard technique used in forensics.[38]