A DNA construct is an artificially-designed segment of DNA borne on a vector that can be used to incorporate genetic material into a target tissue or cell.[1] A DNA construct contains a DNA insert, called a transgene, delivered via a transformation vector which allows the insert sequence to be replicated and/or expressed in the target cell. This gene can be cloned from a naturally occurring gene, or synthetically constructed.[2] The vector can be delivered using physical, chemical or viral methods. Typically, the vectors used in DNA constructs contain an origin of replication, a multiple cloning site, and a selectable marker. Certain vectors can carry additional regulatory elements based on the expression system involved.[3]
DNA constructs can be as small as a few thousand base pairs (kbp) of DNA carrying a single gene, using vectors such as plasmids or bacteriophages, or as large as hundreds of kbp for large-scale genomic studies using an artificial chromosome. A DNA construct may express wildtype protein, prevent the expression of certain genes by expressing competitors or inhibitors, or express mutant proteins, such as deletion mutations or missense mutations. DNA constructs are widely adapted in molecular biology research for techniques such as DNA sequencing, protein expression, and RNA studies.
The first standardized vector, pBR220, was designed in 1977 by researchers in Herbert Boyer’s lab. The plasmid contains various restriction enzyme sites and a stable antibiotic-resistance gene free from transposon activities.[4]
In 1982, Jeffrey Vieira and Joachim Messing described the development of M13mp7-derived pUC vectors that consist of a multiple cloning site and allow for more efficient sequencing and cloning using a set of universal M13 primers. Three years later, the currently popular pUC19 plasmid was engineered by the same scientists.[5]
The gene on a DNA sequence of interest can either be cloned from an existing sequence or developed synthetically. To clone a naturally occurring sequence in an organism, the organism's DNA is first cut with restriction enzymes, which recognize DNA sequences and cut them, around the target gene. The gene can then be amplified using polymerase chain reaction (PCR). Typically, this process includes using short sequences known as primers to initially hybridize to the target sequence; in addition, point mutations can be introduced in the primer sequences and then copied in each cycle in order to modify the target sequence.
It is also possible to synthesize a target DNA strand for a DNA construct. Short strands of DNA known as oligonucleotides can be developed using column-based synthesis, in which bases are added one at a time to a strand of DNA attached to a solid phase. Each base has a protecting group to prevent linkage that is not removed until the next base is ready to be added, ensuring that they are linked in the correct sequence. Oligonucleotides can also be synthesized on a microarray, which allows for tens of thousands of sequences to be synthesized at once, in order to reduce cost. To synthesize a larger gene, oligonucleotides are developed with overlapping sequences on the ends and then joined together. The most common method is called polymerase cycling assembly (PCA): fragments hybridize at the overlapping regions and are extended, and larger fragments are created in each cycle.
Once a sequence has been isolated, it must be inserted into a vector. The easiest way to do this is to cut the vector DNA using restriction enzymes; if the same enzymes were used to isolate the target sequence, then the same "overhang" sequences will be created on each end allowing for hybridization. Once the target gene has hybridized to the vector DNA, they can be joined using a DNA ligase. An alternative strategy uses recombination between homologous sites on the target gene and the vector sequence, eliminating the need for restriction enzymes.[6]
There are three general categories of DNA construct delivery: physical, chemical, and viral. Physical methods, which deliver the DNA by physically penetrating the cell, include microinjection, electroporation, and biolistics.[7] Chemical methods rely on chemical reactions to deliver the DNA and include transformation with cells made competent using calcium phosphate as well as delivery via lipid nanoparticles.[8] [9] Viral methods use a variety of viral vectors to deliver the DNA, including adenovirus, lentivirus, and herpes simplex virus[10]
In addition to the target gene, there are three important elements in a vector: an origin of replication, a selectable marker, and a multiple cloning site. An origin of replication is a DNA sequence that starts the process of DNA replication, allowing the vector to clone itself. A multiple cloning site contains binding sites for several restriction enzymes, making it easier to insert different DNA sequences into the vector. A selectable marker confers some trait that can be easily selected for in a host cell, so that it can be determined whether transformation was successful. The most common selectable markers are genes for antibiotic resistance, so that host cells without the construct will die off when exposed to the antibody and only host cells with the construct will remain.
DNA constructs can be used to produce proteins, including both naturally occurring proteins and engineered mutant proteins. These proteins can be used to make therapeutic products, such as pharmaceuticals and antibodies. DNA constructs can also change the expression levels of other genes by expressing regulatory sequences such as promoters and inhibitors. Additionally, DNA constructs can be used for research such as creating genomic libraries, sequencing cloned DNA, and studying RNA and protein expression.