Sequence saturation mutagenesis explained

Sequence saturation mutagenesis (SeSaM) is a chemo-enzymatic random mutagenesis method applied for the directed evolution of proteins and enzymes. It is one of the most common saturation mutagenesis techniques. In four PCR-based reaction steps, phosphorothioate nucleotides are inserted in the gene sequence, cleaved and the resulting fragments elongated by universal or degenerate nucleotides. These nucleotides are then replaced by standard nucleotides, allowing for a broad distribution of nucleic acid mutations spread over the gene sequence with a preference to transversions and with a unique focus on consecutive point mutations, both difficult to generate by other mutagenesis techniques. The technique was developed by Professor Ulrich Schwaneberg at Jacobs University Bremen and RWTH Aachen University.

Technology, development and advantages

SeSaM has been developed in order to overcome several of the major limitations encountered when working with standard mutagenesis methods based on simple error-prone PCR (epPCR) techniques. These epPCR techniques rely on the use of polymerases and thus encounter limitations which mainly result from the circumstance that only single, but very rarely consecutive, nucleic acid substitutions are performed and that these substitutions occur usually at specific, favored positions only. In addition, transversions of nucleic acids are much less likely than transitions and require specifically designed polymerases with an altered bias.[1] These characteristics of epPCR catalyzed nucleic acid exchanges together with the fact that the genetic code is degenerated decrease the resulting diversity on the amino acid level. Synonymous substitutions lead to amino acid preservation or conservative mutations with similar physico-chemical properties such as size and hydrophobicity are strongly prevalent.[2] [3] By non-specific introduction of universal bases at every position in the gene sequence, SeSaM overcomes the polymerase bias favoring transitory substitutions at specific positions but opens the complete gene sequence to a diverse array of amino acid exchanges.[4] During the development of the SeSaM-method, several modifications were introduced that allowed for the introduction of several mutations simultaneously.[5] Another advancement of the method was achieved by introduction of degenerate bases instead of universal inosine and the use of optimized DNA polymerases, further increasing the ratio of introduced transversions.[6] This modified SeSaM-TV+ method in addition allows for and favors the introduction of two consecutive nucleotide exchanges, broadening strongly the spectrum of amino acids that may be substituted. By several optimizations including the application of an improved chimera polymerase in Step III of the SeSaM-TV-II method [7] [8] and the addition of an alternative degenerate nucleotide for efficient substitution of thymine and cytosine bases and increased mutation frequency in SeSaM-P/R,[9] generated libraries were further improved with regard to transversion number and the number of consecutive mutations was raised to 2–4 consecutive mutations with a rate of consecutive mutations of up to 30%.[10]

Procedure

The SeSaM-method consists of four PCR-based steps which can be executed within two to three days. Major parts include the incorporation of phosphorothioate nucleotides, the chemical fragmentation at these positions, the introduction of universal or degenerate bases and their replacement by natural nucleotides inserting point mutations. Initially, universal “SeSaM”-sequences are inserted by PCR with gene-specific primers binding in front of and behind the gene of interest. The gene of interest with its flanking regions is amplified to introduce these SeSaM_fwd and SeSaM_rev sequences and to generate template for consecutive PCR steps. These generated so-called fwd template and rev templates are now amplified in a PCR reaction with a pre-defined mixture of phosphorothioate and standard nucleotides to ensure an even distribution of inserted mutations over the full length of the gene. PCR products of Step 1 are cleaved specifically at the phosphorothioate bonds, generating a pool of single-stranded DNA fragments of different lengths starting from the universal primer. In Step 2 of SeSaM, the DNA single strands are elongated by one to several universal or degenerate bases (depending on the modification of SeSaM applied) catalyzed by terminal deoxynucleotidyl transferase (TdT). This step is the key step to introduce the characteristic consecutive mutations to randomly mutate entire codons. Subsequently, in Step 3 a PCR is performed recombining the single stranded DNA fragments with the corresponding full-length reverse template, generating the full-length double stranded gene including universal or degenerate bases in its sequence. By replacement of the universal/degenerate bases in the gene sequence by random standard nucleotides in SeSaM Step 4, a diverse array of full-length gene sequences with substitution mutations is generated, including a high load of transversions and subsequent substitution mutations.

Applications

SeSaM is used to directly optimize proteins on amino acid level, but also to preliminarily identify amino acid positions to test in saturation mutagenesis for the ideal amino acid exchange. SeSaM has been successfully applied in numerous directed evolution campaigns of different classes of enzymes for their improvement towards selected properties such as cellulase for ionic liquid resistance,[11] protease with increased detergent tolerance,[12] glucose oxidase for analytical application,[13] phytase with increased thermostability [14] and monooxygenase with improved catalytic efficiency using alternative electron donors.[15] SeSaM is patent protected by US770374 B2 in over 13 countries and is one of the platform technologies of SeSaM-Biotech GmbH.

Notes and References

  1. Wong. T.S.. Zhurina. D.. Schwaneberg. U.. 2006. The diversity challenge in directed protein evolution. 10.2174/138620706776843192. 16724918. Comb. Chem. High Throughput Screen.. 9. 4. 271–288.
  2. Füllen. G.. Youvan D.C.. 1994. Genetic algorithms and recursive ensemble mutagenesis in protein engineering. Complex Int.. 1.
  3. Wong. T.S.. Roccatano. D.. Zacharias. M.. Schwaneberg. U.. 2006. A statistical analysis of random mutagenesis methods used for directed protein evolution. 10.1016/j.jmb.2005.10.082. 16325201. J. Mol. Biol.. 355. 4. 858–871.
  4. Wong. T.S.. Tee. K.L.. Hauer. B.. Schwaneberg. U.. 2004. Sequence Saturation Mutagenesis (SeSaM): a novel method for directed protein evolution. 10.1093/nar/gnh028. 14872057. 373423. Nucleic Acids Res.. 32. 3. e26.
  5. Wong. T.S.. Tee. K.L.. Hauer. B.. Schwaneberg. U.. 2005. Sequence saturation mutagenesis with tunable mutation frequencies. 10.1016/j.ab.2005.03.023. 15866543. Anal. Biochem.. 341. 1. 187–189.
  6. Wong. T.S.. Roccatano. D.. Loakes. D.. Tee. K.L.. Schenk. A.. Hauer. B.. Schwaneberg. U.. 2008. Transversion-enriched sequence saturation mutagenesis (SeSaM-Tv+): A random mutagenesis method with consecutive nucleotide exchanges that complements the bias of error-prone PCR. 10.1002/biot.200700193. 18022859. Biotechnol. J.. 3. 1. 74–82. 9111046.
  7. d'Abbadie. M.. Hofreiter. M.. Vaisman. A.. Loakes. D.. Gasparutto. D.. Cadet. J.. Woodgate. R.. Pääbo. S.. Holliger. P.. 2007. Molecular breeding of polymerases for amplification of ancient DNA. 10.1038/nbt1321. 17632524. Nat. Biotechnol.. 25. 8. 939–943. 1978225.
  8. Mundhada. H.. Marienhagen. J.. Scacioc. A.. Schenk. A.. Roccatano. D.. Schwaneberg. U.. 2011. SeSaM-Tv-II generates a protein sequence space that is unobtainable by epPCR. 10.1002/cbic.201100010. 21671328. ChemBioChem. 12. 10. 1595–1601. 31951491.
  9. Ruff. A.J.. Marienhagen. J.. Verma. R.. Roccatano. D.. Genieser. H.-G.. Niemann. R.. Shivange. A.V.. Schwaneberg. U.. 2012. dRTP and dPTP a complementary nucleotide couple for the Sequence Saturation Mutagenesis (SeSaM) method. 10.1016/j.molcatb.2012.04.018. J Mol Catal B-Enzym.. 84. 40–47.
  10. Zhao. J.. Kardashliev. T.. Ruff. A.J.. Bocola. M.. Schwaneberg. M.. 2014. Lessons from diversity of directed evolution experiments by an analysis of 3000 mutations. 10.1002/bit.25302. 24904008. Biotechnol Bioeng. 111. 2. 2380–2389. 27297091.
  11. Pottkämper. J.. Barthen. P.. Ilmberger. N.. Schwaneberg. U.. Schenk. A.. Schulte. M.. Ignatiev. N.. Streit. W.. 2009. Applying metagenomics for the identification of bacterial cellulases that are stable in ionic liquids. 10.1039/B820157A. Green Chem.. 11. 7. 957–965.
  12. Li. Z.. Roccatano. D.. Lorenz. M.. Schwaneberg. U.. 2012. Directed evolution of subtilisin E into a highly active and guanidinium chloride- and sodium dodecylsulfate-tolerant protease. 10.1002/cbic.201100714. 22408062. ChemBioChem. 13. 5. 691–699. 5134486.
  13. Gutierrez. E.A.. Mundhada. H.. Meier. T.. Duefuel. H.. Bocola. M.. Schwaneberg. U.. 2013. Reengineered glucose oxidase for amperometric glucose determination in diabetes analytics. 10.1016/j.bios.2013.06.029. 23835222. Biosens. Bioelectron.. 50. 84–90.
  14. Shivange. A.V.. Roccatano. D.. Schwaneberg. U.. 2016. Iterative key-residues interrogation of a phytase with thermostability increasing substitutions identified in directed evolution. 10.1007/s00253-015-6959-5. 26403922. Appl. Microbiol. Biot.. 100. 1. 227–242. 10424164.
  15. Belsare. K.D.. Horn. T.. Ruff. A.J.. Martinez. R.. Magnusson. A.. Holtmann. D.. Schrader. J.. Schwaneberg. U.. 2017. Directed evolution of P450cin for mediated electron transfer. 10.1093/protein/gzw072. 28007937. Protein Engineering Design and Selection. 30. 2. 119–127. free.