TALE-likes explained

Transcription Activator-Like Effector-Likes (TALE-likes) are a group of bacterial DNA binding proteins named for the first and still best-studied group, the TALEs of Xanthomonas bacteria. TALEs are important factors in the plant diseases caused by Xanthomonas bacteria, but are known primarily for their role in biotechnology as programmable DNA binding proteins, particularly in the context of TALE nucleases. TALE-likes have additionally been found in many strains of the Ralstonia solanacearum bacterial species complex, in Paraburkholderia rhizoxinica strain HKI 454, and in two unknown marine bacteria. Whether or not all these proteins form a single phylogenetic grouping is as yet unclear.

The unifying feature of the TALE-likes is their tandem arrays of DNA binding repeats. These repeats are, with few exceptions, 33-35 amino acids in length and composed of two alpha-helices on either side of a flexible loop containing the DNA base binding residues, with neighbouring repeats joined by flexible linker loops.[1] Evidence for this common structure comes in part from solved crystal structures of TALEs[2] and a Burkholderia TALE-like (BAT),[3] but also from the conservation of the code that all TALE-likes use to recognise DNA sequences. In fact, TALE, RipTAL, and BAT repeats can be mixed and matched to generate functional DNA-binding proteins with varying affinity.[4]

TALEs

TALEs are the first identified, best-studied and largest group within the TALE-likes. TALEs are found throughout the bacterial genus Xanthomonas,[5] which comprises mostly plant pathogens. Those TALEs which have been studied have all been shown to be secreted by the Type III secretion system into host plant cells. Once inside the host cell they translocate to the nucleus, bind specific DNA sequences within host promoters and turn on downstream genes. Every part of this process is thought to be conserved across all TALEs. The single meaningful difference between individual TALEs, based on current understanding, is the specific DNA sequence that each TALE binds. TALEs from even closely related strains differ in the composition of repeats that make up their DNA binding domain.[6] Repeat composition determines DNA binding preference. In particular, position 13 of each repeat confers that repeat's DNA base preference. During early research it was noted that almost all the differences between repeats of a single TALE repeat array are found in positions 12 and 13, and this finding led to the hypothesis that these residues determine base preference.[7] In fact, repeat positions 12 and 13, referred to jointly as the Repeat Variable Diresidue (RVD), are commonly said to confer base specificity despite clear evidence that position 13 is the base-determining residue.[8] In addition to the repeat domain, TALEs also possess a number of conserved features in the domains flanking the repeats. These include domains for type III secretion, nuclear localization and transcriptional activation. These domains allow TALEs to carry out their biological role as effector proteins secreted into host plant cells to activate expression of specific host genes.
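The link between repeat composition and target sequence can be sketched in a few lines of code. This is a minimal illustration and not a method from the cited sources. The RVD-to-base table is an assumption added here: it lists four widely reported associations (NI with A, HD with C, NG with T, NN with G) that are not given in this article, and it simplifies each RVD to a single base. The function names are hypothetical.

```python
from __future__ import annotations

# Illustrative RVD -> preferred base table (assumption, simplified:
# real RVDs can tolerate more than one base, e.g. NN also accepts A).
RVD_TO_BASE = {"NI": "A", "HD": "C", "NG": "T", "NN": "G"}


def base_determining_residue(rvd: str) -> str:
    """Return the residue at repeat position 13 (second letter of the RVD)."""
    if len(rvd) != 2:
        raise ValueError(f"an RVD has exactly two residues, got {rvd!r}")
    return rvd[1]


def predict_target(rvds: list[str]) -> str:
    """Predict the DNA bases bound by a core repeat array, one base per repeat."""
    try:
        return "".join(RVD_TO_BASE[rvd] for rvd in rvds)
    except KeyError as exc:
        raise ValueError(
            f"RVD {exc.args[0]!r} is not in the simplified table"
        ) from None


if __name__ == "__main__":
    repeat_array = ["HD", "NI", "NG", "NN", "NG"]  # made-up example array
    print(predict_target(repeat_array))  # CATGT
```

The sketch covers only the core repeats. It ignores any base preference contributed by sequences outside the core repeat array.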

Diversity and evolution

Whilst the RVD positions are commonly the only variable positions within a single TALE repeat array, there are more differences when comparing repeat arrays of different TALEs. The diversity of TALEs across the Xanthomonas genus is considerable, but a particularly striking finding is that the evolutionary history one arrives at by comparing repeat compositions differs from that found when comparing non-repeat sequences.[6] Repeat arrays of TALEs are thought to evolve rapidly, with a number of recombinatorial processes suggested to shape repeat array evolution.[5] Recombination of TALE repeat arrays has been demonstrated in a forced-selection experiment.[9] This evolutionary dynamism is thought to be made possible by the very high sequence identity of TALE repeats, which is a unique feature of TALEs as opposed to other TALE-likes.
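One way to put a number on "very high sequence identity" is to compare the repeats of an array position by position. The following is a hedged sketch only. The three repeats are toy sequences built from a single 34-residue backbone resembling a commonly cited TALE repeat, varied only at positions 12 and 13; they are assumptions for illustration, not data from the cited studies, and the function names are hypothetical.

```python
from __future__ import annotations

from itertools import combinations

# Toy data (assumption): one backbone with three different RVDs inserted
# at positions 12-13, giving three 34-residue repeats.
PREFIX = "LTPEQVVAIAS"            # positions 1-11
SUFFIX = "GGKQALETVQRLLPVLCQAHG"  # positions 14-34
TOY_REPEATS = [f"{PREFIX}{rvd}{SUFFIX}" for rvd in ("HD", "NG", "NI")]


def percent_identity(a: str, b: str) -> float:
    """Percentage of aligned positions with the same residue (no gaps)."""
    if len(a) != len(b):
        raise ValueError("repeats must have equal length for this comparison")
    return 100.0 * sum(x == y for x, y in zip(a, b)) / len(a)


def mean_pairwise_identity(repeats: list[str]) -> float:
    """Average percent identity over all pairs of repeats in an array."""
    pairs = list(combinations(repeats, 2))
    if not pairs:
        raise ValueError("need at least two repeats")
    return sum(percent_identity(a, b) for a, b in pairs) / len(pairs)


def variable_positions(repeats: list[str]) -> list[int]:
    """1-based positions at which the repeats are not all identical."""
    if len({len(r) for r in repeats}) > 1:
        raise ValueError("repeats must have equal length")
    return [
        i for i, column in enumerate(zip(*repeats), start=1)
        if len(set(column)) > 1
    ]


if __name__ == "__main__":
    print(variable_positions(TOY_REPEATS))  # [12, 13]
    print(round(mean_pairwise_identity(TOY_REPEATS), 1))
```

The comparison assumes equal-length, ungapped repeats. Repeat arrays with more sequence differences between repeats, as described for other TALE-likes, would need a proper alignment step first.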

T-zero

Another unique feature of TALEs is a set of four repeat structures at the N-terminal flank of the core repeat array. These structures, termed non-canonical or degenerate repeats, have been shown to be vital for DNA binding,[10] though all but one do not contact DNA bases and thus make no contribution to sequence preference. The one exception is repeat -1, which confers a fixed T-zero preference on all TALEs. This means that the target sequences of TALEs are always preceded by a thymine base. This is thought to be common to all TALEs, with the possible exception of TalC from Xanthomonas oryzae pv. oryzae strain AXO1947.[11]

RipTALs

UniProt: Q8XYE3
Symbol: brg11
Name: TAL effector protein Brg11
Organism: Ralstonia solanacearum (strain GMI1000)

Discovery and molecular properties

It was noted in the 2002 publication of the genome of reference strain Ralstonia solanacearum GMI1000 that its genome encodes a protein similar to Xanthomonas TALEs.[12] Based on similar domain structure and repeat sequences it was presumed that this gene and homologs in other Ralstonia strains would encode proteins with the same molecular properties as TALEs, including sequence-specific DNA binding. In 2013 this was confirmed by two studies.[13][14] These genes and the proteins they encode are referred to as RipTALs (Ralstonia injected protein TALE-like) in line with the standard nomenclature of Ralstonia effectors.[15] Whilst the DNA binding code of the core repeats is conserved with TALEs, RipTALs do not share the T-zero preference; instead they have a strict G-zero requirement.[13] In addition, repeats within a single RipTAL repeat array have multiple sequence differences beyond the RVD positions, unlike the near-identical repeats of TALEs.

RipTALs have been found in all four phylotypes of R. solanacearum, which suggests that they are an ancestral feature of this clade. Despite differences in the flanking domains, the sequences their RVDs target are highly similar.[16]

Biological role

Several lines of evidence support the idea that RipTALs function as effector proteins, promoting bacterial growth or disease by manipulating the expression of plant genes. They are secreted into plant cells by the Type III secretion system, which is the main delivery system for effector proteins.[17] They localize to the cell nucleus and are able to function as sequence-specific transcription factors in plant cells.[13] In addition, a strain lacking its RipTAL was shown to grow more slowly inside eggplant leaf tissue than the wild type.[18] Furthermore, a study based on DNA polymorphisms in ripTAL repeat domain sequences and host plants found a statistically significant connection between host plant and repeat domain variants.[19] This is expected if the RipTALs of different strains are adapted to target genes in specific host plants. Despite this, no target genes have been identified for any RipTAL.

BATs

UniProt: E5AV36
Symbol: bat1
Name: Burkholderia TALE-like protein 1
Organism: Paraburkholderia rhizoxinica

Discovery

The publication of the genome of bacterial strain Paraburkholderia rhizoxinica HKI 454 in 2011[20] led to the discovery of a set of TALE-like genes that differed considerably in nature from the TALEs and RipTALs. The proteins encoded by these genes were studied for their DNA binding properties by two groups independently and named the Bats (Burkholderia TALE-likes) or BurrH.[21][22] This research showed that the repeat units of the Burkholderia TALE-likes bind DNA with the same code as TALEs, governed by position 13 of each repeat. There are, however, a number of differences.

Biological role

Burkholderia TALE-likes are composed almost entirely of repeats, lacking the large non-repetitive domains found flanking the repeats in TALEs and RipTALs. Those domains are key to the functions of TALEs and RipTALs, allowing them to infiltrate the plant nucleus and turn on gene expression. It is therefore currently unclear what the biological roles of Burkholderia TALE-likes are. What is clear is that they are not effector proteins secreted into plant cells to act as transcription factors, which is the biological role of TALEs and RipTALs. It is not unexpected that their biological roles may differ from those of TALEs and RipTALs, since the lifestyle of the bacterium they derive from is very unlike that of TALE- and RipTAL-bearing bacteria. B. rhizoxinica is an endosymbiont living inside a fungus, Rhizopus microsporus, which is a plant pathogen. The same fungus is also an opportunistic human pathogen in immunocompromised patients, but whereas B. rhizoxinica is necessary for pathogenicity on plant hosts, it is irrelevant to human infection.[23] It is unclear whether the Burkholderia TALE-likes are ever secreted into the fungus, let alone into host plants.

Uses in biotechnology

As noted in the publications on Burkholderia TALE-likes, there may be some advantages to using these proteins, rather than TALEs, as a scaffold for programmable DNA-binding proteins that function as transcription factors or designer nucleases. A Burkholderia TALE-like has been fused with a FokI nuclease, analogous to a TALEN.[3] Advantages include a shorter repeat size, a more compact domain structure (no large non-repeat domains), and greater repeat sequence diversity, which enables the use of PCR on the genes encoding them and makes them less vulnerable to recombinatorial repeat loss. In addition, Burkholderia TALE-likes have no T-zero requirement, relaxing the constraints on DNA target selection. However, few uses of Burkholderia TALE-likes as programmable DNA binding proteins have been published outside of the original characterization publications.
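The effect of the position-zero rule on target selection can be illustrated with a small scan over a DNA string. This is a rough sketch under stated assumptions: it encodes only the preferences described in this article (T-zero for TALEs, G-zero for RipTALs, no such requirement for Burkholderia TALE-likes), checks the forward strand only, and ignores the RVD code of the core repeats. The names `POSITION_ZERO` and `candidate_sites` are hypothetical.

```python
from __future__ import annotations

# Base required immediately before the core target, per family
# (None = no position-zero requirement). Taken from the article text.
POSITION_ZERO = {"TALE": "T", "RipTAL": "G", "Bat": None}


def candidate_sites(dna: str, family: str, n_repeats: int) -> list[int]:
    """Return 0-based start indices of core target windows of length
    n_repeats whose preceding base satisfies the family's rule."""
    if family not in POSITION_ZERO:
        raise ValueError(f"unknown family {family!r}")
    if n_repeats < 1:
        raise ValueError("n_repeats must be at least 1")
    dna = dna.upper()
    required = POSITION_ZERO[family]
    # With a position-zero rule the window cannot start at index 0,
    # because there would be no preceding base to check.
    first = 0 if required is None else 1
    last = len(dna) - n_repeats
    return [
        i for i in range(first, last + 1)
        if required is None or dna[i - 1] == required
    ]


if __name__ == "__main__":
    sequence = "ATGCTGA"  # made-up example sequence
    for family in POSITION_ZERO:
        print(family, candidate_sites(sequence, family, n_repeats=3))
```

On this toy sequence the scan finds one admissible window each for the TALE and RipTAL rules and five for the unconstrained case, which is the sense in which dropping the position-zero requirement relaxes target selection.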

MOrTLs

Discovery

In 2007 the results of a metagenomic sweep of the world's oceans by the J. Craig Venter Institute were made publicly available.[24] The 2014 paper on Burkholderia TALE-likes[22] was also the first to report that two entries from that database resembled TALE-likes, based on sequence similarity. These were further characterized and assessed for their DNA-binding potential in 2015.[25] The repeat units encoded by these sequences were found to mediate DNA binding with base preference matching the TALE code, and were judged likely to form structures nearly identical to Bat1 repeats based on molecular dynamics simulations. The proteins encoded by these DNA sequences were therefore designated Marine Organism TALE-likes (MOrTLs) 1 and 2.[25] Similar sequences have also been found in metagenomes.[26]

Evolutionary relationship to other TALE-likes

Whilst repeats of MOrTL1 and 2 both conform structurally and functionally to the TALE-like norm, they differ considerably at the sequence level both from all other TALE-likes and from one another. It is not known whether they are truly homologous to the other TALE-likes, and thus constitute together with the TALEs, RipTALs and Bats a true protein-family. Alternatively, they may have evolved independently. It is particularly difficult to judge the relationship to the other TALE-likes because almost nothing is known of the organisms that MOrTL1 and MOrTL2 come from. It is known only that they were found in two separate sea-water samples from the Gulf of Mexico and are likely to be bacteria based on size-exclusion before DNA sequencing.[25]

Legal status

A patent for BATs and marine TALE-likes in protein engineering was filed in July 2012; it is currently pending in all jurisdictions.[27]

Notes and References

  1. Deng D, Yan C, Wu J, Pan X, Yan N . Revisiting the TALE repeat . Protein & Cell . 5 . 4 . 297–306 . April 2014 . 24622844 . 3978159 . 10.1007/s13238-014-0035-2 .
  2. Deng D, Yan C, Pan X, Mahfouz M, Wang J, Zhu JK, Shi Y, Yan N . Structural basis for sequence-specific recognition of DNA by TAL effectors . Science . 335 . 6069 . 720–3 . February 2012 . 22223738 . 3586824 . 10.1126/science.1215670 . 2012Sci...335..720D .
  3. Stella S, Molina R, López-Méndez B, Juillerat A, Bertonati C, Daboussi F, Campos-Olivas R, Duchateau P, Montoya G . BuD, a helix-loop-helix DNA-binding domain for genome modification . Acta Crystallographica. Section D, Biological Crystallography . 70 . Pt 7 . 2042–52 . July 2014 . 25004980 . 4089491 . 10.1107/S1399004714011183 . 2014AcCrD..70.2042S .
  4. de Lange O, Schandry N, Wunderlich M, Berendzen KW, Lahaye T . Exploiting the sequence diversity of TALE-like repeats to vary the strength of dTALE-promoter interactions . Synthetic Biology . January 2017 . 2 . 1 . ysx004 . 10.1093/synbio/ysx004 . 32995505 . 7445789 . free.
  5. Ferreira RM, de Oliveira AC, Moreira LM, Belasque J, Gourbeyre E, Siguier P, Ferro MI, Ferro JA, Chandler M, Varani AM . A TALE of transposition: Tn3-like transposons play a major role in the spread of pathogenicity determinants of Xanthomonas citri and other xanthomonads . mBio . 6 . 1 . e02505-14 . February 2015 . 25691597 . 4337579 . 10.1128/mBio.02505-14 .
  6. Pérez-Quintero AL, Lamy L, Gordon JL, Escalon A, Cunnac S, Szurek B, Gagnevin L . QueTAL: a suite of tools to classify and compare TAL effectors functionally and phylogenetically . Frontiers in Plant Science . 6 . 545 . 3 August 2015 . 26284082 . 4522561 . 10.3389/fpls.2015.00545 . free .
  7. Boch J, Schornack S . Unraveling a 20-Year Enigma . IS-MPMI Reporter . 2010 . 1 . 3–4 .
  8. de Lange O, Binder A, Lahaye T . From dead leaf, to new life: TAL effectors as tools for synthetic biology . The Plant Journal . 78 . 5 . 753–71 . June 2014 . 24602153 . 10.1111/tpj.12431 . free .
  9. Yang B, Sugio A, White FF . Avoidance of host recognition by alterations in the repetitive and C-terminal regions of AvrXa7, a type III effector of Xanthomonas oryzae pv. oryzae . Molecular Plant-Microbe Interactions . 18 . 2 . 142–9 . February 2005 . 15720083 . 10.1094/MPMI-18-0142 . free .
  10. Gao H, Wu X, Chai J, Han Z . Crystal structure of a TALE protein reveals an extended N-terminal DNA binding region . Cell Research . 22 . 12 . 1716–20 . December 2012 . 23147789 . 3515758 . 10.1038/cr.2012.156 .
  11. Yu Y, Streubel J, Balzergue S, Champion A, Boch J, Koebnik R, Feng J, Verdier V, Szurek B . Colonization of rice leaf blades by an African strain of Xanthomonas oryzae pv. oryzae depends on a new TAL effector that induces the rice nodulin-3 Os11N3 gene . Molecular Plant-Microbe Interactions . 24 . 9 . 1102–13 . September 2011 . 21679014 . 10.1094/MPMI-11-10-0254 . free .
  12. Salanoubat M, Genin S, Artiguenave F, Gouzy J, Mangenot S, Arlat M, Billault A, Brottier P, Camus JC, Cattolico L, Chandler M, Choisne N, Claudel-Renard C, Cunnac S, Demange N, Gaspin C, Lavie M, Moisan A, Robert C, Saurin W, Schiex T, Siguier P, Thébault P, Whalen M, Wincker P, Levy M, Weissenbach J, Boucher CA . Genome sequence of the plant pathogen Ralstonia solanacearum . Nature . 415 . 6871 . 497–502 . January 2002 . 11823852 . 10.1038/415497a . free .
  13. de Lange O, Schreiber T, Schandry N, Radeck J, Braun KH, Koszinowski J, Heuer H, Strauß A, Lahaye T . Breaking the DNA-binding code of Ralstonia solanacearum TAL effectors provides new possibilities to generate plant resistance genes against bacterial wilt disease . The New Phytologist . 199 . 3 . 773–86 . August 2013 . 23692030 . 10.1111/nph.12324 . free .
  14. Li L, Atef A, Piatek A, Ali Z, Piatek M, Aouida M, Sharakuu A, Mahjoub A, Wang G, Khan S, Fedoroff NV, Zhu JK, Mahfouz MM . Characterization and DNA-binding specificities of Ralstonia TAL-like effectors . Molecular Plant . 6 . 4 . 1318–30 . July 2013 . 23300258 . 3716395 . 10.1093/mp/sst006 .
  15. Peeters N, Carrère S, Anisimova M, Plener L, Cazalé AC, Genin S . Repertoire, unified nomenclature and evolution of the Type III effector gene set in the Ralstonia solanacearum species complex . BMC Genomics . 14 . 1 . 859 . December 2013 . 24314259 . 3878972 . 10.1186/1471-2164-14-859 . free .
  16. Schandry N, de Lange O, Prior P, Lahaye T . TALE-Like Effectors Are an Ancestral Feature of the Ralstonia solanacearum Species Complex and Converge in DNA Targeting Specificity . Frontiers in Plant Science . 7 . 1225 . 17 August 2016 . 27582755 . 4987410 . 10.3389/fpls.2016.01225 . free .
  17. Mukaihara T, Tamura N, Iwabuchi M . Genome-wide identification of a large repertoire of Ralstonia solanacearum type III effector proteins by a new functional screen . Molecular Plant-Microbe Interactions . 23 . 3 . 251–62 . March 2010 . 20121447 . 10.1094/mpmi-23-3-0251 . free .
  18. Macho AP, Guidot A, Barberis P, Beuzón CR, Genin S . A competitive index assay identifies several Ralstonia solanacearum type III effector mutant strains with reduced fitness in host plants . Molecular Plant-Microbe Interactions . 23 . 9 . 1197–205 . September 2010 . 20687809 . 10.1094/MPMI-23-9-1197 . free .
  19. Heuer H, Yin YN, Xue QY, Smalla K, Guo JH . Repeat domain diversity of avrBs3-like genes in Ralstonia solanacearum strains and association with host preferences in the field . Applied and Environmental Microbiology . 73 . 13 . 4379–84 . July 2007 . 17468277 . 1932761 . 10.1128/AEM.00367-07 . 2007ApEnM..73.4379H .
  20. Lackner G, Moebius N, Partida-Martinez LP, Boland S, Hertweck C . Evolution of an endofungal lifestyle: Deductions from the Burkholderia rhizoxinica genome . BMC Genomics . 12 . 1 . 210 . May 2011 . 21539752 . 3102044 . 10.1186/1471-2164-12-210 . free .
  21. de Lange O, Wolf C, Dietze J, Elsaesser J, Morbitzer R, Lahaye T . Programmable DNA-binding proteins from Burkholderia provide a fresh perspective on the TALE-like repeat domain . Nucleic Acids Research . 42 . 11 . 7436–49 . June 2014 . 24792163 . 4066763 . 10.1093/nar/gku329 .
  22. Juillerat A, Bertonati C, Dubois G, Guyot V, Thomas S, Valton J, Beurdeley M, Silva GH, Daboussi F, Duchateau P . BurrH: a new modular DNA binding protein for genome engineering . Scientific Reports . 4 . 3831 . January 2014 . 24452192 . 5379180 . 10.1038/srep03831 . 2014NatSR...4E3831J .
  23. Partida-Martinez LP, Bandemer S, Rüchel R, Dannaoui E, Hertweck C . Lack of evidence of endosymbiotic toxin-producing bacteria in clinical Rhizopus isolates . Mycoses . 51 . 3 . 266–9 . May 2008 . 18399908 . 10.1111/j.1439-0507.2007.01477.x . 35688902 .
  24. Yooseph S, Sutton G, Rusch DB, Halpern AL, Williamson SJ, Remington K, Eisen JA, Heidelberg KB, Manning G, Li W, Jaroszewski L, Cieplak P, Miller CS, Li H, Mashiyama ST, Joachimiak MP, van Belle C, Chandonia JM, Soergel DA, Zhai Y, Natarajan K, Lee S, Raphael BJ, Bafna V, Friedman R, Brenner SE, Godzik A, Eisenberg D, Dixon JE, Taylor SS, Strausberg RL, Frazier M, Venter JC . The Sorcerer II Global Ocean Sampling expedition: expanding the universe of protein families . PLOS Biology . 5 . 3 . e16 . March 2007 . 17355171 . 1821046 . 10.1371/journal.pbio.0050016 . free .
  25. de Lange O, Wolf C, Thiel P, Krüger J, Kleusch C, Kohlbacher O, Lahaye T . DNA-binding proteins from marine bacteria expand the known sequence diversity of TALE-like repeats . Nucleic Acids Research . 43 . 20 . 10065–80 . November 2015 . 26481363 . 4787788 . 10.1093/nar/gkv1053 .
  26. Pfam alignment: PF03377 metagenome (auto-generated match) . 28 May 2019 .
  27. Bertonati C, Duchateau P, Juillerat A, Silva G, Valton J . WO2014018601A2 New modular base-specific nucleic acid binding domains from burkholderia rhizoxinica proteins . Google Patents . 24 July 2013 .