Signal peptide explained

Symbol:	N/A

Opm Family:	256
Opm Protein:	1skh

A signal peptide (sometimes referred to as signal sequence, targeting signal, localization signal, localization sequence, transit peptide, leader sequence or leader peptide) is a short peptide (usually 16-30 amino acids long)^[1] present at the N-terminus (or occasionally nonclassically at the C-terminus^[2] or internally) of most newly synthesized proteins that are destined toward the secretory pathway.^[3] These proteins include those that reside either inside certain organelles (the endoplasmic reticulum, Golgi or endosomes), secreted from the cell, or inserted into most cellular membranes. Although most type I membrane-bound proteins have signal peptides, most type II and multi-spanning membrane-bound proteins are targeted to the secretory pathway by their first transmembrane domain, which biochemically resembles a signal sequence except that it is not cleaved. They are a kind of target peptide.

Function (translocation)

Signal peptides function to prompt a cell to translocate the protein, usually to the cellular membrane. In prokaryotes, signal peptides direct the newly synthesized protein to the SecYEG protein-conducting channel, which is present in the plasma membrane. A homologous system exists in eukaryotes, where the signal peptide directs the newly synthesized protein to the Sec61 channel, which shares structural and sequence homology with SecYEG, but is present in the endoplasmic reticulum.^[4] Both the SecYEG and Sec61 channels are commonly referred to as the translocon, and transit through this channel is known as translocation. While secreted proteins are threaded through the channel, transmembrane domains may diffuse across a lateral gate in the translocon to partition into the surrounding membrane.

Structure

The core of the signal peptide contains a long stretch of hydrophobic amino acids (about 5–16 residues long)^[5] that has a tendency to form a single alpha-helix and is also referred to as the "h-region". In addition, many signal peptides begin with a short positively charged stretch of amino acids, which may help to enforce proper topology of the polypeptide during translocation by what is known as the positive-inside rule.^[6] Because of its close location to the N-terminus it is called the "n-region". At the end of the signal peptide there is typically a stretch of amino acids that is recognized and cleaved by signal peptidase and therefore named cleavage site. This cleavage site is absent from transmembrane-domains that serve as signal peptides, which are sometimes referred to as signal anchor sequences. Signal peptidase may cleave either during or after completion of translocation to generate a free signal peptide and a mature protein. The free signal peptides are then digested by specific proteases.Moreover, different target locations are aimed by different types of signal peptides. For example, the structure of a target peptide aiming for the mitochondrial environment differs in terms of length and shows an alternating pattern of small positively charged and hydrophobic stretches. Nucleus aiming signal peptides can be found at both the N-terminus and the C-terminus of a protein and are in most cases retained in the mature protein.

It is possible to determine the amino acid sequence of the N-terminal signal peptide by Edman degradation, a cyclic procedure that cleaves off the amino acids one at a time.^[7] ^[8]

Co-translational versus post-translational translocation

In both prokaryotes and eukaryotes signal sequences may act co-translationally or post-translationally.

The co-translational pathway is initiated when the signal peptide emerges from the ribosome and is recognized by the signal-recognition particle (SRP).^[9] SRP then halts further translation (translational arrest only occurs in Eukaryotes) and directs the signal sequence-ribosome-mRNA complex to the SRP receptor, which is present on the surface of either the plasma membrane (in prokaryotes) or the ER (in eukaryotes).^[10] Once membrane-targeting is completed, the signal sequence is inserted into the translocon. Ribosomes are then physically docked onto the cytoplasmic face of the translocon and protein synthesis resumes.^[11]

The post-translational pathway is initiated after protein synthesis is completed. In prokaryotes, the signal sequence of post-translational substrates is recognized by the SecB chaperone protein that transfers the protein to the SecA ATPase, which in turn pumps the protein through the translocon. Although post-translational translocation is known to occur in eukaryotes, it is poorly understood. It is known that in yeast post-translational translocation requires the translocon and two additional membrane-bound proteins, Sec62 and Sec63.^[12]

Secretion efficiency determination

Signal peptides are extremely heterogeneous, many prokaryotic and eukaryotic ones are functionally interchangeable within or between species and all determine protein secretion efficiency.^[13] ^[14] ^[15]

Nucleotide level features

In vertebrates, the region of the mRNA that codes for the signal peptide (i.e. the signal sequence coding region, or SSCR) can function as an RNA element with specific activities. SSCRs promote nuclear mRNA export and the proper localization to the surface of the endoplasmic reticulum. In addition SSCRs have specific sequence features: they have low adenine-content, are enriched in certain motifs, and tend to be present in the first exon at a frequency that is higher than expected.^[16] ^[17]

Alternate secretion mechanisms

Proteins without signal peptides can also be secreted by unconventional mechanisms. E.g. Interleukin, Galectin.^[18] The process by which such secretory proteins gain access to the cell exterior is termed unconventional protein secretion (UPS). In plants, even 50% of secreted proteins can be UPS dependent.^[19]

Nonclassical sequences

Signal peptides are usually located at the N-terminus of proteins. Some have C-terminal or internal signal peptides (examples: peroxisomal targeting signal and nuclear localisation signal). The structure of these nonclassical signal peptides differs vastly from the N-terminal signal peptides.

Nomenclature

Signal peptides are not to be confused with the leader peptides sometimes encoded by leader mRNA, although both are sometimes ambiguously referred to as "leader peptides." These other leader peptides are short polypeptides that do not function in protein localization, but instead may regulate transcription or translation of the main protein, and are not part of the final protein sequence. This type of leader peptide primarily refers to a form of gene regulation found in bacteria, although a similar mechanism is used to regulate eukaryotic genes, which is referred to as uORFs (upstream open reading frames).

References

Book: Post-Targeting Functions of Signal Peptides. Kapp. Katja. Schrempf. Sabrina. Lemberg. Marius K.. Dobberstein. Bernhard. 2013-01-01. Landes Bioscience. en.
Owji . Hajar . Nezafat . Navid . Negahdaripour . Manica . Hajiebrahimi . Ali . Ghasemi . Younes . A comprehensive review of signal peptides: Structure, roles, and applications . European Journal of Cell Biology . August 2018 . 97 . 6 . 422–441 . 10.1016/j.ejcb.2018.06.003. 29958716 . 49612506 .
Blobel G, Dobberstein B . Transfer of proteins across membranes. I. Presence of proteolytically processed and unprocessed nascent immunoglobulin light chains on membrane-bound ribosomes of murine myeloma . The Journal of Cell Biology . 67 . 3 . 835–51 . December 1975 . 811671 . 2111658 . 10.1083/jcb.67.3.835 .
Rapoport TA . Protein translocation across the eukaryotic endoplasmic reticulum and bacterial plasma membranes . Nature . 450 . 7170 . 663–9 . November 2007 . 18046402 . 10.1038/nature06384 . 2007Natur.450..663R . 2497138 .
Käll L, Krogh A, Sonnhammer EL . A combined transmembrane topology and signal peptide prediction method . Journal of Molecular Biology . 338 . 5 . 1027–36 . May 2004 . 15111065 . 10.1016/j.jmb.2004.03.016 .
von Heijne G, Gavel Y . Topogenic signals in integral membrane proteins . European Journal of Biochemistry . 174 . 4 . 671–8 . July 1988 . 3134198 . 10.1111/j.1432-1033.1988.tb14150.x . free . Gunnar von Heijne .
News: 26.6 Peptide Sequencing: The Edman Degradation. 2015-08-26. Chemistry LibreTexts. 2018-09-27. en-US.
Web site: N-terminal sequencing service - Edman degradation. www.alphalyse.com. en-US. 2018-09-27.
Walter P, Ibrahimi I, Blobel G . Translocation of proteins across the endoplasmic reticulum. I. Signal recognition protein (SRP) binds to in-vitro-assembled polysomes synthesizing secretory protein . The Journal of Cell Biology . 91 . 2 Pt 1 . 545–50 . November 1981 . 7309795 . 2111968 . 10.1083/jcb.91.2.545 .
Gilmore R, Blobel G, Walter P . Protein translocation across the endoplasmic reticulum. I. Detection in the microsomal membrane of a receptor for the signal recognition particle . The Journal of Cell Biology . 95 . 2 Pt 1 . 463–9 . November 1982 . 6292235 . 2112970 . 10.1083/jcb.95.2.463 .
Görlich D, Prehn S, Hartmann E, Kalies KU, Rapoport TA . A mammalian homolog of SEC61p and SECYp is associated with ribosomes and nascent polypeptides during translocation . Cell . 71 . 3 . 489–503 . October 1992 . 1423609 . 10.1016/0092-8674(92)90517-G . 19078317 .
Panzner S, Dreier L, Hartmann E, Kostka S, Rapoport TA . Posttranslational protein transport in yeast reconstituted with a purified complex of Sec proteins and Kar2p . Cell . 81 . 4 . 561–70 . May 1995 . 7758110 . 10.1016/0092-8674(95)90077-2 . 14398668 . free .
Kober L, Zehe C, Bode J . Optimized signal peptides for the development of high expressing CHO cell lines . Biotechnology and Bioengineering . 110 . 4 . 1164–73 . April 2013 . 23124363 . 10.1002/bit.24776 . 449870 .
von Heijne G . Signal sequences. The limits of variation . Journal of Molecular Biology . 184 . 1 . 99–105 . July 1985 . 4032478 . 10.1016/0022-2836(85)90046-4 . Gunnar von Heijne .
Molino JV, de Carvalho JC, Mayfield SP . Comparison of secretory signal peptides for heterologous protein expression in microalgae: Expanding the secretion portfolio for Chlamydomonas reinhardtii . PLOS ONE . 13 . 2 . e0192433 . 2018-02-06 . 29408937 . 5800701 . 10.1371/journal.pone.0192433 . 2018PLoSO..1392433M . free .
Palazzo AF, Springer M, Shibata Y, Lee CS, Dias AP, Rapoport TA . The signal sequence coding region promotes nuclear export of mRNA . PLOS Biology . 5 . 12 . e322 . December 2007 . 18052610 . 2100149 . 10.1371/journal.pbio.0050322 . free .
Cenik C, Chua HN, Zhang H, Tarnawsky SP, Akef A, Derti A, Tasan M, Moore MJ, Palazzo AF, Roth FP . 6 . Genome analysis reveals interplay between 5'UTR introns and nuclear mRNA export for secretory and mitochondrial genes . PLOS Genetics . 7 . 4 . e1001366 . April 2011 . 21533221 . 3077370 . 10.1371/journal.pgen.1001366 . Snyder . Michael . free .
Nickel W, Seedorf M . Unconventional mechanisms of protein transport to the cell surface of eukaryotic cells . Annual Review of Cell and Developmental Biology . 24 . 287–308 . 2008 . 18590485 . 10.1146/annurev.cellbio.24.110707.175320 .
Agrawal GK, Jwa NS, Lebrun MH, Job D, Rakwal R . Plant secretome: unlocking secrets of the secreted proteins . Proteomics . 10 . 4 . 799–827 . February 2010 . 19953550 . 10.1002/pmic.200900514 . 20647387 .

External links

- SPdb (Signal Peptide DataBase)
SignalP — predicts the presence and location of signal peptide cleavage sites in amino acid sequences from different organisms.