The Rossmann fold is a tertiary fold found in proteins that bind nucleotides, such as enzyme cofactors FAD, NAD+, and NADP+. This fold is composed of alternating beta strands and alpha helical segments where the beta strands are hydrogen bonded to each other forming an extended beta sheet and the alpha helices surround both faces of the sheet to produce a three-layered sandwich. The classical Rossmann fold contains six beta strands whereas Rossmann-like folds, sometimes referred to as Rossmannoid folds, contain only five strands. The initial beta-alpha-beta (bab) fold is the most conserved segment of the Rossmann fold.[1] The motif is named after Michael Rossmann who first noticed this structural motif in the enzyme lactate dehydrogenase in 1970 and who later observed that this was a frequently occurring motif in nucleotide binding proteins.[2]
Rossmann and Rossmannoid fold proteins are extremely common. They make up 20% of proteins with known structures in the Protein Data Bank, and are found in more than 38% of KEGG metabolic pathways. The fold is extremely versatile in that it can accommodate a wide range of ligands. They can function as metabolic enzymes, DNA/RNA binding, and regulatory proteins in addition to the traditional role.
The Rossmann fold was first described by Dr. Michael Rossmann and coworkers in 1974. He was the first to deduce the structure of lactate dehydrogenase and characterized the structural motif within this enzyme which would later be called the Rossmann fold. It was subsequently found that most dehydrogenases that utilize NAD or NADP contain this same structurally conserved Rossmann fold motif.[3] [4]
In 1989, Israel Hanukoglu from the Weizmann Institute of Science discovered that the consensus sequence for NADP+ binding site in some enzymes that utilize NADP+ differs from the NAD+ binding motif.[5] This discovery was used to re-engineer coenzyme specificities of enzymes.[6]
The Rossmann fold is composed of six parallel beta strands that form an extended beta sheet. The first three strands are connected by α- helices resulting in a beta-alpha-beta-alpha-beta structure. This pattern is duplicated once to produce an inverted tandem repeat containing six strands. Overall, the strands are arranged in the order of 321456 (1 = N-terminal, 6 = C-terminal).[7] Five stranded Rossmann-like folds are arranged in the order 32145.[8] The overall tertiary structure of the fold resembles a three-layered sandwich wherein the filling is composed of an extended beta sheet and the two slices of bread are formed by the connecting parallel alpha-helices.
One of the features of the Rossmann fold is its co-factor binding specificity. Through the analysis of four NADH-binding enzymes, it was found that in all four enzymes the nucleotide co-factor entailed the same conformation and orientation with respect to the polypeptide chain.[1]
The fold may contain additional strands joined by short helices or coils. The most conserved segment of Rossmann folds is the first beta-alpha-beta segment. Phosphate-binding loop is located between the first beta-strand and alpha-helix. On the tip of the second beta-strand, there is a conserved aspartate residue that is involved in ribose binding.[9] Since this segment is in contact with the ADP portion of dinucleotides such as FAD, NAD and NADP it is also called as an "ADP-binding beta-beta fold.
The function of the Rossmann fold in enzymes is to bind nucleotide cofactors. It also often contributes to substrate binding.
Metabolic enzymes normally have one specific function, and in the case of UDP-glucose 6-dehydrogenase, the primary function is to catalyze the two step NAD(+)-dependent oxidation of UDP-glucose into UDP-glucuronic acid.[10] The N- and C-terminal domains of UgdG share structural features with ancient mitochondrial ribonucleases named MAR. MARs are present in lower eukaryotic microorganisms, have a Rossmannoid-fold and belong to the isochorismatase superfamily. This observation reinforces that the Rossmann structural motifs found in NAD(+)-dependent dehydrogenases can have a dual function working as a nucleotide cofactor binding domain and as a ribonuclease.
The evolutionary relationship between the Rossmann fold and Rossmann-like folds is unclear. These folds are referred to as Rossmannoids. It has been hypothesized that all these folds, including a Rossmann fold originated from a single common ancestral fold, that had nucleotide binding capabilities, in addition to non-specific catalytic activity.
However, an analysis of the PDB finds evidence of convergent evolution[11] with 156 separate H-groups of demonstrable homology, from which 123 X-groups of probable homology can be found. The groups have been integrated into ECOD.[12]
Phylogenetic analysis of the NADP binding enzyme adrenodoxin reductase revealed that from prokaryotes, through metazoa and up to primates the sequence motif difference from that of most FAD and NAD-binding sites is strictly conserved.[13]
In many articles and textbooks, a Rossmann fold is defined as a strict repeated series of βαβ structure. Yet, comprehensive examination of the Rossmann folds in many NAD(P) and FAD binding sites revealed that only the first βα structure is strictly conserved. In some enzymes, there may be many loops and several helices (i.e., not a single helix) between the beta strands that form the beta-sheet. These enzymes have a common origin indicated by conserved sequence and structural features, according to Hanukoglu.
The result by Hanukoglu (2017) is corroborated by Medvedev et al. (2020), in the form of an ECOD "H-group" called "Rossmann-related". Even within this group, ECOD describes a wide range of non-nucleotide activities.[12]