A comparative study of ribosomal proteins: linkage between amino acid distribution and ribosomal assembly
© Lott et al.; licensee BioMed Central Ltd. 2013
Received: 22 March 2013
Accepted: 17 October 2013
Published: 23 October 2013
Assembly of the ribosome from its protein and RNA constituents must occur quickly and efficiently in order to synthesize the proteins necessary for all cellular activity. Since the early 1960’s, certain characteristics of possible assembly pathways have been elucidated, yet the mechanisms that govern the precise recognition events remain unclear.
We utilize a comparative analysis to investigate the amino acid composition of ribosomal proteins (r-proteins) with respect to their role in the assembly process. We compared small subunit (30S) r-protein sequences to those of other housekeeping proteins from 560 bacterial species and searched for correlations between r-protein amino acid content and factors such as assembly binding order, environmental growth temperature, protein size, and contact with ribosomal RNA (rRNA) in the 30S complex.
We find r-proteins have a significantly high percent of positive residues, which are highly represented at rRNA contact sites. An inverse correlation between the percent of positive residues and r-protein size was identified and is mainly due to the content of Lysine residues, rather than Arginine. Nearly all r-proteins carry a net positive charge, but no statistical correlation between the net charge and the binding order was detected. Thermophilic (high-temperature) r-proteins contain increased Arginine, Isoleucine, and Tyrosine, and decreased Serine and Threonine compared to mesophilic (lower-temperature), reflecting a known distinction between thermophiles and mesophiles, possibly to account for protein thermostability. However, this difference in amino acid content does not extend to rRNA contact sites, as the proportions of thermophilic and mesophilic contact residues are not significantly different.
Given the significantly higher level of positively charged residues in r-proteins and at contact sites, we conclude that ribosome assembly relies heavily on an electrostatic component of interaction. However, the binding order of r-proteins in assembly does not appear to depend on these electrostatics interactions. Additionally, because thermophiles and mesophiles exhibit significantly different amino acid compositions in their sequences but not in the identities of contact sites, we conclude that this electrostatic component of interaction is insensitive to temperature and is not the determining factor differentiating the temperature sensitivity of ribosome assembly.
Ribosomes are the transient macromolecular machines that synthesize proteins in all living organisms. They are composed of ribosomal RNA (rRNA) and ribosomal proteins (r-proteins), which self-assemble into functional units. The bacterial ribosome is made of two asymmetrical subunits: the larger 50S and the smaller 30S. This study focuses on the assembly of the 30S subunit. The efficient and accurate self-assembly of the ribosome in vivo is essential because new ribosomes and proteins must be produced in order for cells to grow. It is estimated that approximately 60% of all cellular transcriptional activities have been attributed to the synthesis of rRNA in a rapidly growing cell  and 40% of the total energy of an Escherichia Coli cell is directed toward the synthesis of proteins . Assembly has been studied extensively, both computationally and experimentally, and is known to require the orchestration of both rRNA folding and r-protein binding. Previous investigations provide evidence of an ordered, cooperative protein binding/RNA folding assembly mechanism [3–5], conserved structures and sequences [6–11], and the employment of electrostatics interactions [12–14]. A detailed assembly map describing the sequential and interdependent binding of r-proteins  classified r-proteins as primary, secondary, and tertiary binders, depending on their ability to bind to 16S rRNA: primary proteins bind to bare rRNA, secondary proteins can bind to 16S rRNA after at least one primary protein has already bound, and tertiary proteins require at least one primary and one secondary protein . Additionally, r-proteins were named S1, S2, S3, etc., in the general order of decreasing size; that is, S1 is the largest ribosomal protein and S21 the smallest [16, 17].
Because r-proteins strongly interact with negatively charged rRNA to form a functional complex, one might expect that r-proteins exhibit characteristic amino acid composition and distribution within the protein structures that reflect their electrostatic interactions. For instance, it is known that r-proteins generally carry net positive charges [13, 14], and we previously analyzed the crystal structures of two bacterial ribosomes and found that most E. coli and Thermus thermophilus r-proteins not only carry net positive charges, but their percentages of positively charged residues are actually above the average expected for a typical protein . We also demonstrated that these positively charged residues tend to be concentrated in areas of the protein that are in contact with rRNA. These observations are consistent with the hypothesis that positively charged residues facilitate and stabilize r-protein binding to the negatively charged rRNA. Because these studies encompassed such a small portion of the bacterial kingdom, the investigation of r-proteins from a large number of species is needed to more definitively describe the nature of this trend. To date, however, large-scale analyses comparing the ribosomal components from many species have focused on the use of rRNA, r-proteins, or ribosomal DNA to determine species relatedness or construct phylogenetic trees [18–21] rather than attempting to shed light on the universal mechanisms of ribosome assembly.
Temperature has profound effects on the rates of biological reactions and the structures of molecules, including proteins. Because the structure and function of a protein are ultimately controlled by its makeup of amino acids, one would expect proteins from thermophilic species to have different amino acid composition from those of mesophilic species. In accordance, several large-scale thermostability studies have detected differences in protein residues, such as thermophiles exhibiting an increased occurrence of charged residues, decreased incidence of polar and uncharged residues, a reduction in hydrophobic surface of the protein, larger numbers of hydrogen bonds, ion pairs, and disulfide bridges or hydrophobic and aromatic interactions, an increased protein compactness, and changes in surface charge distribution and helix dipole stabilization [22–30]. While the majority of these previous protein thermostability analyses have focused primarily on non-ribosomal protein samples, one  mentioned that the trends were not significantly changed when r-proteins were excluded from analysis. Some studies have focused on ribosomal components in light of thermal adaptation, identifying a positive correlation between the guanine and cytosine content in rRNA genes and the species growth temperature , and demonstrating that the binding affinity of r-protein S8 with its rRNA binding site increases with growth temperature among related bacterial species . Additionally, it has been shown [33, 34] that subunits from a thermophilic Archaea can form functionally active hybrids with eukaryotic yeast subunits (i.e. the small subunit from one species and the large from another), whereas no such particles formed between the subunits from a mesophile and yeast, suggesting that there is at least some structural similarity between ribosomes from thermophilic bacteria and eukaryotic species. One study  compared the stability of the entire ribosome structure in mesophiles and thermophiles, showing that thermophilic ribosomes are generally nonfunctional at low temperatures and hypothesizing that thermophilic ribosomes might be prohibitively rigid at low temperatures in order to be functionally flexible at their optimal growth temperatures. This is in agreement with a report from “melting” and unfolding studies, indicating thermophilic ribosomes are more “durable” than those isolated from mesophiles . Similarly, it has been shown that the individual components of a thermophilic ribosome are less stable than the completely assembled ribosome . In our previous study , we observed that r-proteins of the thermophilic T. thermophilus generally have higher net positive charges than those of mesophilic E. coli, possibly implicating differing roles of certain amino acids in the structure or function of thermophilic and mesophilic r-proteins. While these thermostability studies have enriched the current understanding of ribosome structures and temperature-sensitive characteristics in a variety of species, details regarding the contributions of individual amino acids to the ribosome’s accurate self-assembly mechanisms and the factors that differentiate species’ ability to create thermostable complexes within certain temperature ranges remain uncertain.
In the current study, we extend our previous work to include 560 different bacterial species (listed in Additional file 1) to test whether the reported trends hold for prokaryotes in general. For this purpose, we employ a comparative approach where association is tested between the average occurrence of each amino acid and the members of two categories of house-keeping bacterial proteins: ribosomal proteins and non-ribosomal proteins. Additionally, we compare r-protein sequences from mesophilic and thermophilic species to examine how amino acid composition and distribution might affect ribosome assembly at differing environmental temperatures.
Results and discussion
R-proteins contain higher levels of positively charged residues than other soluble protein families
Figure 1B shows the magnitude and direction of the significant differences in the amino acid distributions for the two samples of proteins, represented by their t-test values. The height of the bar represents the relative difference in the sample means and its direction indicates which protein sample contains the larger proportion of that residue. Positive T-test values indicate a higher proportion of that residue was found in the non-ribosomal sample, whereas negative values correspond to a higher percentage in r-proteins. It is well documented that ribosomal proteins contain high levels of these positively charged residues, and the marked difference shown here clearly implicates an important electrostatics feature of r-proteins in contrast to proteins whose functions do not rely heavily on charge-charge interactions [12–14]. This result solidifies our earlier observation that ribosomal proteins have higher proportions of positively charged residues and that the assembly between ribosomal proteins and rRNA includes an important electrostatic component, a notion that has also been suggested by other studies . It is evident that these amino acids play an important role in the assembly process, attracting positively charged r-proteins to negatively charged rRNA across possibly long distances to initiate the assembly process. While this line of reasoning is not novel, the overwhelming significance of positively charged residue content indicates our amino acid composition database imparts a rational view of r-protein make-up, and provides the foundation for the rest of the current study. This observation prompted further investigation into the large database of r-protein sequences, particularly with regard to the roles of these amino acids in the electrostatics component of ribosome assembly.
Because increased temperature is known to denature and destabilize biological molecules, yet thermophilic bacteria synthesize and assemble ribosome components that maintain functionality at consistently high environmental temperatures [36, 37], we analyzed r-protein amino acid composition to test whether the amino acid make-up plays a role in the thermostability of the r-proteins. To this end, we utilized a comparative approach where association was tested between the growth temperature preferences of a large number of thermophilic and mesophilic bacterial species and the proportion of each amino acid in the r-protein sequences, specifically focusing on amino acid compositional differences associated with thermophilicity. We obtained three types of information for the 560 species in our database: growth temperature preference data, 30S ribosomal protein sequences from at least one r-protein, and 16S ribosomal DNA sequences (to determine species relatedness). The vast majority consisted of mesophiles and only 40 were identified as thermophiles. Phylogenetic analysis of these species indicated that thermophiles are not evenly distributed in the bacterial phylogenetic tree: they tended to cluster in several branches, especially in the orders Aquificales, Thermoanaerobacterales, and Thermotogales (Additional file 1).
The phylogenetic clustering of thermophiles in our sample necessitated us to employ a method to control for the phylogenetic dependence and avoid bias when assessing the association between growth temperature preference and ribosomal amino acid composition. Because closely related samples are expected to show similar traits such as amino acid composition and growth temperature preference, a significant association can simply be a result of phylogenetic relatedness rather than adaptation to similar environmental conditions. To circumvent this problem, we applied Phylogenetic Independent Contrast (PIC [39, 40]), which assesses the statistical significance of correlations between variables while controlling for the phylogenetic relatedness among samples. In this way, a significant correlation implies that the differences in amino acid composition between thermophiles and mesophiles are due to adaptation to different temperature environments and not due to mere species relatedness. It should be noted, however, that PIC is conservative, because it fails to detect significant adaptive changes that accompany significant phylogenetic dependence.
Positively charged residues correlate with protein size but not binding order
To determine whether increased temperature affects the relative proportions of amino acids in bacterial r-proteins regarding binding order, we analyzed the amino acid compositions according to species optimal growth temperature (see Methods). For positive residues (Figure 3A), all r-proteins except S11 showed higher mean percent residues in thermophiles than mesophiles, whereas for polar residues (Panel A in Additional file 5), all thermophilic proteins showed lower mean percent residues than their mesophilic counterparts. This suggests that the preference of positive residues at the expense of polar residues among thermophiles applies nearly universally to all r-proteins of the 30S subunit, as has also been evidenced in other protein families . However, only some r-proteins, including all primary binding proteins, tended to show statistically significant differences between the two temperature-based groups for positive residues (Figure 3A). Few proteins showed statistical differences for other categories, according to no discernible pattern (Additional file 5; see Additional file 6 for summaries of statistical test results). These trends suggest that thermophiles tend to prefer positive residues and avoid polar residues across all r-proteins, and this trend is somewhat pronounced for primary binding proteins. Average net charges of individual r-proteins of thermophilic species are higher than mesophilic, except for S2, but only three proteins (S14, S17, and S20) show differences that are statistically significant according to PIC analysis (Figure 3B; see Additional file 7 for statistics).
R-protein RNA contact sites are enriched with positively charged residues
where L is the length of the protein (total number of residues), C is the estimated number of residues in contact with RNA in the fully assembled 30S subunit (see Methods), Rt is the total number of residues of a specific type (e.g. Alanine (A) or Serine (S)), and Rc is the number of contact residues of said type. CEF is closely related to the proportions of contact residues already reported (the numerator, Rc/C, is the proportion of each residue as a contact, as described above), but CEF is not a redundant calculation, as it gives a broader measure of the role each amino acid plays in r-proteins. By comparing the fraction of a particular amino acid as a contact residue to its proportion in the total protein, CEF describes the distribution of each amino acid throughout the protein, revealing how often each residue is used as a contact site as a function of how often it occurs in the protein. Thus, a CEF value of 1 indicates that the residue under investigation appears at contact sites in the same percentage as it appears in the overall sequence, whereas CEF>1 implies that the residue has a high occurrence at the RNA contact interface for the proportion of that residue in the full protein.
We calculated CEF values of the r-proteins in all 560 species (Figure 5B). One-sample t-tests revealed that CEF values significantly deviated from one (two-tailed p < 0.01) for all the amino acids except for glycine, indicating that the distribution of amino acids in r-proteins is influenced by the interaction with rRNA. The results revealed that the mean contact enrichment factors were greater than 1 for positively charged residues and polar residues excluding Cysteine (C). CEF values were less than 1 for negatively charged and non-polar residues. These observations indicate that contact sites are generally enriched with positive and polar residues, which can form charge-charge or hydrogen bonding interactions, but are deficient of negative and non-polar residues, which might produce energetically unfavorable interactions with the rRNA. Contact enrichment factors for aromatic residues, which could participate in base-stacking with the rRNA nucleotides, were split: Phenylalanine (F) CEF was less than 1, whereas CEF for Tryptophan (W) and Tyrosine (Y) were greater than 1. It is worth noting that W and Y are both capable of hydrogen bonding, which could explain their preference at contact sites, but F is completely hydrophobic and is often found buried inside water-soluble proteins.
For the five amino acid chemical categories, the CEF for positively charged residues is the greatest, followed by polar residues, and those for negatively charged and nonpolar are lowest. This demonstrates that protein residues that contact rRNA tend to (1) carry a formal positive charge or contain a polar side chain and (2) avoid negatively charged or nonpolar residues. Therefore, not only do r-proteins contain a higher level of positively charged residues than non-ribosomal proteins, these residues are concentrated at rRNA contact sites. These general patterns reflect the role of positively charged regions of r-proteins in associating with the negatively charged rRNA during ribosomal assembly.
To test whether r-protein-rRNA interaction is different between mesophiles and thermophiles due to their differing overall amino acid compositions (as seen in Figure 2), we compared the CEF values between the two groups (Figure 5B). PIC indicated that most of those differences are not statistically significant (p > 0.01, Student’s t-test and sign test) except for Glutamic Acid (Glu, E; Figure 5B; see Additional file 11 for CEF statistical tests), which occurs at contact sites in one of the lowest proportions for both mesophiles and thermophiles (mean CEF = 0.43 and 0.37 for mesophiles and thermophiles, respectively), but is nonetheless statistically more common at mesophilic contact sites than thermophilic. Glu is not found in significantly different amounts in the overall composition of mesophilic and thermophilic r-proteins, and further investigation into Glu’s roles in the assembly process or thermostability in general might better explain this observation. The combination of significant thermostability-related differences in amino acid compositions (increased R, I, Y and decreased S, T for thermophiles) with no significant difference in the distribution of those amino acids at r-protein contact sites supports the understanding that the electrostatics component of ribosome assembly is not dependent on temperature, because the identity of thermophilic contact sites is statistically no different than that of mesophilic sites. This seems reasonable because other molecular interactions such as hydrogen bonding and hydrophobic interactions are sensitive to temperature, but the electrostatic interaction itself is independent of temperature, which likely explains why we observed similar amino acid residue distributions at the r-protein contact sites in mesophiles and thermophiles.
Utilizing a comparative approach to analyze a large database of r-protein sequences has identified a number of important associations between the amino acid composition of r-proteins and their function in ribosomal assembly. We found that r-proteins have a significantly higher content of positively charged residues than do non-ribosomal proteins (10% for Arginine and 11% for Lysine in r-proteins, versus 4.7% and 5.9%, respectively, in non-ribosomal proteins), which agrees with previous analyses of r-protein charges. More specifically, these two residues are also highly represented at contact sites along the protein/RNA interface (contact enrichment factor (CEF) > 1) for all species in the study, alluding to the significance of electrostatic interaction in ribosome assembly. These results agree with and improve our previous r-protein study by statistically extending the same trends across a large sample of bacteria. Interestingly, we found that the percentage of Lysine residues generally increases with decreasing r-protein size, but the same correlation is not found with Arginine, despite its similar positively charged side chain. Taken together, these results corroborate the heavy emphasis on electrostatic interactions in the assembly mechanism of the ribosome. However, association between r-protein binding order (primary, secondary, and tertiary) was not detected for the proportion of positively charged residues (or Lys or Arg alone) or for net protein charge. This leads to the conclusion that the order in which r-proteins bind to their binding sites during assembly is probably not determined by the electrostatics interactions between r-proteins and rRNA. Although the assembly between r-proteins with rRNA involves an overwhelmingly significant portion of electrostatic interaction, this interaction alone does not govern the assembly order.
The thermostability aspect of the study, performed by comparing amino acid compositions and distributions between species with high and low preferred growth temperature, revealed two noteworthy characteristics of 30S ribosomal proteins. First, we found that thermophiles show increased R, I, and Y content, whereas mesophiles have increased proportions of S and T, trends that are generally consistent with previously reported distinctions between thermophilic and mesophilic amino acid compositions . Second, while these differences in overall make-up are significant, they do not extend to the predicted contact sites in thermophilic and mesophilic r-proteins. That is, the proportions of residues at contact sites are generally not significantly different between the two groups. Whereas the percent compositions of amino acids relating to qualities such as thermostability and protein folding are expected to vary with environmental temperature, our results indicate that the distributions of residues in contact with rRNA are comparable for all bacterial species. If the regions of r-proteins that contact rRNA in the fully assembled ribosome are considered “active sites” for the assembly process, it follows that they should be as highly conserved as the ribosome and its function themselves. In accordance, from the results of the current study, we conclude that the electrostatics component of ribosome assembly, while it is not the only interaction involved during assembly, is an important attraction between r-proteins and rRNA, but this component of interaction is insensitive to the temperature. The latter conclusion is reasonable because the electrostatics interaction itself does not depend on temperature.
Therefore, we conclude from our statistical analysis: binding order does not appear to depend on the amount of electrostatic attraction experienced by primary binders versus secondary or tertiary binders, and the electrostatics interactions of ribosome assembly do not seem to control the discrepancy between mesophilic temperature-sensitive and thermophilic high-temperature-stable constructs. The particular molecular factors that govern the timing and order of r-proteins binding with rRNA and that contribute to the temperature sensitivity of ribosomes assembled in species that live at different temperatures remain to be determined.
The study required three pieces of information for each of 560 bacterial species: growth temperature preference (mesophilic or thermophilic), amino acid composition data based on amino acid sequences of 30S ribosomal proteins, and 16S ribosomal DNA sequences for the phylogenetic tree construction required for PIC. We only included species with all three pieces of information publicly available. Estimates of the growth temperature preference of studied species were searched based on the species name and obtained from various sources in the public domain. Initially, species were categorized into four growth temperature preference types; cryophiles (e.g., high latitude, altitude habitats, ocean floor, < 10°C), lower mesophiles (ambient conditions, 10-35°C), upper mesophiles (e.g., mammalian body, 35-50°C), and thermophiles (e.g., deep see thermal vents, hot springs, >50°C). Examination of the distribution of amino acid composition based on these four categories indicated that the distributions of the first three categories were often similar to each other but markedly different from that of thermophiles, particularly for positive and polar residues. Therefore, we combined species in the first three categories and conducted subsequent analyses using only two categories; mesophiles (<50°C) and thermophiles (>50°C).
30S ribosomal protein sequences
Amino acid sequences for the S2-S21 30S ribosomal protein were queried and downloaded from Genbank (http://www.ncbi.nlm.nih.gov/) using the search term “30S ribosomal protein”. Protein S1 was excluded from the analysis, as in many other 30S ribosomal protein studies, because it binds relatively weakly to the 30S complex and exchanges very rapidly during protein assembly . The queried sequences were aligned using the T-coffee multiple alignment program  (http://www.tcoffee.org/Projects_home_page/t_coffee_home_page.html) using default alignment settings. We filtered out potentially spurious sequences that 1) were unusually short or long and 2) had unusually low T-coffee alignment scores, which might indicate poor sequence quality or incorrect genes. When multiple sequences from the same species were available, we chose the one with the highest alignment score. Gaps and missing sequences were ignored in the subsequent analyses.
Non-ribosomal protein sequences
To compare the amino acid proportions of ribosomal and non-ribosomal proteins, we analyzed protein sequences of 15 house-keeping protein families that are functionally well-defined and distinct from each other: adenylate kinase, carbamoyltransferase, carboxypeptidase, citrate synthase, ferredoxin, glutamate dehydrogenase, glycosyltransferase, inorganic pyrophosphatase, methionine aminopeptidase, phosphofructokinase, phosphoglycerate kinase, reductase, rubredoxin, triose phosphate isomerase, xylanase. Their sequences were queried and downloaded from Genbank by using each protein name along with the name of each of the 560 species used for the ribosomal proteins analyses as search terms. (See Additional file 12 for the number of species for and Additional file 13 for a description of each protein family used in this study.) The first sequence returned in each search was used for the analyses. When no sequence was available for a given species, the species was omitted from the analysis for that protein. Student’s paired sample t-test was performed to test the equality of the amino acid distributions between ribosomal and each non-ribosomal protein.
Determination of ribosomal protein-RNA contact sites and protein net charge
The r-protein/rRNA contact sites were obtained from the E. coli [PDB: 2AVY] and T. thermophilus [PDB: 1J5E] 30S x-ray crystal structures, accessed from the Protein Data Bank . Using a code written in our own group as described in our previous r-protein study , any atom on a protein residue within 3.5Å of any atom on a 16S rRNA nucleotide is considered a contact point. A contact residue is a protein residue that makes at least one contact point with any RNA nucleotide. The identity and position of these contact residues found in the assembled 30S subunit were recorded and used for further analysis. Because the rRNA contact sites of E. coli and T. thermophilus are not always conserved, we designated rRNA contact sites of all the studied species based on the shared contact sites between these two reference species. These contact sites, therefore, should be considered conservative. Protein net charge was calculated according to the formula [(K + R) – (D + E)], where (K+R) represents the number of Lysine and Arginine residues (positively charged) and (D+E) represents the number of Aspartic Acid and Glutamic acid residues (negatively charged). All other residues are considered neutrally charged at physiological pH.
16S rDNA sequences and phylogenetic tree construction
To construct a phylogenetic tree required for PIC, we queried bacterial 16S rDNA sequences based on the species name from Greengenes database (greengenes.lbl.gov), which curates and aligns publicly available prokaryotic 16S ribosomal RNA gene sequences. Based on the sequence alignments from Greengenes, we constructed a majority-rule consensus phylogenetic tree of the studied species using MrBayes  (http://mrbayes.sourceforge.net), which uses Markov Chain Monte Carlo (MCMC) methods to estimate Bayesian inference of evolutionary relationships. We used Modeltest  to search for a nucleotide substitution model that fit our dataset and selected GTR+G (General Time Reversible with gamma-shaped rate variation among sites) with a flat Dirichlet prior probability density, evaluated based on Akaike Information Criterion (AIC).
Phylogenetic Independent Contrast (PIC)
To assess the association between growth temperature preference of bacterial species and their amino acid composition using PIC, we used the AOT module of Phylocom  (http://www.phylodiversity.net/phylocom/), incorporating the branch lengths in the Bayesian tree. Each protein contained an overlapping but different set of species sequences from other proteins. Therefore, when proteins are analyzed separately for PIC, the original phylogenetic tree was pruned using the ‘sampleprune’ module of Phylocom to filter out missing species. When a binary trait is involved in a PIC analysis (as for growth temperature preference in this study, i.e., mesophile or thermophile), AOT identifies independently contrasting tree nodes based on a combination of both the sister-taxa (ST) set and the paraphyletic (PT) set, and calculates trait correlations using these independent contrasts. Significance of independent contrasts was tested using two separate tests; t-test and sign test. In t-test, the mean and standard deviation of the contrasts were used to conduct a one-sample t-test with degree of freedom of N (number of contrasts) - 1 against the null hypothesis of mean = 0. In sign test, binomial probabilities were calculated for the number of contrasts toward one direction against the total number of contrasts.
Student’s paired-sample and one-sample t-tests, Pearson’s product–moment and Spearman’s rank correlations, Pearson’s χ2 tests, and descriptive statistics including box plots were calculated using PASW Statistics18 (IBM, New York, NY) and R (http://www.r-project.org/). Two-tailed Fisher’s exact tests were conducted using the Fisher’s exact test Excel Addin (http://www.obertfamily.com/software/fisherexact.html). Effect size of Fisher’s exact tests was estimated using the ϕ2 coefficient (ϕ2 = √ (χ2/N), where N is the number of samples).
Availability of supporting data
The data sets supporting the results of this article are included within the article and its additional files.
The authors acknowledge partial financial support provided by Oak Ridge Associated University in partner with Oak Ridge National Lab through ORAU/ORNL high performance computing grant (project BIP011) and by National Science Foundation through TN-SCORE project (grant EPS-1004083).
- Warner JR: The economics of ribosome biosynthesis in yeast. Trends Biochem Sci. 1999, 24: 437-440. 10.1016/S0968-0004(99)01460-7.View ArticleGoogle Scholar
- Wilson DN, Nierhaus KH: The weird and wonderful world of bacterial ribosome regulation. Crit Rev Biochem Mol Biol. 2007, 42: 187-219. 10.1080/10409230701360843.View ArticleGoogle Scholar
- Traub P, Nomura M: Structure and Function of Escherichia Coli Ribosomes .6. Mechanism of Assembly of 30-S Ribosomes Studied in Vitro. J Mol Biol. 1969, 40: 391-10.1016/0022-2836(69)90161-2.View ArticleGoogle Scholar
- Held WA, Ballou B, Mizushima S, Nomura M: Assembly mapping of 30 S ribosomal proteins from Escherichia coli. Further studies. Biol Chem. 1974, 249: 3103-3111.Google Scholar
- Powers T, Daubresse G, Noller HF: Dynamics of in vitro assembly of 16 S rRNA into 30 S ribosomal subunits. J Mol Biol. 1993, 232: 362-374. 10.1006/jmbi.1993.1396.View ArticleGoogle Scholar
- Mougel M, Allmang C, Eyermann F, Cachia C, Ehresmann B, Ehresmann C: Minimal 16S rRNA binding site and role of conserved nucleotides in Escherichia coli ribosomal protein S8 recognition. Eur J Biochem. 1993, 215: 787-792. 10.1111/j.1432-1033.1993.tb18093.x.View ArticleGoogle Scholar
- Talkington MW, Siuzdak G, Williamson JR: An assembly landscape for the 30S ribosomal subunit. Nature. 2005, 438: 628-632. 10.1038/nature04261.View ArticleGoogle Scholar
- Nevskaya N, Tishchenko S, Nikulin A, al-Karadaghi S, Liljas A, Ehresmann B, Ehresmann C, Garber M, Nikonov S: Crystal structure of ribosomal protein S8 from Thermus thermophilus reveals a high degree of structural conservation of a specific RNA binding site. J Mol Biol. 1998, 279: 233-244. 10.1006/jmbi.1998.1758.View ArticleGoogle Scholar
- Serganov A, Benard L, Portier C, Ennifar E, Garber M, Ehresmann B, Ehresmann C: Role of conserved nucleotides in building the 16 S rRNA binding site for ribosomal protein S15. J Mol Biol. 2001, 305: 785-803. 10.1006/jmbi.2000.4354.View ArticleGoogle Scholar
- Pagel FT, Zhao SQ, Hijazi KA, Murgola EJ: Phenotypic heterogeneity of mutational changes at a conserved nucleotide in 16 S ribosomal RNA. J Mol Biol. 1997, 267: 1113-1123. 10.1006/jmbi.1997.0943.View ArticleGoogle Scholar
- Lee EH, Hsin J, Mayans O, Schulten K: Secondary and tertiary structure elasticity of titin Z1Z2 and a titin chain model. Biophys J. 2007, 93: 1719-1735. 10.1529/biophysj.107.105528.View ArticleGoogle Scholar
- Burton B, Zimmermann MT, Jernigan RL, Wang Y: A computational investigation on the connection between dynamics properties of ribosomal proteins and ribosome assembly. Plos Comput Biol. 2012, 8: e1002530-10.1371/journal.pcbi.1002530.View ArticleGoogle Scholar
- Baker NA, Sept D, Joseph S, Holst MJ, McCammon JA: Electrostatics of nanosystems: application to microtubules and the ribosome. Proc Natl Acad Sci USA. 2001, 98: 10037-10041. 10.1073/pnas.181342398.View ArticleGoogle Scholar
- Klein DJ, Moore PB, Steitz TA: The roles of ribosomal proteins in the structure assembly, and evolution of the large ribosomal subunit. J Mol Biol. 2004, 340: 141-177. 10.1016/j.jmb.2004.03.076.View ArticleGoogle Scholar
- Culver GM: Assembly of the 30S ribosomal subunit. Biopolymers. 2003, 68: 234-249. 10.1002/bip.10221.View ArticleGoogle Scholar
- Kurland CG: Ribosome structure and function emergent. Science. 1970, 169: 1171-1177. 10.1126/science.169.3951.1171.View ArticleGoogle Scholar
- Arnold RJ, Reilly JP: Observation of Escherichia coli ribosomal proteins and their posttranslational modifications by mass spectrometry. Anal Biochem. 1999, 269: 105-112. 10.1006/abio.1998.3077.View ArticleGoogle Scholar
- Tehler A, Little DP, Farris JS: The full-length phylogenetic tree from 1551 ribosomal sequences of chitinous fungi, Fungi. Mycol Res. 2003, 107: 901-916. 10.1017/S0953756203008128.View ArticleGoogle Scholar
- Hillis DM, Dixon MT: Ribosomal DNA: molecular evolution and phylogenetic inference. Q Rev Biol. 1991, 66: 411-453. 10.1086/417338.View ArticleGoogle Scholar
- Lu Z, Zhang W: Comparative phylogenies of ribosomal proteins and the 16S rRNA gene at higher ranks of the class Actinobacteria. Curr Microbiol. 2012, 65: 1-6.View ArticleGoogle Scholar
- Schnabel G, Schnabel EL, Jones AL: Characterization of Ribosomal DNA from Venturia inaequalis and Its Phylogenetic Relationship to rDNA from Other Tree-Fruit Venturia Species. Phytopathology. 1999, 89: 100-108. 10.1094/PHYTO.1918.104.22.168.View ArticleGoogle Scholar
- Hickey D, Singer G: Genomic and proteomic adaptations to growth at high temperature. Genome Biol. 2004, 5: 117-10.1186/gb-2004-5-10-117.View ArticleGoogle Scholar
- Zhou XX, Wang YB, Pan YJ, Li WF: Differences in amino acids composition and coupling patterns between mesophilic and thermophilic proteins. Amino Acids. 2008, 34: 25-33. 10.1007/s00726-007-0589-x.View ArticleGoogle Scholar
- Singer GAC, Hickey DA: Thermophilic prokaryotes have characteristic patterns of codon usage, amino acid composition and nucleotide content. Gene. 2003, 317: 39-47.View ArticleGoogle Scholar
- Szilagyi A, Zavodszky P: Structural differences between mesophilic, moderately thermophilic and extremely thermophilic protein subunits: results of a comprehensive survey. Structure. 2000, 8: 493-504. 10.1016/S0969-2126(00)00133-7.View ArticleGoogle Scholar
- Farias ST, Bonato MC: Preferred amino acids and thermostability. Genetics and molecular research : GMR. 2003, 2: 383-393.Google Scholar
- Jaenicke R: Do ultrastable proteins from hyperthermophiles have high or low conformational rigidity?. P Natl Acad Sci USA. 2000, 97: 2962-2964. 10.1073/pnas.97.7.2962.View ArticleGoogle Scholar
- Zierenberg RA, Adams MWW, Arp AJ: Life in extreme environments: Hydrothermal vents. P Natl Acad Sci USA. 2000, 97: 12961-12962. 10.1073/pnas.210395997.View ArticleGoogle Scholar
- Kumar S, Nussinov R: How do thermophilic proteins deal with heat?. Cell Mol Life Sci. 2001, 58: 1216-1233. 10.1007/PL00000935.View ArticleGoogle Scholar
- Haney PJ, Badger JH, Buldak GL, Reich CI, Woese CR, Olsen GJ: Thermal adaptation analyzed by comparison of protein sequences from mesophilic and extremely thermophilic Methanococcus species. Proc Natl Acad Sci USA. 1999, 96: 3578-3583. 10.1073/pnas.96.7.3578.View ArticleGoogle Scholar
- Wang HC, Xia X, Hickey D: Thermal adaptation of the small subunit ribosomal RNA gene: a comparative study. J Mol Evol. 2006, 63: 120-126. 10.1007/s00239-005-0255-4.View ArticleGoogle Scholar
- Gruber T, Kohrer C, Lung B, Shcherbakov D, Piendl W: Affinity of ribosomal protein S8 from mesophilic and (hyper)thermophilic archaea and bacteria for 16S rRNA correlates with the growth temperatures of the organisms. FEBS Lett. 2003, 549: 123-128. 10.1016/S0014-5793(03)00760-9.View ArticleGoogle Scholar
- Altamura S, Cammarano P, Londei P: Archaebacterial and eukaryotic ribosomal subunits can form active hybrid ribosomes. FEBS Lett. 1986, 204: 129-133. 10.1016/0014-5793(86)81400-4.View ArticleGoogle Scholar
- Londei P, Altamura S, Caprini E, Martayan A: Translation and ribosome assembly in extremely thermophilic archaebacteria. Biochimie. 1991, 73: 1465-1472. 10.1016/0300-9084(91)90179-5.View ArticleGoogle Scholar
- Pedone F, Bonincontro A, Briganti G, Giansanti A, Londei P, Risuleo G, Mengoni M: Effects of magnesium and temperature on the conformation and reassociation of Escherichia coli and Sulfolobus solfataricus ribosomes. Biochim Biophys Acta. 1997, 1335: 283-289. 10.1016/S0304-4165(96)00146-8.View ArticleGoogle Scholar
- Cammarano P, Mazzei F, Londei P, Teichner A, de Rosa M, Gambacorta A: Secondary structure features of ribosomal RNA species within intact ribosomal subunits and efficiency of RNA-protein interactions in thermoacidophilic (Caldariella acidophila, Bacillus acidocaldarius) and mesophilic (Escherichia coli) bacteria. Biochim Biophys Acta. 1983, 740: 300-312. 10.1016/0167-4781(83)90139-2.View ArticleGoogle Scholar
- Briganti G, Giordano R, Londei P, Pedone F: Small angle neutron scattering analysis of thermal stability of 23S rRNA and the intact 50S subunits of Sulfolobus solfataricus. Biochim Biophys Acta. 1998, 1379: 297-301. 10.1016/S0304-4165(97)00066-4.View ArticleGoogle Scholar
- Trylska J, Konecny R, Tama F, Brooks CL, McCammon JA: Ribosome motions modulate electrostatic properties. Biopolymers. 2004, 74: 423-431. 10.1002/bip.20093.View ArticleGoogle Scholar
- Felsenstein J: Phylogenies and the Comparative Method. American Naturalist. 1985, 125: 1-15. 10.1086/284325.View ArticleGoogle Scholar
- Garland T, Harvey PH, Ives AR: Procedures for the analysis of comparative data using phylogenetically independent contrasts. Syst Biol. 1992, 41: 18-32.View ArticleGoogle Scholar
- Schuwirth BS, Borovinskaya MA, Hau CW, Zhang W, Vila-Sanjurjo A, Holton JM, Cate JH: Structures of the bacterial ribosome at 3.5 A resolution. Science. 2005, 310: 827-834. 10.1126/science.1117230.View ArticleGoogle Scholar
- Wimberly BT, Brodersen DE, Clemons WM, Morgan-Warren RJ, Carter AP, Vonrhein C, Hartsch T, Ramakrishnan V: Structure of the 30S ribosomal subunit. Nature. 2000, 407: 327-339. 10.1038/35030006.View ArticleGoogle Scholar
- Wilson DN, Nierhaus KH: Ribosomal proteins in the spotlight. Crit Rev Biochem Mol. 2005, 40: 243-267. 10.1080/10409230500256523.View ArticleGoogle Scholar
- Notredame C, Higgins DG, Heringa J: T-coffee: a novel method for fast and accurate multiple sequence alignment. J Mol Biol. 2000, 302: 205-217. 10.1006/jmbi.2000.4042.View ArticleGoogle Scholar
- Berman HM, Westbrook J, Feng Z, Gilliland G, Bhat TN, Weissig H, Shindyalov IN, Bourne PE: The Protein Data Bank. Nucleic Acids Res. 2000, 28: 235-242. 10.1093/nar/28.1.235.View ArticleGoogle Scholar
- Huelsenbeck JP, Ronquist F: MRBAYES: Bayesian inference of phylogenetic trees. Bioinformatics. 2001, 17: 754-755. 10.1093/bioinformatics/17.8.754.View ArticleGoogle Scholar
- Posada D, Crandall KA: MODELTEST: testing the model of DNA substitution. Bioinformatics. 1998, 14: 817-818. 10.1093/bioinformatics/14.9.817.View ArticleGoogle Scholar
- Webb CO, Ackerly DD, Kembel SW: Phylocom: software for the analysis of phylogenetic community structure and trait evolution. Bioinformatics. 2008, 24: 2098-2100. 10.1093/bioinformatics/btn358.View ArticleGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.