Abstract
Free full text
Prevalent genome streamlining and latitudinal divergence of planktonic bacteria in the surface ocean
Associated Data
Abstract
Planktonic bacteria dominate surface ocean biomass and influence global biogeochemical processes, but remain poorly characterized owing to difficulties in cultivation. Using large-scale single cell genomics, we obtained insight into the genome content and biogeography of many bacterial lineages inhabiting the surface ocean. We found that, compared with existing cultures, natural bacterioplankton have smaller genomes, fewer gene duplications, and are depleted in guanine and cytosine, noncoding nucleotides, and genes encoding transcription, signal transduction, and noncytoplasmic proteins. These findings provide strong evidence that genome streamlining and oligotrophy are prevalent features among diverse, free-living bacterioplankton, whereas existing laboratory cultures consist primarily of copiotrophs. The apparent ubiquity of metabolic specialization and mixotrophy, as predicted from single cell genomes, also may contribute to the difficulty in bacterioplankton cultivation. Using metagenome fragment recruitment against single cell genomes, we show that the global distribution of surface ocean bacterioplankton correlates with temperature and latitude and is not limited by dispersal at the time scales required for nucleotide substitution to exceed the current operational definition of bacterial species. Single cell genomes with highly similar small subunit rRNA gene sequences exhibited significant genomic and biogeographic variability, highlighting challenges in the interpretation of individual gene surveys and metagenome assemblies in environmental microbiology. Our study demonstrates the utility of single cell genomics for gaining an improved understanding of the composition and dynamics of natural microbial assemblages.
Planktonic bacteria dominate surface ocean biomass and have a major impact on the global cycling of carbon, nitrogen, and other elements (1). Among the available pure cultures of marine bacterioplankton, only a limited number represent bacterioplankton that are abundant in the ocean, such as the cyanobacteria Prochlorococcus and Synechococcus and the Alphaproteobacteria Pelagibacter (collectively termed PSP cultures). This limits the scope of studies of the microbial metabolic processes and evolutionary changes that impact marine ecosystems and their geochemical cycles (2–6). Unusual nutritional requirements resulting from genome reduction may contribute to cultivation difficulties, as suggested by studies of the chemoheterotroph Pelagibacter (7, 8) and the methylotroph OM43 (9).
Although prevailing culture-independent tools, including microbial community shotgun sequencing, targeted gene surveys, and fluorescent in situ hybridization, have revealed the extent and significance of microbial diversity, they have not been able to provide the genome context information required for accurate metabolic reconstruction spanning organismal, population, and community levels of organization (10). As a result, the genomic repertoires, natural histories, and geographic distribution of even the most abundant taxonomic groups of marine bacterioplankton remain largely unknown (1, 11). Microbial studies in other environments, such as the human body and soils, face similar challenges (10). The recent development of robust protocols for single cell genomics provides a versatile, cultivation-independent approach for assessing natural microbial diversity with corresponding genome context information (12).
To determine whether genome streamlining is a prevalent feature among free-living marine bacterioplankton, and to analyze global patterns of surface ocean bacterioplankton distribution, we obtained draft genomes of 56 single amplified genomes (SAGs) (5, 13–15) and compared them with existing bacterioplankton cultures and metagenomes. The sequenced SAGs represent many ubiquitous surface ocean bacteria lineages, including Marine Group A, Verrucomicrobia, Actinobacteria, Bacteroidetes, and Proteobacteria lineages SAR86, ARCTIC96BD-19, SAR92, SAR116, and Roseobacter (SI Appendix, Fig. S1). The majority of these groups have few or no cultured representatives. Members of the PSP group were excluded from SAG selection, because their genome streamlining and environmental abundance have been demonstrated previously (1, 2, 4, 11). Samples for SAG generation were collected from the Gulf of Maine, the Mediterranean Sea, and the subtropical gyres of the North Pacific and South Atlantic Oceans (SI Appendix, Table S1). On average, 55% (range, 0.3–97.8%) of the genome was recovered from each analyzed cell (SI Appendix, Table S2). A subset of 41 SAGs, each >0.75 Mbp in size and with >30% estimated genome recovery, was used for our comparative genomics and biogeographic analyses. Our results demonstrate that genome streamlining is a prevalent evolutionary strategy among free-living bacterioplankton in the surface ocean. They also suggest that the global distribution of the majority of surface ocean bacterioplankton might not be limited by dispersal and is correlated with temperature and latitude.
Results and Discussion
Genomic Signatures of Streamlining and Oligotrophy Among Uncultured Marine Bacteria.
A comparison of general genome features among marine bacterioplankton revealed that the majority of our SAGs clustered with cultures of Prochlorococcus and Pelagibacter, as well as with the SAR86 SAGs sequenced by Dupont et al. (6) (Fig. 1A and SI Appendix, Table S3). SAGs segregated from cultures along a principal component axis associated with low guanine and cytosine (GC) content, low percentage of noncoding nucleotides, low fraction of genes encoding periplasm and cytoplasm membrane proteins, and Clusters of Orthologous Groups (COG) categories K (transcription) and T (signal transduction). These genomic signatures have been identified as indicators of genome streamlining and oligotrophy (16, 17). All Verrucomicrobia and Bacteroidetes SAGs, one SAR92 SAG, and all Bacteroidetes cultures clustered separately from other SAGs and cultures (Fig. 1A). These genomes are associated with elevated frequency of genes encoding extracellular, outer membrane and multilocation proteins, and COG category V (defense mechanisms), corroborating the previously proposed role of Bacteroidetes (18, 19) and the recently suggested importance of Verrucomicrobia (14) in macromolecule degradation, a process requiring cell surface-associated or extracellular hydrolases. SAGs of the same taxonomic group but retrieved from different geographic locations had similar genomic signatures, indicating that the selection for these signatures operates in both the open ocean and coastal waters, and in diverse climate zones. In contrast, large differences in genomic signatures were found between SAGs and their cultured relatives within each taxonomic group that contains multiple SAGs and cultures, such as Roseobacter, SAR116, and Bacteroidetes (SI Appendix, Table S4).
Obligate oligotrophy has been proposed as a key factor leading to poor recovery of environmental microorganisms in pure cultures (19–21), and our study provides clear evidence for the predominance of a copiotroph lifestyle among existing marine cultures across taxonomic groups. Our data also suggest that oligotroph characteristics in surface ocean bacteria are not limited to members of Prochlorococcus and Pelagibacter in tropical regions, as previously thought (16, 22), but rather is a common trophic strategy among many bacterioplankton lineages around the globe.
As one of the variables contributing to genomic differences between SAGs and cultures (Fig. 1A), the average GC content of SAGs (37.9%) was significantly lower than that of the 101 marine bacterioplankton cultures (48.5%; SI Appendix, Fig. S2A). Although multiple displacement amplification (MDA) of mixed templates may introduce GC biases, here such biases were eliminated by performing MDA on individual cells, followed by high-coverage sequencing and de novo assembly, which have been demonstrated to accurately reconstruct GC of the analyzed genomes (23–25). The high similarity of the average GC content of SAGs (37.9%) and available surface ocean metagenomes (39.6%) provides further support for the representativeness of our SAG data (SI Appendix, Table S5). The difference in %GC between SAGs and cultures was significant in both coding and noncoding genome regions, suggesting GC content rather than protein composition as the primary adaptive trait (SI Appendix, Fig. S2B).
SAGs differed from cultures in the frequency of encoded amino acids (Fig. 1B and SI Appendix, Table S6), with SAGs being enriched in tyrosine, phenylalanine, isoleucine, glutamic acid, asparagine, lysine, and serine and depleted in valine, glycine, alanine, arginine, proline, histidine, and tryptophan. These two groups of amino acids were similar in terms of chemical properties, synthesis costs, and numbers of C and N atoms (SI Appendix, Table S7), but diverged in average GC content of the first two nucleotides of their codons (14% and 79%, respectively). This finding provides further evidence that differences in amino acid utilization between SAGs and cultures are driven primarily by differences in %GC. Recent experimental work suggests that high GC content may enhance bacterial growth in laboratory conditions (26). In contrast, low genomic GC content may be an adaptation to nitrogen limitation (27) or a result of mutational biases in the absence of effective DNA repair systems (16). It remains to be understood how the observed GC depletion in bacterioplankton and the resulting shifts in amino acid use impact surface ocean processes.
One predicted cost of genome streamlining in free-living bacteria is a reduction in physiological flexibility, leading to specialization in resource utilization. Accordingly, SAGs had fewer paralogs and smaller genomes compared with cultures from the same taxonomic groups, with the exception of SAR116 (Fig. 2). The low paralog frequency is not likely the result of incomplete genome recovery from SAGs, given that partial genes at the ends of contigs may be incorrectly assigned as paralogs, leading to overestimation of paralogs. This effect is evident in the substantially higher fraction of paralogs identified from highly fragmented SAR86 SAG assemblies sequenced by Dupont et al. (6) compared with the SAR86 SAGs reported here. This overall trend suggests that the small genome size and fewer gene duplications may provide an adaptive advantage to life in the oligotrophic ocean.
Comparisons of metabolic potential among taxonomic groups represented by multiple SAGs provide strong evidence for specialized resource utilization despite incomplete genome recovery from individual SAGs (SI Appendix, Figs. S3–S8 and Tables S8–S10). For example, Gammaproteobacteria lineages SAR86, SAR92, and ARCTIC96BD-19 encode a heterotrophic central metabolism but differ in terms of pathway completeness and variation. Moreover, genes encoding the oxidative component of the pentose phosphate metabolism are absent in most SAR86 SAGs, but this pathway was found to be complete in most ARCTIC96BD-19 SAGs (SI Appendix, Table S9). Evidence of autotrophic carbon fixation was found only in ARCTIC96BD-19 SAGs, which harbor the RuBisCO operon, as previously reported for SAGs of this lineage from the mesopelagic zone (15). Only the SAR116 SAGs encoded form I coxL, indicating a functional carbon monoxide dehydrogenase (SI Appendix, Fig. S7). Genes supporting various inorganic sulfur utilization pathways were common and lineage-specific, including polysulfide reductase (psr) in Marine Group A, the sox (sulfur oxidation) operon in SAR116, and adenylylsulfate reductase (aprA) among members of ARCTIC96BD-19. Proteorhodopsin genes were found consistently in Marine Group A and ARCTIC96BD-19 SAGs, expanding the taxonomic groups known to encode these photometabolic systems (SI Appendix, Fig. S4 and Table S10).
The ubiquity of metabolic specialization and mixotrophy, as suggested by these data, may contribute to difficulties in cultivating marine bacterioplankton. Accordingly, a member of the ARCTIC96BD-19 lineage was recently cultured from the surface ocean and found to oxidize thiosulfate (28), as was suggested by genome information obtained from SAGs in our previous study (15). Thus, single cell genomics provides a means for the discovery of genes that can be unequivocally assigned to uncultured taxonomic groups, thereby providing critical knowledge about their biology, including clues for cultivation strategies.
Biogeography of Marine Bacterioplankton.
We analyzed the global distribution of surface ocean bacterioplankton using SAGs as references in fragment recruitment (4–6) of publicly available metagenomes, which span diverse geographic regions and climate zones and contain 45 million sequence reads totaling 23 Gbp (SI Appendix, Table S5 and Fig. S9). Using the 95% genomic DNA identity threshold, an operational delineation of taxonomically defined microbial species (29), the combined set of our 41 SAGs recruited an average of 0.9% reads from each surface ocean metagenome (Figs. 3 and and4A).4A). The available PSP genomes (Prochlorococcus, Synechococcus, and Pelagibacter; a total of 24) recruited 1.6%, whereas the remaining 82 genomes of marine bacterioplankton cultures recruited only 0.3% (Fig. 4A). Lowering the DNA identity threshold in fragment recruitment resulted in a linear increase in the fraction of recruited reads until BLAST effectiveness diminished at nucleotide identities <60%. At this relaxed threshold, which corresponds to ~94% identity of the small subunit (SSU) rRNA gene (30) and an approximate, operational delineation of taxonomic order (31), 5.2%, 12.0%, 4.7%, and 19.3% of marine metagenome reads were recruited by SAGs, PSP genomes, 82 other bacterioplankton cultures, and a combined set of all genomes, respectively. Although the majority of marine bacterioplankton remains genomically unexplored, single cell sequencing offers a practical solution for genome recovery of uncultivated environmental microorganisms.
Using the 95% genomic DNA identity threshold, all SAGs obtained from the Gulf of Maine recruited the highest fraction of metagenomes from temperate regions (average temperature, 11.7 °C; range, 4.0–18.2 °C), which are represented by the northeast and northwest coasts of North America, the Atlantic coast of Europe, and the Indian Ocean off New Zealand in available datasets (Fig. 3). In contrast, SAGs obtained from the two subtropical gyres recruited primarily from warm-water metagenomes in the Atlantic, Pacific, and Indian Oceans (average temperature, 25.7 °C; range, 18.6–29.3 °C; designated “tropical”). SAGs recovered from the Mediterranean Sea, which has an intermediate climate, recruited relatively evenly across temperate and tropical metagenomes. Metagenomes from the Southern Ocean (average temperature, −0.1 °C; range, −2.0 to 4.2 °C; designated “polar”) recruited primarily to SAGs from the Gulf of Maine, although significantly less compared with temperate metagenomes. In contrast to recruitment to SAGs, metagenome fragment recruitment to the majority of marine cultures was limited, and fewer clear biogeographic patterns were apparent (SI Appendix, Figs. S10 and S11), in agreement with previous observations (4, 32).
The abundance of specific genotypes, determined by metagenome fragment recruitment, was most strongly correlated with surface water temperature and latitude (SI Appendix, Fig. S12). Chlorophyll a concentration, water column depth, and longitude were minor factors in the ordination, suggesting that phytoplankton abundance, proximity to the coast, and geographic distance among sampling stations are less important than latitude in determining the abundance of most analyzed genotypes. These findings corroborate recent reports of temperature as a major driver of the global distribution of marine algae (33, 34) and Pelagibacter (35). Temperature and latitude also have been identified as key determinants of less-specific descriptors of marine bacterioplankton biogeography, such as community richness (36) and the frequency of functionally related genes (37–39), for which our study provides extensive genomic context.
We estimated the ratio of metagenomic fragment recruitment from native versus nonnative climate zones, relative to SAG collection site, at various DNA identity intervals as proxies for evolutionary distance (Fig. 4B). In the case of temperate versus tropical zones, the ratio was highest (3,827) at 95–100% DNA identity, decreased to 154 at 90–95% identity, and declined to <10 at 80–85% identity. This pattern was similar for all taxonomic groups analyzed. The corresponding ratios were similar when comparing recruitment by temperate SAGs in temperate versus polar environments, but were higher when comparing recruitment by tropical SAGs in tropical versus polar environments. Thus, operationally defined species (>95% genomic DNA identity) were highly specific to their climate zones, but little geographic specificity was observed within phylogenetic groups that shared <80% genomic DNA identity, which corresponds to ~97% identity of the SSU rRNA gene (31). Accordingly, several bacterioplankton cells analyzed in this study shared >97% identity of their SSU rRNA genes even though they originated from divergent climate zones and demonstrated contrasting geography in metagenome fragment recruitment; examples include SAR116 SAGs AAA158-M15 versus AAA015-N04 and SAR86 SAGs AAA298-N10 versus AAA076-P09 (SI Appendix, Fig. S1).
Whereas the SSU rRNA gene identities were high in these pairs of SAGs, the average nucleotide identity (29) was only 75% and 71%, respectively. The >97% identity of the SSU rRNA gene is the most widely used delineator of operational taxonomic units (OTUs) in microbial ecology. However, it is often overlooked that such OTUs encompass much broader phylogenetic groups than the currently accepted, operationally defined bacterial species, and may contain organisms with divergent adaptations. Thus, insufficient phylogenetic resolution might explain the difficulties encountered in earlier studies in detecting consistent differentiation of bacterioplankton along longitudinal gradients when using SSU rRNA gene surveys (35, 39, 40) or metagenome fragment recruitment with relaxed settings (32), although the more pronounced differences between polar and tropical bacterioplankton have been reported from such studies (35, 39, 40). Here, metagenome fragment recruitment using stringent settings and environmentally relevant, single cell genomes as references enabled us to identify previously undetected, community-wide genetic divergence among tropical, temperate, and polar marine bacterioplankton.
Assuming 1% divergence of the SSU rRNA gene every 50 Ma (41), we estimate that bacterioplankton genetic differences among the three climate zones might have accumulated over tens to hundreds of millions of years. Although such estimates contain significant uncertainties (42, 43), it is clear that the required evolutionary timeframe encompasses numerous overturns of the global ocean by surface currents and thermohaline circulation, which take 1,000–2,000 y each (44). These estimates corroborate the absence of longitudinal effects on fragment recruitment (Figs. 3 and and4)4) and suggest that the observed differences in bacterioplankton composition between nonpolar climate zones are not driven by dispersal limitations, but are defined by evolutionary innovation enabling certain genotypes to thrive in a specific climate zone. Given our lack of direct evidence for the genomic context of recruited metagenome fragments, how local populations of surface ocean bacterioplankton vary by their genome organization remains to be determined. Nevertheless, our data suggest that the global distribution of surface ocean bacterioplankton genes is not limited by dispersal at the time scales required for nucleotide substitution to exceed the current operational definition of bacteria species, thus adding some evolutionary constraints to the famous statement that “everything is everywhere, but the environment selects” (45).
Summary
Using large-scale single cell genomic sequencing and metagenome fragment recruitment, we have provided extensive, cultivation-independent insight into the genome-level diversity, metabolic potential, and biogeography of many abundant bacterial lineages inhabiting the surface ocean. Our data provide clear evidence that existing laboratory cultures consist mostly of copiotrophic genotypes, compared with free-living bacterioplankton that are streamlined for growth under resource-poor conditions. We also show that the global distribution of the majority of surface ocean bacterioplankton is correlated with temperature and latitude and is not likely limited by dispersal. Individual cells with highly similar SSU rRNA gene sequences exhibited significant genomic and biogeographic variability, highlighting challenges in the interpretation of individual gene surveys and metagenome assemblies in environmental microbiology. Our study demonstrates the utility of single cell genomics in providing a significantly improved understanding of the composition and dynamics of natural microbial assemblages in the ocean and other environments, which will be critical in predicting how ecosystems respond to large-scale environmental shifts, such as global warming and ocean acidification.
Materials and Methods
Collection and Construction of SAGs.
Replicate, 1-mL aliquots of water collected for single cell analyses were cryopreserved with 6% glycine betaine (Sigma-Aldrich) and stored at −80 °C or in liquid nitrogen (46). Single cell sorting, whole-genome amplification, real-time PCR screens, and PCR product sequence analyses were performed at the Bigelow Laboratory Single Cell Genomics Center (www.bigelow.org/scgc), as described by Stepanauskas and Sieracki (13) for SAGs MS024-2A, MS024-3C, MS190-1F, and MS220-5C and by Swan et al. (15) and Martinez-Garcia et al. (14) for the remaining SAGs.
SSU rRNA gene sequences were edited using Sequencher v4.7 (Gene Codes) and compared with previously deposited sequences using the RDP v10 Classifier (SSU rRNA) and National Center for Biotechnology Information BLAST. SAG SSU rRNA sequences were aligned with selected database sequences using ClustalW. Alignment columns with > 90% gaps were removed, and a maximum likelihood tree (100 bootstrap replicates) was constructed using PhyML implemented in Geneious v6.0.5 (47). Details of SAG sequencing, assembly, and annotation are provided in the SI Appendix.
Genome Recovery Estimation of SAGs and Determination of Paralogs.
To estimate the completeness of each assembled SAG genome, we analyzed all finished genome sequences of the taxonomic phyla Alphaproteobacteria (n = 145), Gammaproteobacteria (n = 317), Bacteroidetes (n = 22), and Actinobacteria (n = 131); the taxonomic phylum Verrucomicrobia (n = 4); and the taxonomic domain Bacteria (n = 1,023) available from the Integrated Microbial Genomes (IMG) database (48). Based on COG gene classifications, a set of conserved single copy genes (CSCGs) was extracted for each group of finished genomes from the IMG database. A CSCG was defined as a gene that occurs only once in each of 99% (95% in the case of the domain Bacteria) of the genomes contributing to the taxonomic group. The number of CSCGs for each group was as follows: Alphaproteobacteria, n = 58; Gammaproteobacteria, n = 47; Bacteroidetes, n = 86; Actinobacteria, n = 60; Verrucomicrobia, n = 330; Bacteria, n = 45. The ratio of the number of CSCGs observed for each SAG assembly and for the corresponding taxonomic group of finished genomes was used as a measure of genome recovery (SI Appendix, Table S2).
The frequency of paralog gene families within SAGs and marine cultures was determined using BLASTCLUST with the following settings: −L 0.5 −S 30.0 −e 1e-6. The number of paralogs out of the total number of protein coding genes was calculated for each genome.
Multivariate Analysis of SAG and Marine Culture Genome Signatures.
The amino acid frequencies of 41 SAGs and bacterioplankton genomes were determined using Geneious v. 6.0.5, arcsin square root-transformed, and analyzed using principal components analysis (PCA) after standardization of values. Several genome characteristics found to separate marine prokaryotes by lifestyle (i.e., frequency of protein localizations and several COG categories) were calculated for SAGs and marine culture genomes as described previously (17), as was %GC and noncoding DNA, and these values were used as input for a second PCA analysis as described above. For this second PCA, the two Actinobacteria SAGs AAA015-D07 and AAA015-M09 were excluded. All PCAs were conducted using PRIMER v6.0.
Fragment Recruitment Analysis.
The basic approach of Rusch et al. (4) was used to estimate the abundances of relatives of SAGs and bacterioplankton cultures within each metagenome. BLAST+ v2.2.25 was used to recruit metagenome sequences to each SAG assembly using default parameter values, except for the following: -evalue 0.0001 -reward 1 -penalty -1 -soft_masking true -lcase_masking -xdrop_gap 150. Genome contigs ≥2,000 kbp from each SAG were used in the fragment recruitment analysis. The 23S, 16S, 5S, and ITS regions were masked in each genome before recruitment. The percentage of unique recruits (≥200 bp long and matching at ≥95% identity) from each metagenome matching to each SAG was normalized by genome length. The percentage of unique reads for each metagenome–genome pair was also determined at 90%, 85%, 80%, 75%, 70%, 65%, 60%, 55%, and 50% identity thresholds. SAG abundances from each metagenome were calculated from BLAST output and plotted using custom R scripts. Metagenomes used in fragment recruitment analysis were quality processed using PRINSEQ (49), and all sequences with the following characteristics were removed from further analysis: sequences <100 bp, sequences containing any ambiguities (Ns), all forms of replicate and duplicate sequences, and sequences with a minimum entropy value of 70 (applied to pyrosequencing datasets only).
Environmental and Sample Location Correlations with Fragment Recruitment Abundances.
The influence of environmental factors on fragment recruitment-derived community composition was determined using nonmetric multidimensional scaling (MDS). MDS is an ordination technique that plots samples as points in low-dimensional space while attempting to maintain the relative distances between points as close as possible to the actual rank order of similarities between samples (50). Thus, metagenomes with similar community composition are plotted closer together in ordination space. A stress factor calculated for each MDS ordination indicates how well plotted configurations of sample distances agree with original rank orders calculated from the similarity matrices. SAG recruitment abundances were arcsin square root-transformed, and the Bray–Curtis distance was calculated for the MDS analysis. Sampling and environmental factors used for axis correlations were temperature, chlorophyll a concentration, water column depth at the sampling location (log-transformed), and latitude and longitude of the sampling location. All MDS calculations were performed using PC-ORD v6.08.
Calculation of Average Nucleotide Identity Between Genomes.
Average nucleotide identity (ANI) values between the pairs of SAR116 SAGs AAA158-M15 and AAA015-N04 and SAR86 SAGs AAA298-N10 and AAA076-P09 were calculated following the method described by Goris et al. (29), using a custom Perl script. Each SAG served as a reference genome, and resulting ANI values were averaged.
Acknowledgments
We thank David Emerson, William Hanage, Siv Anderson, and Jessica Labonte for valuable comments; Edward F. DeLong for metagenomes from Hawaii ocean time (HOT) station ALOHA; the officers and crew of the RV Ka'imikai-O-Kanaloa and the HOT team for sample collection at station ALOHA; and Jane Heywood and Michael Sieracki for South Atlantic field sample collection. This work was supported by National Science Foundation Grants EF-826924 (to R.S.), OCE-821374 (to R.S.), and OCE-1232982 (to R.S. and B.K.S.); US Department of Energy (DOE) JGI 2011 Microbes Program Grant CSP 387 (to R.S., B.K.S., S.G., M.A.M., F.M.L., R.C. and S.G.A.); the Gordon and Betty Moore Foundation (M.A.M.); Spanish Ministry of Science and Innovation Grant CGL2011-26848/BOS (to S.G.A) and CONSOLIDER-INGENIO2010 Program Grant CSD2008-00077 (to S.G.A. and J.M.G.); the Natural Sciences and Engineering Research Council of Canada (NSERC); the Canada Foundation for Innovation, and the Canadian Institute for Advanced Research (CIFAR; S.J.H.). J.J.W. was supported by NSERC. Research activities of R.C. and F.M.L. are supported by the Australian Research Council, and research activities of R.C. are supported by the Australian Antarctic Science program. Work conducted by the DOE Joint Genome Institute is supported by the DOE’s Office of Science under Contract DE-AC02-05CH11231. This is contribution no. 006 of the Tara Oceans Expedition 2009–2012.
Footnotes
The authors declare no conflict of interest.
This article is a PNAS Direct Submission.
Data deposition: Whole-genome sequence data for single amplified genomes used for our analyses are available in the Joint Genome Institute's Integrated Microbial Genome database, http://img.jgi.doe.gov/cgi-bin/w/main.cgi (accession nos. 643886079, 643886118, 2228664025-26, 2228664028-29, 2228664032, 2228664034, 2228664052-53, 2228664055-56, 2236347001, 2236347003, 2236347013, 2236347015, 2236347017-19, 2236347021-24, 2236347026-27, 2236347030-33, 2236347035-36, 2236347039, 2236347041, 2236347043, 2236661010, 2236661014, 2236661017-18, 2507262045, 2507262047, and 2517572139).
This article contains supporting information online at www.pnas.org/lookup/suppl/10.1073/pnas.1304246110/-/DCSupplemental.
References
Articles from Proceedings of the National Academy of Sciences of the United States of America are provided here courtesy of National Academy of Sciences
Full text links
Read article at publisher's site: https://doi.org/10.1073/pnas.1304246110
Read article for free, from open access legal sources, via Unpaywall: https://www.pnas.org/content/pnas/110/28/11463.full.pdf
Citations & impact
Impact metrics
Citations of article over time
Alternative metrics
Smart citations by scite.ai
Explore citation contexts and check if this article has been
supported or disputed.
https://scite.ai/reports/10.1073/pnas.1304246110
Article citations
Diversity and distribution of a prevalent Microviridae group across the global oceans.
Commun Biol, 7(1):1377, 23 Oct 2024
Cited by: 0 articles | PMID: 39443614 | PMCID: PMC11499846
A Marine Group A isolate relies on other growing bacteria for cell wall formation.
Nat Microbiol, 9(8):1954-1963, 03 Jun 2024
Cited by: 0 articles | PMID: 38831032
Genome-scale community modelling reveals conserved metabolic cross-feedings in epipelagic bacterioplankton communities.
Nat Commun, 15(1):2721, 28 Mar 2024
Cited by: 2 articles | PMID: 38548725 | PMCID: PMC10978986
Advances in phage-host interaction prediction: in silico method enhances the development of phage therapies.
Brief Bioinform, 25(3):bbae117, 01 Mar 2024
Cited by: 0 articles | PMID: 38555471 | PMCID: PMC10981677
Review Free full text in Europe PMC
Expanded phylogeny of extremely halophilic archaea shows multiple independent adaptations to hypersaline environments.
Nat Microbiol, 9(4):964-975, 22 Mar 2024
Cited by: 3 articles | PMID: 38519541
Go to all (215) article citations
Other citations
Data
Data behind the article
This data has been text mined from the article, or deposited into data resources.
BioStudies: supplemental material and supporting data
Similar Articles
To arrive at the top five similar articles we use a word-weighted algorithm to compare words from the Title and Abstract of each citation.
Assembling the marine metagenome, one cell at a time.
PLoS One, 4(4):e5299, 23 Apr 2009
Cited by: 202 articles | PMID: 19390573 | PMCID: PMC2668756
Metagenome-assembled genomes uncover a global brackish microbiome.
Genome Biol, 16:279, 14 Dec 2015
Cited by: 95 articles | PMID: 26667648 | PMCID: PMC4699468
Pole-to-pole biogeography of surface and deep marine bacterial communities.
Proc Natl Acad Sci U S A, 109(43):17633-17638, 08 Oct 2012
Cited by: 155 articles | PMID: 23045668 | PMCID: PMC3491513
Genomic insights into bacterial DMSP transformations.
Ann Rev Mar Sci, 4:523-542, 01 Jan 2012
Cited by: 69 articles | PMID: 22457986
Review