Abstract
The deep ocean (>200 m depth) is the largest habitat on Earth. Recent evidence suggests sulfur oxidation could be a major energy source for deep ocean microbes. However, the global relevance and the identity of the major players in sulfur oxidation in the oxygenated deep-water column remain elusive. Here we combined single-cell genomics, community metagenomics, metatranscriptomics and single-cell activity measurements on samples collected beneath the Ross Ice Shelf in Antarctica to characterize a ubiquitous mixotrophic bacterial group (UBA868) that dominates expression of RuBisCO genes and of key sulfur oxidation genes. Further analyses of the gene libraries from the ‘Tara Oceans’ and ‘Malaspina’ expeditions confirmed the ubiquitous distribution and global relevance of this enigmatic group in the expression of sulfur oxidation and dissolved inorganic carbon fixation genes across the global mesopelagic ocean. Our study also underscores the unrecognized importance of mixotrophic microbes in the biogeochemical cycles of the deep ocean.
This is a preview of subscription content, access via your institution
Access options
Access Nature and 54 other Nature Portfolio journals
Get Nature+, our best-value online-access subscription
£14.99 / 30 days
cancel any time
Subscribe to this journal
Receive 12 digital issues and online access to articles
£99.00 per year
only £8.25 per issue
Buy this article
- Purchase on SpringerLink
- Instant access to full article PDF
Prices may be subject to local taxes which are calculated during checkout
Similar content being viewed by others
Data availability
The sequence data generated in this study have been deposited in the EMBL Nucleotide Sequence Database (ENA) database under Bioproject PRJEB35712 (metagenomic and metatranscriptomic raw reads, metagenomic and metatranscriptomic assemblies, metagenomic assembled genomes and single-cell amplified genomes) and in the NCBI Sequence Read Archive (SRA) under Bioproject PRJNA593264 (16S rRNA amplicon reads).
Code availability
Scripts available at Zenodo (https://doi.org/10.5281/zenodo.7721930.2023).
References
Whitman, W. B., Coleman, D. C. & Wiebe, W. J. Prokaryotes: the unseen majority. Proc. Natl Acad. Sci. USA 95, 6578–6583 (1998).
Herndl, G. J. & Reinthaler, T. Microbial control of the dark end of the biological pump. Nat. Geosci. 6, 718–724 (2013).
Reinthaler, T., Van Aken, H. M. & Herndl, G. J. Major contribution of autotrophy to microbial carbon cycling in the deep North Atlantic’s interior. Deep Sea Res. Part II 57, 1572–1580 (2010).
Baltar, F. et al. Significance of non-sinking particulate organic carbon and dark CO2 fixation to heterotrophic carbon demand in the mesopelagic northeast Atlantic. Geophys. Res. Lett. 37, L09602/02010GL043105 (2010).
Könneke, M. et al. Isolation of an autotrophic ammonia-oxidizing marine archaeon. Nature 437, 543–546 (2005).
Wuchter, C. et al. Archaeal nitrification in the ocean. Proc. Natl Acad. Sci. USA 103, 12317–12322 (2006).
Pachiadaki, M. G. et al. Major role of nitrite-oxidizing bacteria in dark ocean carbon fixation. Science 358, 1046–1051 (2017).
Zhang, Y. et al. Nitrifier adaptation to low energy flux controls inventory of reduced nitrogen in the dark ocean. Proc. Natl Acad. Sci. USA 117, 4823–4830 (2020).
Callbeck, C. M. et al. Oxygen minimum zone cryptic sulfur cycling sustained by offshore transport of key sulfur oxidizing bacteria. Nat. Commun. 9, 1729 (2018).
Ulloa, O., Canfield, D. E., DeLong, E. F., Letelier, R. M. & Stewart, F. J. Microbial oceanography of anoxic oxygen minimum zones. Proc. Natl Acad. Sci. USA 109, 15996–16003 (2012).
Walsh, D. A. et al. Metagenome of a versatile chemolithoautotroph from expanding oceanic dead zones. Science 326, 578–582 (2009).
Meier, D. V. et al. Niche partitioning of diverse sulfur-oxidizing bacteria at hydrothermal vents. ISME J. 11, 1545–1558 (2017).
Swan, B. K. et al. Potential for chemolithoautotrophy among ubiquitous bacteria lineages in the dark ocean. Science 333, 1296–1300 (2011).
Landa, M. et al. Sulfur metabolites that facilitate oceanic phytoplankton–bacteria carbon flux. ISME J. 13, 2536–2550 (2019).
Clifford, E. L. et al. Crustacean zooplankton release copious amounts of dissolved organic matter as taurine in the ocean. Limnol. Oceanogr. 62, 2745–2758 (2017).
Tutasi, P. & Escribano, R. Zooplankton diel vertical migration and downward C flux into the oxygen minimum zone in the highly productive upwelling region off northern Chile. Biogeosciences 17, 455–473 (2020).
Williams, B. T. et al. Bacteria are important dimethylsulfoniopropionate producers in coastal sediments. Nat. Microbiol. 4, 1815–1825 (2019).
Callbeck, C. M. et al. Sulfur cycling in oceanic oxygen minimum zones. Limnol. Oceanogr. 66, 2360–2392 (2021).
De Corte, D. et al. Microbes mediating the sulfur cycle in the Atlantic Ocean and their link to chemolithoautotrophy. Environ. Microbiol. 23, 7152–7167 (2021).
Teira, E., Reinthaler, T., Pernthaler, A., Pernthaler, J. & Herndl, G. J. Combining catalyzed reporter deposition-fluorescence in situ hybridization and microautoradiography to detect substrate utilization by Bacteria and Archaea in the deep ocean. Appl. Environ. Microbiol. 70, 44411–44414 (2004).
Parks, D. H. et al. A standardized bacterial taxonomy based on genome phylogeny substantially revises the tree of life. Nat. Biotechnol. 36, 996–1004 (2018).
Martínez-Pérez, C. et al. Phylogenetically and functionally diverse microorganisms reside under the Ross Ice Shelf. Nat. Commun. 13, 117 (2022).
Koch, T. & Dahl, C. A novel bacterial sulfur oxidation pathway provides a new link between the cycles of organic and inorganic sulfur compounds. ISME J. 12, 2479–2491 (2018).
Kiene, R. P., Linn, L. J., González, J., Moran, M. A. & Bruton, J. A. Dimethylsulfoniopropionate and methanethiol are important precursors of methionine and protein-sulfur in marine bacterioplankton. Appl. Environ. Microbiol. 65, 4549–4558 (1999).
Wang, X.-J. et al. Structural and mechanistic insights into dimethylsulfoxide formation through dimethylsulfide oxidation. Front. Microbiol. 12, 735793 (2021).
Bork, P. et al. Tara Oceans studies plankton at planetary scale. Science 348, 873–873 (2015).
Duarte, C. M. Seafaring in the 21st century: the Malaspina 2010 circumnavigation expedition. Limnol. Oceanogr. Bull. 24, 11–14 (2015).
Malfertheiner, L., Martínez-Pérez, C., Zhao, Z., Herndl, G. J. & Baltar, F. Phylogeny and metabolic potential of the candidate phylum SAR324. Biology 11, 599 (2022).
Erb, T. J. Carboxylases in natural and synthetic microbial pathways. Appl. Environ. Microbiol. 77, 8466–8477 (2011).
Baltar, F. et al. Prokaryotic responses to ammonium and organic carbon reveal alternative CO2 fixation pathways and importance of alkaline phosphatase in the mesopelagic North Atlantic. Front. Microbiol. 7, 1670 (2016).
Stewart, F. J., Ulloa, O. & DeLong, E. F. Microbial metatranscriptomics in a permanent marine oxygen minimum zone. Environ. Microbiol. 14, 23–40 (2012).
Frigaard, N.-U. & Dahl, C. Sulfur metabolism in phototrophic sulfur bacteria. Adv. Microb. Physiol. 54, 103–200 (2008).
Gregersen, L. H., Bryant, D. A. & Frigaard, N.-U. Mechanisms and evolution of oxidative sulfur metabolism in green sulfur bacteria. Front. Microbiol. 2, 116 (2011).
Meyer, B., Imhoff, J. F. & Kuever, J. Molecular analysis of the distribution and phylogeny of the soxB gene among sulfur‐oxidizing bacteria—evolution of the Sox sulfur oxidation enzyme system. Environ. Microbiol. 9, 2957–2977 (2007).
Hensen, D., Sperling, D., Trüper, H. G., Brune, D. C. & Dahl, C. Thiosulphate oxidation in the phototrophic sulphur bacterium Allochromatium vinosum. Mol. Microbiol. 62, 794–810 (2006).
Grimm, F., Franz, B. & Dahl, C. in Microbial Sulfur Metabolism (eds Dahl, C. & Friedrich, C. G.) 101–116 (Springer, 2008).
Herndl, G. J., Bayer, B., Baltar, F. & Reinthaler, T. Prokaryotic life in the deep ocean’s water column. Annu. Rev. Mar. Sci. 15, 461–483 (2023).
Klemetsen, T. et al. The MAR databases: development and implementation of databases specific for marine metagenomics. Nucleic Acids Res. 46, D692–D699 (2018).
Nishimura, Y. & Yoshizawa, S. The OceanDNA MAG catalog contains over 50,000 prokaryotic genomes originated from various marine environments. Sci. Data 9, 305 (2022).
Paoli, L. et al. Uncharted biosynthetic potential of the ocean microbiome. Nature 607, 111–118 (2022).
Chaumeil, P.-A., Mussig, A. J., Hugenholtz, P. & Parks, D. H. GTDB-Tk: a toolkit to classify genomes with the Genome Taxonomy Database. Bioinformatics 36, 1925–1927 (2020).
Hyatt, D. et al. Prodigal: prokaryotic gene recognition and translation initiation site identification. BMC Bioinformatics 11, 119 (2010).
Parks, D. H. et al. Recovery of nearly 8,000 metagenome-assembled genomes substantially expands the tree of life. Nat. Microbiol. 2, 1533–1542 (2017).
Olm, M. R., Brown, C. T., Brooks, B. & Banfield, J. F. dRep: a tool for fast and accurate genomic comparisons that enables improved genome recovery from metagenomes through de-replication. ISME J. 11, 2864 (2017).
Eddy, S. R. A probabilistic model of local sequence alignment that simplifies statistical significance estimation. PLoS Comput. Biol. 4, e1000069 (2008).
Salazar, G. et al. Gene expression changes and community turnover differentially shape the global ocean metatranscriptome. Cell 179, 1068–1083.e21 (2019).
Chen, S., Zhou, Y., Chen, Y. & Gu, J. fastp: an ultra-fast all-in-one FASTQ preprocessor. Bioinformatics 34, i884–i890 (2018).
Li, H. & Durbin, R. Fast and accurate short read alignment with Burrows–Wheeler transform. Bioinformatics 25, 1754–1760 (2009).
Nawrocki, E. P. & Eddy, S. R. Infernal 1.1: 100-fold faster RNA homology searches. Bioinformatics 29, 2933–2935 (2013).
Kalvari, I. et al. Rfam 14: expanded coverage of metagenomic, viral and microRNA families. Nucleic Acids Res. 49, D192–D200 (2021).
Danecek, P. et al. Twelve years of SAMtools and BCFtools. Gigascience 10, giab008 (2021).
Aramaki, T. et al. KofamKOALA: KEGG ortholog assignment based on profile HMM and adaptive score threshold. Bioinformatics 36, 2251–2252 (2020).
Gilchrist, C. L. & Chooi, Y.-H. Clinker & clustermap. js: automatic generation of gene cluster comparison figures. Bioinformatics 37, 2473–2475 (2021).
Sunagawa, S. et al. Structure and function of the global ocean microbiome. Science 348, 1261359 (2015).
Fu, L., Niu, B., Zhu, Z., Wu, S. & Li, W. CD-HIT: accelerated for clustering the next-generation sequencing data. Bioinformatics 28, 3150–3152 (2012).
Nguyen, L.-T., Schmidt, H. A., Von Haeseler, A. & Minh, B. Q. IQ-TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Mol. Biol. Evol. 32, 268–274 (2015).
Edgar, R. C. Muscle5: high-accuracy alignment ensembles enable unbiased assessments of sequence homology and phylogeny. Nat. Commun. 13, 6968 (2022).
Tabita, F. R., Satagopan, S., Hanson, T. E., Kreel, N. E. & Scott, S. S. Distinct form I, II, III, and IV Rubisco proteins from the three kingdoms of life provide clues about Rubisco evolution and structure/function relationships. J. Exp. Bot. 59, 1515–1524 (2008).
Berger, S. A. & Stamatakis, A. Aligning short reads to reference alignments and trees. Bioinformatics 27, 2068–2075 (2011).
Barbera, P. et al. EPA-ng: massively parallel evolutionary placement of genetic sequences. Syst. Biol. 68, 365–369 (2019).
Czech, L., Barbera, P. & Stamatakis, A. Genesis and Gappa: processing, analyzing and visualizing phylogenetic (placement) data. Bioinformatics 36, 3263–3265 (2020).
Letunic, I. & Bork, P. Interactive Tree Of Life (iTOL) v5: an online tool for phylogenetic tree display and annotation. Nucleic Acids Res. 49, W293–W296 (2021).
Cock, P. J. et al. Biopython: freely available Python tools for computational molecular biology and bioinformatics. Bioinformatics 25, 1422–1423 (2009).
R Core Team R: A Language and Environment for Statistical Computing (R Foundation for Statistical Computing, 2021); http://www.r-project.org
Wickham, H., François, R., Henry, L., Müller, K & Vaughan, D. dplyr: a grammar of data manipulation. R package version 1.0.6. (2021); https://CRAN.R-project.org/package=dplyr
Kremer, L. ggpointdensity: a cross between a 2D density plot and a scatter plot. R package version 0.1.0. (2019); https://CRAN.R-project.org/package=ggpointdensity
Wickham, H. ggplot2: Elegant Graphics for Data Analysis (Springer, 2016).
Logares, R. et al. Metagenomic 16S rDNA I llumina tags are a powerful alternative to amplicon sequencing to explore diversity and structure of microbial communities. Environ. Microbiol. 16, 2659–2671 (2014).
Cao, S. et al. Structure and function of the Arctic and Antarctic marine microbiota as revealed by metagenomics. Microbiome 8, 47 (2020).
Quast, C. et al. The SILVA ribosomal RNA gene database project: improved data processing and web-based tools. Nucleic Acids Res. 41, D590–D596 (2013).
Ludwig, W. et al. ARB: a software environment for sequence data. Nucleic Acids Res. 32, 1363–1371 (2004).
Bergauer, K. et al. Organic matter processing by microbial communities throughout the Atlantic water column as revealed by metaproteomics. Proc. Natl Acad. Sci. USA 115, E400–E408 (2017).
Manz, W., Amann, R., Ludwig, W., Wagner, M. & Schleifer, K.-H. Phylogenetic oligodeoxynucleotide probes for the major subclasses of proteobacteria: problems and solutions. Syst. Appl. Microbiol. 15, 593–600 (1992).
Fuchs, B. M., Glöckner, F. O., Wulf, J. & Amann, R. Unlabeled helper oligonucleotides increase the in situ accessibility to 16S rRNA of fluorescently labeled oligonucleotide probes. Appl. Environ. Microbiol. 66, 3603–3607 (2000).
Yilmaz, L. S., Parnerkar, S. & Noguera, D. R. mathFISH, a web tool that uses thermodynamics-based mathematical models for in silico evaluation of oligonucleotide probes for fluorescence in situ hybridization. Appl. Environ. Microbiol. 77, 1118–1122 (2011).
Amann, R. I. et al. Combination of 16 rRNA-targeted oligonucleotide probes with flow cytometry for analyzing mixed microbial populations. Appl. Environ. Microbiol. 56, 1919–1925 (1990).
Wallner, G., Amann, R. & Beisker, W. Optimizing fluorescent in situ hybridization with rRNA‐targeted oligonucleotide probes for flow cytometric identification of microorganisms. Cytometry 14, 136–143 (1993).
Sintes, E. & Herndl, G. J. Quantifying substrate uptake of individual cells of marine bacterioplankton populations by catalyzed reporter deposition fluorescence in situ hybridization combined with microautoradiography. Appl. Environ. Microbiol. 72, 7022–7028 (2006).
Acknowledgements
We thank the Victoria University of Wellington Hot Water Drilling Team led by A. Pyne and D. Mendeno. We also thank L. Montiel and V. Balagué from the Institut de Ciències del Mar (ICM), CSIC, Spain, for extracting RNA, and the CNAG staff for RNA sequencing library preparation and sequencing. Bioinformatics analyses were performed at the LiSC Cluster (University of Vienna), and the MARBITS platform (ICM; http://marbits.icm.csic.es). This research was facilitated by the New Zealand Antarctic Research Institute (NZARI)-funded Aotearoa New Zealand Ross Ice Shelf Program. Samples for MICRO-CARD-FISH were collected on several research cruises led by M. Simon (Sonne 248 cruise), B. Quéguiner and I. Obernosterer (MobyDick) and L. J. A. Gerringa (Geotraces-1). F.B. was supported by the Austrian Science Fund (FWF) projects OCEANIDES (P34304-B), ENIGMA (TAI534), EXEBIO (P35248) and OCEANBIOPLAST (P35619-B). G.J.H. was supported by the FWF project ARTEMIS (P28781-B21) and I486-B09 and by the ERC under the European Community’s 7th Framework Programme (FP7/2007-2013)/ERC grant agreement no. 268595 (MEDEA project). R.L. was supported by INTERACTOMICS, CTM2015-69936-P, and J.M.G. by project PID2019-110011RB-C32 (Spanish Ministry of Science and Innovation, Spanish State Research Agency, doi: 10.13039/501100011033).
Author information
Authors and Affiliations
Contributions
F.B. and S.E.M. performed sampling and experiments at RIS. S.E.M. and R.L. performed nucleic acid extraction and library preparation for metagenomics and metatranscriptomics, respectively. T.R. collected samples and provided materials. C.M.-P., C.A., Z.Z., R.L., S.R.-E., J.M.G. and F.B. analysed the data. F.B. and J.M.G. wrote the paper with contributions from all co-authors.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Peer review
Peer review information
Nature Microbiology thanks Maliheh Mehrshad and the other, anonymous, reviewer(s) for their contribution to the peer review of this work.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Extended data
Extended Data Fig. 1 Phylogenetic reconstruction of RbcL peptides from UBA868 and representatives from other groups.
Out of 683 Arenicellales genomes 528 were predicted to harbor recA, a single copy gene conserved in bacteria. Form IA and form II rbcL were predicted in 23 and 89 genomes, respectively, which encodes for the large subunit of the RuBisCO enzyme. Peptide sequences were aligned with MUSCLE and poorly aligned positions in the alignment were removed with TrimAl. The tree was built with RaxML-NG with the model LG + I + G4 as estimated with ModelTest-NG. Numbers at nodes are bootstrap values ≥ 70% (100 replicates). Scale bar indicates the number of substitutions per site.
Extended Data Fig. 2 Depth distribution for UBA868 and non-UBA868 genomes across longitude.
Depths are represented in two scales: 0 and 1000 m (A) and 6000 m (B). Genomes classified as UBA868 (depicted in orange) are mainly retrieved from samples at greater depths than other genomes (blue) across longitudes (Mann-Whitney-Wilcoxon test, p-value = 2.5 × 10−36). Specifically, UBA868 shows a median depth value of 284 m, while non-UBA868 samples display a median depth of 34 m. Genomes are those from Paoli et al., 2022 (doi: 10.1038/s41586-022-04862-3).
Extended Data Fig. 3 Relative abundance of UBA868 based on MICRO-CARD-FISH and cell volume.
(A) Percent contribution of UBA868 to total DAPI stained cells. (B) cell volume (log) of UBA868 positive cells in samples collected from the North Atlantic (175 data points), Tropical Atlantic (163 data points), Tropical Pacific (38 data points) and Southern Ocean (off Kerguelen islands; 88 data points). The upper and lower whiskers extend from the hinge to the largest and lowest value no further than 1.5 × interquartile range from the hinge. To the right, the microscopic image shows the appearance of UBA868 positive cocci. The sample was collected from 1750 m during Archimedes 3 cruise in the tropical North Atlantic Ocean. CF: a colored image of CARD-FISH with DAPI (blue) and FISH (green) channels. DAPI: an image of DAPI channel at the same field of view as CF. Scale bar: 1 μm.
Extended Data Fig. 4 Contribution of UBA868 sulfur oxidizing genes to the overall expression and in the metagenomes, and phylogeny of SoxB peptides.
(A) Average contribution of UBA868 for other genes in the pathways to oxidize sulfur oxidation is highest in the mesopelagic (MES) than in the deep chlorophyll maximum (DCM) or in the surface (SRF). soxB expression and gene abundance were quantified with the KO specific for it and HMM profiles specific for peptides annotated as “thiosulfohydrolase SoxB” in RefSeq and the peptides retrieved with the TIGRFAM specific for SoxB. The contribution of the two subunits for the RuBisCO genes, large and small subunits, rbcL and rbcS is also represented. HMM profiles were also designed to tell apart the contribution of rbcL type I and II. (B) SAR324 SoxB peptides fall in two subclusters, most of which are outside the groups with RefSeq and UBA868 peptides. Only those SAR324 genomes that cluster with RefSeq are predicted to have soxAXYZ (predicted with their KOs). For the rest of taxa, the pathway seems incomplete. The phylogenetic reconstruction suggests that the KO specific for it retrieves SoxB peptides in the SAR324, Flavobacteriales, Pirellulales and part of the Alphaproteobacteria, that might instead have a related but different function from SoxB. Phylogenetic trees were obtained with IQ-TREE as described in the main text.
Extended Data Fig. 5 Phylogenetic reconstruction of representative SoxB peptides from marine bacteria.
The maximum-likelihood tree was built with IQ-TREE. A total of 402 Arenicellales genomes were predicted to contain the soxB gene involved in thiosulphate oxidation, while 528 were predicted to harbor recA. The gene was only found in the families UBA868 (399 genomes), BMS3Bbin11 (one genome), LS-SOB (one in four genomes) and in one unclassified Arenicellales. Only UBA868 SoxB are included in the tree since none of the other Arenicellales families were represented in the metatranscriptomes. This reference tree was used to predict the function and taxonomy of metatranscriptome reads by placing the corresponding peptides on the tree. The smaller phylogenetic tree at the upper left shows the position of some of the Ocean Microbial Reference Gene Catalog (OM-RGC) peptides in part of the UBA868 subcluster. The placement of these OM-RGC peptides was used to assign the label UBA868 to them. The names in bigger fonts belong to the subclusters in Figs. 1 and 6, which include the taxa with smaller fonts. Circles at nodes are proportional to bootstrap values ≥ 70% based on 100 replicates). Scale bar indicates number of substitutions per site.
Extended Data Fig. 6 Phylogenetic reconstruction of rDsrA peptides from marine bacteria.
The maximum-likelihood tree was built with IQ-TREE. Sequences represent the reverse dissimilatory sulfite reductase subunits A (rDsrA), for sulfite oxidation. A total of 458 Arenicellales genomes were predicted to have the rdsrA gene, while 528 Arenicellales were predicted to contain recA. The gene was found in the families UBA868 (433 genomes), UBA5680 (present in all but one of the 23 genomes), BMS3Bbin11 (represented by one genome) and LS-SOB (present in two of the four genomes). UBA5680 peptides are within the Gammaproteobacteria 2 and LS-SOB in the Gammaproteobacteria 3 subclusters in the tree. This reference tree was used to predict the function and taxonomy of metatranscriptome reads by placing the corresponding peptides on the tree. The metatranscriptome peptides were taxonomically classified after placement in one of the subclusters as shown in the smaller phylogenetic tree to the left that represents part of the UBA868 subcluster and some of the Ocean Microbial Reference Gene Catalog (OM-RGC) rDsrA peptides. The names in bigger fonts belong to the subclusters in Figs. 1 and 6, which include the taxa with smaller fonts. Circles at nodes are proportional to the bootstrap values ≥70% based on 100 replicates. Scale bar indicates number of substitutions per site.
Extended Data Fig. 7 Phylogenetic reconstruction of AprA peptides.
The maximum-likelihood tree was built with IQ-TREE. A total of 421 Arenicellales genomes were predicted to encode AprA, while 528 were predicted to contain the single copy gene recA. The aprA gene was found in the families UBA868 (398 genomes), UBA5680 (17 genomes), BMS3Bbin11 (represented by one genome), LS-SOB (present in all four genomes) and one unclassified Arenicellales. UBA868 peptides fall in three clusters. UBA5680, BMS3Bbin11 and LS-SOB AprA peptides are within the Gammaproteobacteria in the figure. This reference tree was used to predict the function and taxonomy of the metatranscriptome reads by placing the corresponding peptides on the tree. The smaller tree at the upper right shows part of the UBA868 sequences and some of the Ocean Microbial Reference Gene Catalog (OM-RGC) peptides that were placed within this subcluster. For the sake of simplicity in Figs. 1 and 6, subclusters with the bigger font in their names include the taxa with smaller font. Circles at nodes are proportional to the bootstrap values ≥70% based on 100 replicates. Scale bar indicates number of substitutions per site.
Extended Data Fig. 8 Flowchart to construct the reference phylogenetic tree and assign a function and taxonomic label to environmental peptides.
SoxB is shown as an example case. In panel A, the set of SoxB peptides were first predicted in the genome database and then a reference phylogenetic tree was constructed and analyzed. Panel B depicts how SoxB is predicted in the environmental peptide sequences and a taxonomic label is assigned.
Extended Data Fig. 9 Phylogenetic reconstruction of form I and II RbcL peptides.
Among the Arenicellales, form I gene was found in 21 of the UBA868 family genomes, in one LS-SOB family genome and an unclassified Arenicellales (A). The form II gene was found in 88 UBA868 family and one BMS3Bbin11 family genomes and was not detected in the rest of the families in the order (B). The BMS3Bbin11 RbcL peptide fell within the unclassified Bacteria in (B). This maximum-likelihood tree was built with IQ-TREE. This reference phylogenetic tree was used to identify the metatranscriptome sequence reads from samples under the Ross Sea ice by placing the corresponding peptides on the tree. RbcL peptides from the Ocean Microbial Reference Gene Catalog (OM-RGC) were also taxonomically classified according to their placement within the subclusters. Circles at nodes are proportional to the bootstrap values ≥70% based on 100 replicates. Scale bar indicates number of substitutions per site.
Supplementary information
Supplementary Information
Supplementary Materials and Methods
Supplementary Data
SoxB alignment file in phylip format.
Supplementary Data
Profile HMM for Rbcl type.
Supplementary Data
Profile HMM for Rbcl type II.
Supplementary Data
Profile HMM for reverse DsrA.
Supplementary Data
Fasta file to construct the alignment file.
Supplementary Data
Fasta file to construct the alignment file.
Supplementary Data
Fasta file to construct the alignment file.
Supplementary Data
Fasta file to construct the alignment file.
Supplementary Data
Fasta file to construct the alignment file.
Supplementary Data
List of TIGRFAMs and their description to build the concatenated phylogenetic tree (Fig. 2).
Supplementary Data
AprA alignment file in phylip format.
Supplementary Data
RbcL type I alignment file in phylip format.
Supplementary Data
RbcL type II alignment file in phylip format.
Supplementary Data
Reverse DsrA alignment file in phylip format.
Supplementary Tables 1–5
Supplementary Table 1. Genomes analysed, taxonomic classification and key genes involved in the carbon, sulfur and nitrogen cycles. Identification of genes is based on hits to Pfam, TIGRFAM or KOfam databases. Those labelled with ‘hmm’ between parenthesis are based on HMM designed in this study and described in Methods, except for DmdA, which is from González et al. (2019). Supplementary Table 2. Relative abundance to the etotal prokaryotic community in the global ocean of UBA868. Data derived from Tara Oceans and Malaspina expeditions. For details, see Methods. Supplementary Table 3. Contribution of UBA868 to total DIC fixation based on MICRO-CARD-FISH (for details, see Methods). Supplementary Table 4. Contribution of UBA868 to total leucine uptake based on MICRO-CARD-FISH (for details, see Methods). Supplementary Table 5. Normalized expression and abundances of genes in the sulfur and carbon cycle. Normalized per-cell abundance was taken directly from the functional profiles, based on KO, at http://ocean-microbiome.embl.de/companion.html. The methods to quantify the average abundances for each gene are described in Supplementary Materials and Methods.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Baltar, F., Martínez-Pérez, C., Amano, C. et al. A ubiquitous gammaproteobacterial clade dominates expression of sulfur oxidation genes across the mesopelagic ocean. Nat Microbiol 8, 1137–1148 (2023). https://doi.org/10.1038/s41564-023-01374-2
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1038/s41564-023-01374-2