Oct 07, 2023
Genomes of six viruses that infect Asgard archaea from deep
Nature Microbiology volume 7,
Nature Microbiology volume 7, pages 953–961 (2022)Cite this article
7872 Accesses
10 Citations
433 Altmetric
Metrics details
Asgard archaea are globally distributed prokaryotic microorganisms related to eukaryotes; however, viruses that infect these organisms have not been described. Here, using metagenome sequences recovered from deep-sea hydrothermal sediments, we characterize six relatively large (up to 117 kb) double-stranded DNA (dsDNA) viral genomes that infected two Asgard archaeal phyla, Lokiarchaeota and Helarchaeota. These viruses encode Caudovirales-like structural proteins, as well as proteins distinct from those described in known archaeal viruses. Their genomes contain around 1–5% of genes associated with eukaryotic nucleocytoplasmic large DNA viruses (NCLDVs) and appear to be capable of semi-autonomous genome replication, repair, epigenetic modifications and transcriptional regulation. Moreover, Helarchaeota viruses may hijack host ubiquitin systems similar to eukaryotic viruses. Genomic analysis of these Asgard viruses reveals that they contain features of both prokaryotic and eukaryotic viruses, and provides insights into their potential infection and host interaction mechanisms.
This is a preview of subscription content, access via your institution
Open Access articles citing this article.
Nature Communications Open Access 29 December 2022
Nature Microbiology Open Access 27 June 2022
Access Nature and 54 other Nature Portfolio journals
Get Nature+, our best-value online-access subscription
$29.99 / 30 days
cancel any time
Subscribe to this journal
Receive 12 digital issues and online access to articles
$119.00 per year
only $9.92 per issue
Rent or buy this article
Get just this article for as long as you need it
$39.95
Prices may be subject to local taxes which are calculated during checkout
The genomic sequences associated with the study have been deposited in NCBI under BioProject PRJNA692327.
All custom scripts, alignments and phylogenetic tree files have been made available at https://github.com/bakermicrolab/asgardviruses.
Spang, A. et al. Complex archaea that bridge the gap between prokaryotes and eukaryotes. Nature 521, 173–179 (2015).
Article CAS PubMed PubMed Central Google Scholar
Zaremba-Niedzwiedzka, K. et al. Asgard archaea illuminate the origin of eukaryotic cellular complexity. Nature 541, 353–358 (2017).
Article CAS PubMed Google Scholar
Eme, L., Spang, A., Lombard, J., Stairs, C. W. & Ettema, T. J. G. Archaea and the origin of eukaryotes. Nat. Rev. Microbiol. 15, 711–723 (2017).
Article CAS PubMed Google Scholar
Baker, B. J. et al. Diversity, ecology and evolution of Archaea. Nat. Microbiol. 5, 887–900 (2020).
Article CAS PubMed Google Scholar
Imachi, H. et al. Isolation of an archaeon at the prokaryote–eukaryote interface. Nature 577, 519–525 (2020).
Article CAS PubMed PubMed Central Google Scholar
Spang, A. et al. Proposal of the reverse flow model for the origin of the eukaryotic cell based on comparative analyses of Asgard archaeal metabolism. Nat. Microbiol. 4, 1138–1148 (2019).
Article CAS PubMed Google Scholar
Bell, P. J. L. Evidence supporting a viral origin of the eukaryotic nucleus. Virus Res. 289, 198168 (2020).
Article CAS PubMed Google Scholar
Forterre, P. & Gaïa, M. Giant viruses and the origin of modern eukaryotes. Curr. Opin. Microbiol. 31, 44–49 (2016).
Article PubMed Google Scholar
Chaikeeratisak, V. et al. Assembly of a nucleus-like structure during viral replication in bacteria. Science 355, 194–197 (2017).
Article CAS PubMed PubMed Central Google Scholar
Malone, L. M. et al. A jumbo phage that forms a nucleus-like structure evades CRISPR-Cas DNA targeting but is vulnerable to type III RNA-based immunity. Nat. Microbiol. 5, 48–55 (2020).
Article CAS PubMed Google Scholar
Iyer, L. M., Aravind, L. & Koonin, E. V. Common origin of four diverse families of large eukaryotic DNA viruses. J. Virol. 75, 11720–11734 (2001).
Article CAS PubMed PubMed Central Google Scholar
Krupovic, M., Dolja, V. V. & Koonin, E. V. The LUCA and its complex virome. Nat. Rev. Microbiol. 18, 661–670 (2020).
Article CAS PubMed Google Scholar
Makarova, K. S. et al. Evolutionary classification of CRISPR-Cas systems: a burst of class 2 and derived variants. Nat. Rev. Microbiol. 18, 67–83 (2020).
Article CAS PubMed Google Scholar
Dombrowski, N., Teske, A. P. & Baker, B. J. Expansive microbial metabolic versatility and biodiversity in dynamic Guaymas Basin hydrothermal sediments. Nat. Commun. 9, 4999 (2018).
Article PubMed PubMed Central CAS Google Scholar
Castelle, C. J. et al. Protein family content uncovers lineage relationships and bacterial pathway maintenance mechanisms in DPANN Archaea. Front. Microbiol. 12, 660052 (2021).
Article PubMed PubMed Central Google Scholar
Langwig, M. V. et al. Large-scale protein level comparison of Deltaproteobacteria reveals cohesive metabolic groups. ISME J. https://doi.org/10.1038/s41396-021-01057-y (2021).
Kieft, K., Zhou, Z. & Anantharaman, K. VIBRANT: automated recovery, annotation and curation of microbial viruses, and evaluation of viral community function from genomic sequences. Microbiome 8, 90 (2020).
Article CAS PubMed PubMed Central Google Scholar
Prangishvili, D. et al. The enigmatic archaeal virosphere. Nat. Rev. Microbiol. 15, 724–739 (2017).
Article CAS PubMed Google Scholar
Nayfach, S. et al. CheckV assesses the quality and completeness of metagenome-assembled viral genomes. Nat. Biotechnol. 39, 578–585 (2021).
Article CAS PubMed Google Scholar
Kazlauskas, D., Krupovic, M. & Venclovas, Č. The logic of DNA replication in double-stranded DNA viruses: insights from global analysis of viral genomes. Nucleic Acids Res. 44, 4551–4564 (2016).
Article CAS PubMed PubMed Central Google Scholar
Pons, J. C. et al. VPF-Class: Taxonomic assignment and host prediction of uncultivated viruses based on viral protein families. Bioinformatics https://doi.org/10.1093/bioinformatics/btab026 (2021).
Krupovic, M., Cvirkaite-Krupovic, V., Iranzo, J., Prangishvili, D. & Koonin, E. V. Viruses of archaea: structural, functional, environmental and evolutionary genomics. Virus Res. 244, 181–193 (2018).
Article CAS PubMed Google Scholar
Yutin, N., Wolf, Y. I., Raoult, D. & Koonin, E. V. Eukaryotic large nucleo-cytoplasmic DNA viruses: clusters of orthologous genes and reconstruction of viral genome evolution. Virol. J. 6, 223 (2009).
Article PubMed PubMed Central CAS Google Scholar
Koonin, E. V. & Dolja, V. V. Virus world as an evolutionary network of viruses and capsidless selfish elements. Microbiol. Mol. Biol. Rev. 78, 278–303 (2014).
Article CAS PubMed PubMed Central Google Scholar
Iranzo, J., Koonin, E. V., Prangishvili, D., Krupovic, M. & Sandri-Goldin, R. M. Bipartite network analysis of the archaeal virosphere: evolutionary connections between viruses and capsidless mobile elements. J. Virol. 90, 11043–11055 (2016).
Article CAS PubMed PubMed Central Google Scholar
Kala, S. et al. HNH proteins are a widespread component of phage DNA packaging machines. Proc. Natl Acad. Sci. USA 111, 6022–6027 (2014).
Article CAS PubMed PubMed Central Google Scholar
Guilliam, T. A., Keen, B. A., Brissett, N. C. & Doherty, A. J. Primase-polymerases are a functionally diverse superfamily of replication and repair enzymes. Nucleic Acids Res. 43, 6651–6664 (2015).
Article CAS PubMed PubMed Central Google Scholar
Gupta, A., Lad, S. B., Ghodke, P. P., Pradeepkumar, P. I. & Kondabagil, K. Mimivirus encodes a multifunctional primase with DNA/RNA polymerase, terminal transferase and translesion synthesis activities. Nucleic Acids Res. 47, 6932–6945 (2019).
Article CAS PubMed PubMed Central Google Scholar
MacNeill, S. A. PCNA-binding proteins in the archaea: novel functionality beyond the conserved core. Curr. Genet. 62, 527–532 (2016).
Article CAS PubMed PubMed Central Google Scholar
Mazzon, C. et al. Cytosolic and mitochondrial deoxyribonucleotidases: activity with substrate analogs, inhibitors and implications for therapy. Biochem. Pharmacol. 66, 471–479 (2003).
Article CAS PubMed Google Scholar
Colson, P., La Scola, B., Levasseur, A., Caetano-Anollés, G. & Raoult, D. Mimivirus: leading the way in the discovery of giant viruses of amoebae. Nat. Rev. Microbiol. 15, 243–254 (2017).
Article CAS PubMed PubMed Central Google Scholar
Doherty, A. J., Serpell, L. C. & Ponting, C. P. The helix-hairpin-helix DNA-binding motif: a structural basis for non-sequence-specific recognition of DNA. Nucleic Acids Res. 24, 2488–2497 (1996).
Article CAS PubMed PubMed Central Google Scholar
Iyer, L. M., Balaji, S., Koonin, E. V. & Aravind, L. Evolutionary genomics of nucleo-cytoplasmic large DNA viruses. Virus Res. 117, 156–184 (2006).
Article CAS PubMed Google Scholar
Sim, S., Hughes, K., Chen, X. & Wolin, S. L. The bacterial Ro60 protein and its noncoding Y RNA regulators. Annu. Rev. Microbiol. 74, 387–407 (2020).
Article CAS PubMed Google Scholar
Ho, C. K., Wang, L. K., Lima, C. D. & Shuman, S. Structure and mechanism of RNA ligase. Structure 12, 327–339 (2004).
Article CAS PubMed Google Scholar
Tang, Q., Wu, P., Chen, H. & Li, G. Pleiotropic roles of the ubiquitin-proteasome system during viral propagation. Life Sci. 207, 350–354 (2018).
Article CAS PubMed PubMed Central Google Scholar
Murphy, J., Mahony, J., Ainsworth, S., Nauta, A. & van Sinderen, D. Bacteriophage orphan DNA methyltransferases: insights from their bacterial origin, function, and occurrence. Appl. Environ. Microbiol. 79, 7547–7555 (2013).
Article CAS PubMed PubMed Central Google Scholar
Jeudy, S. et al. Exploration of the propagation of transpovirons within Mimiviridae reveals a unique example of commensalism in the viral world. ISME J. 14, 727–739 (2020).
Article CAS PubMed Google Scholar
Agarkova, I. V., Dunigan, D. D. & Van Etten, J. L. Virion-associated restriction endonucleases of chloroviruses. J. Virol. 80, 8114–8123 (2006).
Article CAS PubMed PubMed Central Google Scholar
Markine-Goriaynoff, N. et al. Glycosyltransferases encoded by viruses. J. Gen. Virol. 85, 2741–2754 (2004).
Article CAS PubMed Google Scholar
Piacente, F., Gaglianone, M., Laugieri, M. E. & Tonetti, M. G. The autonomous glycosylation of large DNA viruses. Int. J. Mol. Sci. 16, 29315–29328 (2015).
Article CAS PubMed PubMed Central Google Scholar
Hagelueken, G. et al. A coiled-coil domain acts as a molecular ruler to regulate O-antigen chain length in lipopolysaccharide. Nat. Struct. Mol. Biol. 22, 50–56 (2014).
Article PubMed PubMed Central CAS Google Scholar
Tamarit, D. et al. A closed Candidatus Odinarchaeum chromosome exposes Asgard archaeal viruses. Nat. Microbiol. https://doi.org/10.1038/s41564-022-01122-y (2022).
Article PubMed Google Scholar
Medvedeva, S. et al. Three families of Asgard archaeal viruses identified in metagenome-assembled genomes. Nat. Microbiol. https://doi.org/10.1038/s41564-022-01144-6 (2022).
Article PubMed Google Scholar
Joshi, N.A. & Fass, J.N. Sickle: a sliding-window, adaptive, quality-based trimming tool for FastQ files (Version 1.33) [Software] (2011). https://github.com/najoshi/sickle
Peng, Y., Leung, H. C. M., Yiu, S. M. & Chin, F. Y. L. IDBA-UD: a de novo assembler for single-cell and metagenomic sequencing data with highly uneven depth. Bioinformatics 28, 1420–1428 (2012).
Article CAS PubMed Google Scholar
Parks, D. H., Imelfort, M., Skennerton, C. T., Hugenholtz, P. & Tyson, G. W. CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes. Genome Res. 25, 1043–1055 (2015).
Article CAS PubMed PubMed Central Google Scholar
Alneberg, J. et al. Binning metagenomic contigs by coverage and composition. Nat. Methods 11, 1144–1146 (2014).
Article CAS PubMed Google Scholar
Kang, D. D. et al. MetaBAT 2: an adaptive binning algorithm for robust and efficient genome reconstruction from metagenome assemblies. PeerJ 7, e7359 (2019).
Article PubMed PubMed Central Google Scholar
Sieber, C. M. K. et al. Recovery of genomes from metagenomes via a dereplication, aggregation and scoring strategy. Nat. Microbiol. 3, 836–843 (2018).
Article CAS PubMed PubMed Central Google Scholar
Chen, I.-M. A. et al. IMG/M v.5.0: an integrated data management and comparative analysis system for microbial genomes and microbiomes. Nucleic Acids Res. 47, D666–D677 (2019).
Article CAS PubMed Google Scholar
Jones, P. et al. InterProScan 5: genome-scale protein function classification. Bioinformatics 30, 1236–1240 (2014).
Article CAS PubMed PubMed Central Google Scholar
Biswas, A., Staals, R. H. J., Morales, S. E., Fineran, P. C. & Brown, C. M. CRISPRDetect: a flexible algorithm to define CRISPR arrays. BMC Genomics 17, 356 (2016).
Article PubMed PubMed Central CAS Google Scholar
Fu, L., Niu, B., Zhu, Z., Wu, S. & Li, W. CD-HIT: accelerated for clustering the next-generation sequencing data. Bioinformatics 28, 3150–3152 (2012).
Article CAS PubMed PubMed Central Google Scholar
Bland, C. et al. CRISPR recognition tool (CRT): a tool for automatic detection of clustered regularly interspaced palindromic repeats. BMC Bioinformatics 8, 209 (2007).
Article PubMed PubMed Central CAS Google Scholar
Li, H. & Durbin, R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 25, 1754–1760 (2009).
Article CAS PubMed PubMed Central Google Scholar
Danecek, P. et al. Twelve years of SAMtools and BCFtools. Gigascience 10, giab008 (2021).
Article PubMed PubMed Central CAS Google Scholar
Padilha, V. A., Alkhnbashi, O. S., Shah, S. A., de Carvalho, A. C. P. L. F. & Backofen, R. CRISPRcasIdentifier: machine learning for accurate identification and classification of CRISPR-Cas systems. Gigascience 9, giaa062 (2020).
Article PubMed PubMed Central CAS Google Scholar
Makarova, K. S. et al. An updated evolutionary classification of CRISPR-Cas systems. Nat. Rev. Microbiol. 13, 722–736 (2015).
Article CAS PubMed PubMed Central Google Scholar
Koonin, E. V., Makarova, K. S. & Zhang, F. Diversity, classification and evolution of CRISPR-Cas systems. Curr. Opin. Microbiol. 37, 67–78 (2017).
Article CAS PubMed PubMed Central Google Scholar
Nethery, M. A. et al. CRISPRclassify: repeat-based classification of CRISPR loci. CRISPR J. 4, 558–574 (2021).
Article CAS PubMed PubMed Central Google Scholar
Hyatt, D. et al. Prodigal: prokaryotic gene recognition and translation initiation site identification. BMC Bioinformatics 11, 119 (2010).
Article PubMed PubMed Central CAS Google Scholar
Aramaki, T. et al. KofamKOALA: KEGG Ortholog assignment based on profile HMM and adaptive score threshold. Bioinformatics 36, 2251–2252 (2019).
Article PubMed Central CAS Google Scholar
El-Gebali, S. et al. The Pfam protein families database in 2019. Nucleic Acids Res. 47, D427–D432 (2019).
Article CAS PubMed Google Scholar
Grazziotin, A. L., Koonin, E. V. & Kristensen, D. M. Prokaryotic virus orthologous groups (pVOGs): a resource for comparative genomics and protein family annotation. Nucleic Acids Res. 45, D491–D498 (2017).
Article CAS PubMed Google Scholar
Eddy, S. R. Accelerated profile HMM searches. PLoS Comput. Biol. 7, e1002195 (2011).
Article CAS PubMed PubMed Central Google Scholar
Guo, J. et al. VirSorter2: a multi-classifier, expert-guided approach to detect diverse DNA and RNA viruses. Microbiome 9, 37 (2021).
Article PubMed PubMed Central Google Scholar
Camacho, C. et al. BLAST+: architecture and applications. BMC Bioinformatics 10, 421 (2009).
Article PubMed PubMed Central CAS Google Scholar
Buchfink, B., Xie, C. & Huson, D. H. Fast and sensitive protein alignment using DIAMOND. Nat. Methods 12, 59–60 (2014).
Article PubMed CAS Google Scholar
Schulz, F. et al. Giant virus diversity and host interactions through global metagenomics. Nature 578, 432–436 (2020).
Article CAS PubMed PubMed Central Google Scholar
Cantu, V. A. et al. PhANNs, a fast and accurate tool and web server to classify phage structural proteins. PloS Comput. Biol. 16, e1007845 (2020).
Article CAS PubMed PubMed Central Google Scholar
Zimmermann, L. et al. A completely reimplemented MPI bioinformatics toolkit with a new HHpred server at its core. J. Mol. Biol. 430, 2237–2243 (2018).
Article CAS PubMed Google Scholar
Grant, J. R. & Stothard, P. The CGView server: a comparative genomics tool for circular genomes. Nucleic Acids Res. 36, W181–W184 (2008).
Article CAS PubMed PubMed Central Google Scholar
Bin Jang, H. et al. Taxonomic assignment of uncultivated prokaryotic virus genomes is enabled by gene-sharing networks. Nat. Biotechnol. 37, 632–639 (2019).
Article CAS Google Scholar
Nepusz, T., Yu, H. & Paccanaro, A. Detecting overlapping protein complexes in protein-protein interaction networks. Nat. Methods 9, 471–472 (2012).
Article CAS PubMed PubMed Central Google Scholar
Enright, A. J., Van Dongen, S. & Ouzounis, C. A. An efficient algorithm for large-scale detection of protein families. Nucleic Acids Res. 30, 1575–1584 (2002).
Article CAS PubMed PubMed Central Google Scholar
Sayers, E. W. et al. Database resources of the National Center for Biotechnology Information. Nucleic Acids Res. 37, D5–D15 (2009).
Article CAS PubMed Google Scholar
Al-Shayeb, B. et al. Clades of huge phages from across Earth's ecosystems. Nature 578, 425–431 (2020).
Article CAS PubMed PubMed Central Google Scholar
Shannon, P. et al. Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res. 13, 2498–2504 (2003).
Article CAS PubMed PubMed Central Google Scholar
RStudio: Integrated Development Environment for R (RStudio Team, 2019).
R: A Language and Environment for Statistical Computing (R Core Team, 2020).
Rudis, B. & Gandy, D. waffle: create waffle chart visualizations in R (2016).
Yutin, N., Wolf, Y. I. & Koonin, E. V. Origin of giant viruses from smaller DNA viruses not from a fourth domain of cellular life. Virology 466-467, 38–52 (2014).
Article CAS PubMed Google Scholar
Paez-Espino, D. et al. IMG/VR: a database of cultured and uncultured DNA viruses and retroviruses. Nucleic Acids Res. 45, D457–D465 (2017).
CAS PubMed Google Scholar
Wu, F. et al. Unique mobile elements and scalable gene flow at the prokaryote–eukaryote boundary revealed by circularized Asgard archaea genomes. Nat. Microbiol. 7, 200–212 (2022).
Article CAS PubMed PubMed Central Google Scholar
Andersson, A. F. & Banfield, J. F. Virus population dynamics and acquired virus resistance in natural microbial communities. Science 320, 1047–1050 (2008).
Article CAS PubMed Google Scholar
De Anda, V. et al. Understanding the mechanisms behind the response to environmental perturbation in microbial mats: a metagenomic-network based approach. Front. Microbiol. 9, 2606 (2018).
Article PubMed PubMed Central Google Scholar
Zhang, R. et al. SpacePHARER: sensitive identification of phages from CRISPR spacers in prokaryotic hosts. Bioinformatics 37, 3364–3366 (2021).
Article CAS PubMed Central Google Scholar
Guglielmini, J., Woo, A. C., Krupovic, M., Forterre, P. & Gaia, M. Diversification of giant and large eukaryotic dsDNA viruses predated the origin of modern eukaryotes. Proc. Natl Acad. Sci. USA 116, 19585–19592 (2019).
Article CAS PubMed PubMed Central Google Scholar
Katoh, K. & Standley, D. M. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol. Biol. Evol. 30, 772–780 (2013).
Article CAS PubMed PubMed Central Google Scholar
Minh, B. Q. et al. IQ-TREE 2: new models and efficient methods for phylogenetic inference in the genomic era. Mol. Biol. Evol. 37, 1530–1534 (2020).
Article CAS PubMed PubMed Central Google Scholar
Kalyaanamoorthy, S., Minh, B. Q., Wong, T. K. F., von Haeseler, A. & Jermiin, L. S. ModelFinder: fast model selection for accurate phylogenetic estimates. Nat. Methods 14, 587–589 (2017).
Article CAS PubMed PubMed Central Google Scholar
Edgar, R. C. MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 32, 1792–1797 (2004).
Article CAS PubMed PubMed Central Google Scholar
Letunic, I. & Bork, P. Interactive Tree Of Life (iTOL) v5: an online tool for phylogenetic tree display and annotation. Nucleic Acids Res. 49, W293–W296 (2021).
Article CAS PubMed PubMed Central Google Scholar
Download references
This work was supported by grants from the Moore-Simons Project on the Origin of the Eukaryotic Cell (Simons Foundation grant 73592LPI; https://doi.org/10.46714/735925LPI; B.J.B.) and the Simons Foundation Early Career Award (687165, B.J.B.). We thank D. Tamarit and T. Ettema for discussions about this research; A. P. Teske (Department of Marine Sciences, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA) for providing the sediments from Guaymas Basin.
Marguerite V. Langwig
Present address: Department of Bacteriology and Department of Integrative Biology, University of Wisconsin-Madison, Madison, WI, USA
Department of Marine Science, The University of Texas at Austin, Port Aransas, TX, USA
Ian M. Rambo, Marguerite V. Langwig, Pedro Leão, Valerie De Anda & Brett J. Baker
Department of Integrative Biology, The University of Texas at Austin, Austin, TX, USA
Brett J. Baker
You can also search for this author in PubMed Google Scholar
You can also search for this author in PubMed Google Scholar
You can also search for this author in PubMed Google Scholar
You can also search for this author in PubMed Google Scholar
You can also search for this author in PubMed Google Scholar
V.D.A. and B.J.B. conceptualized the project. I.M.R., V.D.A. and M.V.L. curated the data. B.J.B. acquired funding. I.M.R., V.D.A. and P.L. conducted the investigations. I.M.R., V.D.A. and B.J.B. developed the methodology. B.J.B. and V.D.A. administered and supervised the project. B.J.B acquired resources. I.M.R., V.D.A. and P.L. created the visualizations. I.M.R., V.D.A., M.V.L. and B.J.B. wrote the original draft. I.M.R., V.D.A., M.V.L., P.L. and B.J.B. reviewed and edited the manuscript.
Correspondence to Brett J. Baker.
The authors declare no competing interests.
Nature Microbiology thanks Susanne Erdmann, Hiroyuki Ogata and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.
Publisher's note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
a, Genomic architecture of the complete Helarchaeota virus Nidhogg Meg22_1012. From outside to center: genes described in the main text, genes with homologs not described in the main text, hypothetical proteins, GC content, genome size ruler. Arrows pointing left indicate (−) sense, while those pointing right indicate (+) sense. b, Structures of the linear Fenrir, Sköll, and Ratatoskr genomes. Sequence length is designated by the measure on the x-axis. Genes with hypothetical products do not have labels and are colored in gray. CRISPR spacer match locations are highlighted with vertical bars, colored to represent 0 or 1 mismatches in the alignment.
Median value (Asgard virus = 89,108 / Archaea virus = 35,450); Minimum value (Asgard virus = 39,909 / Archaea virus = 5,278); maximum value (Asgard virus = 117,419 / Archaea virus = 103,257); Archaea virus outlier = 143,855. Data and code for this figure is available at https://github.com/bakermicrolab/asgardviruses.
The y-axis denotes average read depth for a contig within its respective sample, with each Asgard host shown on the x-axis. Each point on the x-axis contains two box-and-whisker plots indicating average read depths for linked viral contigs (left) and host contigs (right). Data points represent mean values. Taxonomic assignment is designated by the color of the points and/or box. Average read depth was calculated for each contig using reads from the same sample used in assembly. Helarchaeota Meg19_1012_Bin_504 (virus: n = 3, min=25.52, max=173.4, mid=32.50; MAG: n = 170, min=9.12, mean=56.58, max=81.45, SD = 5.8, 1st quartile=54.9, 3rd quartile=58.9), Lokiarchaeota Meg22_1012_Bin_233 (virus: n = 1, 77.5; MAG: n = 94, min=6.16, mean=37.3, max=202.9, SD = 24.8, 1st quartile=31.5, 3rd quartile=34.3), Lokiarchaeota Meg22_1214_Bin_191 (virus: n = 1, 17.05; MAG: n = 246, min=8.1, mean=44.8, max=1253, SD = 95.6, 1st quartile=29.7, 3rd quartile=39.7), Lokiarchaeota Meg22_1416_Bin_151 (virus: n = 1, 16.32; MAG: n = 217, min=5.9, mean=17, max=48, SD = 3.7, 1st quartile=15.7, 3rd quartile=18.5).
Viruses are grouped on the x-axis based on their host, with the NCLDVs included in their own category. The y-axis denotes the percentage of genes present with hits to NCVOGs (see Methods). Each dot in the graph represents a viral genome. Bacterial viruses n = 33442, mean=2, SD = ± 1.2; Archaeal viruses n = 84, mean=2.4’ SD = ± 1.4; Asgard virus n = 6, mean=2.2, SD = ± 1.4; Eukaryotic virus n = 362, mean=36, SD = ± 39; NCLDV n = 149, mean=78, SD = 21.
Viruses are grouped on the y-axis based on their host, with percentages on the x-axis indicating the proportion of NCVOG hits assigned to a particular function. NCVOGs found in Asgard viruses are related to DNA replication, recombination and repair or viral structure proteins. The first group of NCVOGs are commonly found in viruses infecting bacteria, and can also be observed in viruses infecting other archaeal groups, and Eukaryotes. The second group of NCVOGs are not so commonly found in other viruses, which can suggest a similar structure of Asgard viruses and NCLDVs.
A phylogenetic tree of 241 deoxynucleotide/side monophosphate kinase sequences from viruses and bacteria. Circles on branches indicate BOOSTER supports ≥70. Lokiarchaeota virus Fenrir Meg22_1012 and Meg22_1214 sequences are highlighted in gold. The phylogeny was inferred using the LG model with fixed base frequencies and 1000 rapid bootstraps.
A phylogenetic tree of 368 ubiquitin-activating enzyme (E1) protein sequences from archaea, bacteria, eukaryotes, and viruses (taxa are labeled with background colors). Three E1-like protein sequences were identified in Nidhogg viruses, and these are labeled with black circles and bold text. Arched lines show the connections between Nidhogg virus sequences and their Helarchaeota host. This phylogeny was inferred using the LG + R8 model with 1000 ultrafast bootstraps and optimization by nearest neighbor interchange (-bb 1000 -bnni). Circles on tree branches indicate ultrafast bootstrap supports ≥95. The tree is comprised of protein sequences belonging to the NEDD8-activating enzyme E1 catalytic subunit family (n = 11, IPR030468), ubiquitin-activating E1 enzyme (n = 218, IPR035985), viral sequences obtained from NCBI (n = 14), and sequences derived from Lokiarchaeota and Helarchaeota (n = 125).
Extended Data Figs. 1–7, Supplementary Text and description of Supplementary Data 1–13.
CRISPRDetect results, including spacer and repeat lengths and sequences, and CRISPR sense; Asgard CRISPR spacer BLASTN-short output against Guaymas Basin viruses; average read depth of CRISPR-containing contigs of Asgard MAGs; SpacePHARER hits of Asgard CRISPR spacers to Guaymas Basin UViGs; and CRISPRClassify results for Asgard CRISPR repeats.
Viral genome overview, Asgard MAG GTDBTk taxonomy and MAG statistics.
Minimum information about an uncultivated virus genome (MiUViG) metadata for viral genomes described in this study.
Viral annotations with VIBRANT, DIAMOND and InterProScan; PhANNs classification; and HHPred results for major capsid proteins predicted with PhANNs.
Supplementary_Data_5_Fenrir_Meg22_1012_226.pdf. Visualization of Lokiarchaeota virus Fenrir Meg22_1012_scaffold_226 coverage based on read mapping against the Meg22_1012 sample performed with BWA-MEM v0.7.17 and Samtools v1.11. Visualized with Geneious version 2022.0 by Biomatters. Supplementary_Data_6_Fenrir_Meg22_1214.pdf. Visualization of Lokiarchaeota virus Fenrir Meg22_1214_scaffold_313 coverage based on read mapping against the Meg22_1214 sample performed with BWA-MEM v0.7.17 and Samtools v1.11. Visualized with Geneious version 2022.0 by Biomatters. Supplementary_Data_7_Skoll_Meg22_1214_2849.pdf. Visualization of Lokiarchaeota virus Sköll Meg22_1214_scaffold_2849 coverage based on read mapping against the Meg22_1214 sample performed with BWA-MEM v0.7.17 and Samtools v1.11. Visualized with Geneious version 2022.0 by Biomatters. Supplementary_Data_8_Ratatoskr_Meg22_1012_548.pdf. Visualization of Helarchaeota virus Ratatoskr Meg22_1012_scaffold_548 coverage based on read mapping against the Meg22_1012 sample performed with BWA-MEM v0.7.17 and Samtools v1.11. Visualized with Geneious version 2022.0 by Biomatters. Supplementary_Data_9_Nidhogg_Meg22_1012_91.pdf. Visualization of Helarchaeota virus Nidhogg Meg22_1012_scaffold_91 coverage based on read mapping against the Meg22_1012 sample performed with BWA-MEM v0.7.17 and Samtools v1.11. Visualized with Geneious version 2022.0 by Biomatters. Supplementary_Data_10_Nidhogg_Meg22_1214_152.pdf. Visualization of Helarchaeota virus Nidhogg Meg22_1214_scaffold_152 coverage based on read mapping against the Meg22_1214 sample performed with BWA-MEM v0.7.17 and Samtools v1.11. Visualized with Geneious version 2022.0 by Biomatters.
Sequences used in the DNA polymerase B phylogeny.
Viral protein family (VPF) classification membership ratios for Asgard viruses.
InterProScan annotations of Asgard MAGs first detailed in this study and IMG/M annotations of all MAGs used in this study.
Reprints and Permissions
Rambo, I.M., Langwig, M.V., Leão, P. et al. Genomes of six viruses that infect Asgard archaea from deep-sea sediments. Nat Microbiol 7, 953–961 (2022). https://doi.org/10.1038/s41564-022-01150-8
Download citation
Received: 08 October 2021
Accepted: 16 May 2022
Published: 27 June 2022
Issue Date: July 2022
DOI: https://doi.org/10.1038/s41564-022-01150-8
Anyone you share the following link with will be able to read this content:
Sorry, a shareable link is not currently available for this article.
Provided by the Springer Nature SharedIt content-sharing initiative