Endogenous viral element

An endogenous viral element (EVE) is a DNA sequence derived from a virus, and present within the germline of a non-viral organism. EVEs may be entire viral genomes (proviruses), or fragments of viral genomes. They arise when a viral DNA sequence becomes integrated into the genome of a germ cell that goes on to produce a viable organism. The newly established EVE can be inherited from one generation to the next as an allele in the host species, and may even reach fixation.

Endogenous retroviruses and other EVEs that occur as proviruses can potentially remain capable of producing infectious virus in their endogenous state. Replication of such 'active' endogenous viruses can lead to the proliferation of viral insertions in the germline. For most non-retroviral viruses, germline integration appears to be a rare, anomalous event, and the resulting EVEs are often only fragments of the parent virus genome. Such fragments are usually not capable of producing infectious virus, but may express protein or RNA and even cell surface receptors.

Diversity and distribution

edit

EVEs have been identified in animals, plants and fungi.[1][2][3][4] In vertebrates EVEs derived from retroviruses (endogenous retroviruses) are relatively common. Because retroviruses integrate into the nuclear genome of the host cell as an inherent part of their replication cycle, they are predisposed to enter the host germline. In addition, EVEs related to parvoviruses, filoviruses, bornaviruses and circoviruses have been identified in vertebrate genomes. In plant genomes, EVEs derived from pararetroviruses are relatively common. EVEs derived from other, non-retrotranscribing virus families, such as Geminiviridae, have also been identified in plants. Moreover, EVEs related to giant viruses (aka GEVEs) of phylum Nucleocytoviricota (NCLDV) similar to Aureococcus anophagefferens virus (AaV) have been found in 2019/2020.[5]

Identification

edit

EVEs are traditionally identified by similarity to known viruses. In 2021, it has been demonstrated that the k-mer composition of endogenous RNA virus resemble that of their exogenous counterparts. As a result, it is now possible to identify novel groups of endogenous RNA viruses whose exogenous relatives have become extinct.[6]

Use in paleovirology

edit

EVEs are a rare source of retrospective information about ancient viruses. Many are derived from germline integration events that occurred millions of years ago, and can be viewed as viral fossils. Such ancient EVEs are an important component of paleovirological studies that address the long-term evolution of viruses. Identification of orthologous EVE insertions enables the calibration of long-term evolutionary timelines for viruses, based on the estimated time since divergence of the ortholog-containing host species groups. This approach has provided minimum ages ranging from 30 to 93 million years for the Parvoviridae, Filoviridae, Bornaviridae and Circoviridae families of viruses,[3] >100 million years in the Flaviviridae,[7] and 12 million years for the Lentivirus genus of the Retroviridae family. EVEs also facilitate the use of molecular clock-based approaches to obtain calibrations of viral evolution in deep time.[8][9]

Co-option and exaptation by host species

edit

EVEs can sometimes provide a selective advantage to the individuals in which they are inserted. For example, some protect against infection with related viruses.[10][11] In some mammal groups, including higher primates, retroviral envelope proteins have been exapted to produce a protein that is expressed in the placental syncytiotrophoblast, and is involved in fusion of the cytotrophoblast cells to form the syncytial layer of the placenta. In humans this protein is called syncytin, and is encoded by an endogenous retrovirus called (ERVWE1) on chromosome seven. Remarkably, the capture of syncytin or syncytin-like genes has occurred independently, from different groups of endogenous retroviruses, in diverse mammalian lineages. Distinct, syncytin-like genes have been identified in primates, rodents, lagomorphs, carnivores, and ungulates, with integration dates ranging from 10 to 85 million years ago.[12]

See also

edit

References

edit
  1. ^ Taylor DJ, Bruenn J (December 2009). "The evolution of novel fungal genes from non-retroviral RNA viruses". BMC Biology. 7: 88. doi:10.1186/1741-7007-7-88. PMC 2805616. PMID 20021636.
  2. ^ Koonin EV (January 2010). "Taming of the shrewd: novel eukaryotic genes from RNA viruses". BMC Biology. 8: 2. doi:10.1186/1741-7007-8-2. PMC 2823675. PMID 20067611.
  3. ^ a b Katzourakis A, Gifford RJ (November 2010). "Endogenous viral elements in animal genomes". PLOS Genetics. 6 (11): e1001191. doi:10.1371/journal.pgen.1001191. PMC 2987831. PMID 21124940.
  4. ^ Feschotte C, Gilbert C (March 2012). "Endogenous viruses: insights into viral evolution and impact on host biology" (PDF). Nature Reviews. Genetics. 13 (4): 283–296. doi:10.1038/nrg3199. PMID 22421730. S2CID 205485232.
  5. ^ Moniruzzaman M, Weinheimer AR, Martinez-Gutierrez CA, Aylward FO (December 2020). "Widespread endogenization of giant viruses shapes genomes of green algae". Nature. 588 (7836): 141–145. Bibcode:2020Natur.588..141M. doi:10.1038/s41586-020-2924-2. PMID 33208937. S2CID 227065267.
  6. ^ Kojima S, Yoshikawa K, Ito J, Nakagawa S, Parrish NF, Horie M, et al. (February 2021). "Virus-like insertions with sequence signatures similar to those of endogenous nonretroviral RNA viruses in the human genome". Proceedings of the National Academy of Sciences of the United States of America. 118 (5): e2010758118. Bibcode:2021PNAS..11810758K. doi:10.1073/pnas.2010758118. PMC 7865133. PMID 33495343.
  7. ^ Bamford CG, de Souza WM, Parry R, Gifford RJ (2022). "Comparative analysis of genome-encoded viral sequences reveals the evolutionary history of flavivirids (family Flaviviridae)". Virus Evolution. 8 (2): veac085. doi:10.1093/ve/veac085. PMC 9752770. PMID 36533146.
  8. ^ Katzourakis A, Tristem M, Pybus OG, Gifford RJ (April 2007). "Discovery and analysis of the first endogenous lentivirus". Proceedings of the National Academy of Sciences of the United States of America. 104 (15): 6261–6265. doi:10.1073/pnas.0700471104. PMC 1851024. PMID 17384150.
  9. ^ Gilbert C, Feschotte C (September 2010). "Genomic fossils calibrate the long-term evolution of hepadnaviruses". PLOS Biology. 8 (9): e100049. doi:10.1371/journal.pbio.1000495. PMC 2946954. PMID 20927357.
  10. ^ Best S, Le Tissier P, Towers G, Stoye JP (August 1996). "Positional cloning of the mouse retrovirus restriction gene Fv1". Nature. 382 (6594): 826–829. Bibcode:1996Natur.382..826B. doi:10.1038/382826a0. PMID 8752279. S2CID 1883507.
  11. ^ Arnaud F, Varela M, Spencer TE, Palmarini M (November 2008). "Coevolution of endogenous betaretroviruses of sheep and their host". Cellular and Molecular Life Sciences. 65 (21): 3422–3432. doi:10.1007/s00018-008-8500-9. PMC 4207369. PMID 18818869.
  12. ^ Dupressoir A, Lavialle C, Heidmann T (September 2012). "From ancestral infectious retroviruses to bona fide cellular genes: role of the captured syncytins in placentation". Placenta. 33 (9): 663–671. doi:10.1016/j.placenta.2012.05.005. PMID 22695103.