[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/3107411.3107432acmconferencesArticle/Chapter ViewAbstractPublication PagesbcbConference Proceedingsconference-collections
research-article

A Compatibility Approach to Identify Recombination Breakpoints in Bacterial and Viral Genomes

Published: 20 August 2017 Publication History

Abstract

Recombination is an evolutionary force that results in mosaic genomes for microorganisms. The evolutionary history of microorganisms cannot be properly inferred if recombination has occurred among a set of taxa. That is, polymorphic sites of a multiple sequence alignment cannot be described by a single phylogenetic tree. Thus, detecting the presence of recombination is crucial before phylogeny inference. The phylogenetic-based methods are commonly utilized to explore recombination, however, the compatibility-based methods are more computationally efficient since the phylogeny construction is not required. We propose a novel approach focusing on the pairwise compatibility of polymorphic sites of given regions to characterize potential breakpoints in recombinant bacterial and viral genomes. The performance of average compatibility ratio (ACR) approach is evaluated on simulated alignments of different scenarios comparing with two programs, GARD and RDP4. Three empirical datasets of varying genome sizes with varying levels of homoplasy are also utilized for testing. The results demonstrate that our approach is able to detect the presence of recombination and identify the recombinant breakpoints efficiently, which provides a better understanding of distinct phylogenies among mosaic sequences.

References

[1]
John Archer, John W Pinney, Jun Fan, Etienne Simon-Loriere, Eric J Arts, Matteo Negroni, and David L Robertson 2008. Identifying the important HIV-1 recombination breakpoints. PLoS Comput Biol, Vol. 4, 9 (2008), e1000178.
[2]
C Arnold 2007. Molecular evolution of Mycobacterium tuberculosis. Clinical microbiology and infection Vol. 13, 2 (2007), 120--128.
[3]
Trevor C Bruen, Hervé Philippe, and David Bryant. 2006. A simple and robust statistical test for detecting the presence of recombination. Genetics, Vol. 172, 4 (2006), 2665--2681.
[4]
Josephine Bryant, Claire Chewapreecha, and Stephen D Bentley. 2012. Developing insights into the mechanisms of evolution of bacterial pathogens from whole-genome sequences. Future microbiology, Vol. 7, 11 (2012), 1283--1296.
[5]
Cheong Xin Chan, Robert G Beiko, and Mark A Ragan. 2006. Detecting recombination in evolving nucleotide sequences. BMC bioinformatics, Vol. 7, 1 (2006), 412.
[6]
Nicholas J Croucher, Simon R Harris, Christophe Fraser, Michael A Quail, John Burton, Mark van der Linden, Lesley McGee, Anne von Gottberg, Jae Hoon Song, Kwan Soo Ko, and others. 2011. Rapid pneumococcal evolution in response to clinical interventions. science, Vol. 331, 6016 (2011), 430--434.
[7]
Xavier Didelot, Rory Bowden, Teresa Street, Tanya Golubchik, Chris Spencer, Gil McVean, Vartul Sangal, Muna F Anjum, Mark Achtman, Daniel Falush, and others 2011. Recombination and population structure in Salmonella enterica. PLoS Genet, Vol. 7, 7 (2011), e1002191.
[8]
Xavier Didelot and Daniel J Wilson 2015. ClonalFrameML: efficient inference of recombination in whole bacterial genomes. PLoS Comput Biol, Vol. 11, 2 (2015), e1004041.
[9]
Elizabeth M Driebe, Jason W Sahl, Chandler Roe, Jolene R Bowers, James M Schupp, John D Gillece, Erin Kelley, Lance B Price, Talima R Pearson, Crystal M Hepp, and others. 2015. Using whole genome analysis to examine recombination across diverse sequence types of Staphylococcus aureus. PloS one, Vol. 10, 7 (2015), e0130955.
[10]
George F Estabrook, CS Johnson, and FR McMorris. 1976. A mathematical foundation for the analysis of cladistic character compatibility. Mathematical Biosciences Vol. 29, 1--2 (1976), 181--187.
[11]
George F Estabrook and FR McMorris 1980. When is one estimate of evolutionary relationships a refinement of another? Journal of Mathematical Biology Vol. 10, 4 (1980), 367--373.
[12]
Graham J Etherington, Jo Dicks, and Ian N Roberts. 2004. Recombination Analysis Tool (RAT): a program for the high-throughput detection of recombination. Bioinformatics, Vol. 21, 3 (2004), 278--281.
[13]
Richard G Everitt, Xavier Didelot, Elizabeth M Batty, Ruth R Miller, Kyle Knox, Bernadette C Young, Rory Bowden, Adam Auton, Antonina Votintseva, Hanna Larner-Svensson, and others. 2014. Mobile elements drive recombination hotspots in the core genome of Staphylococcus aureus. Nature communications Vol. 5 (2014).
[14]
Nicholas C Grassly and Edward C Holmes 1997. A likelihood method for the detection of selection and recombination using nucleotide sequences. Molecular Biology and Evolution Vol. 14, 3 (1997), 239--247.
[15]
Michaela M Gutacker, James C Smoot, Cristi A Lux Migliaccio, Stacy M Ricklefs, Su Hua, Debby V Cousins, Edward A Graviss, Elena Shashkina, Barry N Kreiswirth, and James M Musser 2002. Genome-wide analysis of synonymous single nucleotide polymorphisms in Mycobacterium tuberculosis complex organisms: resolution of genetic relationships among closely related microbial strains. Genetics, Vol. 162, 4 (2002), 1533--1543.
[16]
Ingrid B Jakobsen and Simon Easteal 1996. A program for calculating and displaying compatibility matrices as an aid in determining reticulate evolution in molecular sequences. Computer applications in the biosciences: CABIOS, Vol. 12, 4 (1996), 291--295.
[17]
BG Kelly, A Vespermann, and DJ Bolton 2009. The role of horizontal gene transfer in the evolution of selected foodborne bacterial pathogens. Food and Chemical Toxicology Vol. 47, 5 (2009), 951--968.
[18]
Elzbieta Krzywinska, Jaroslaw Krzywinski, and Jeffrey S Schorey 2004. Naturally occurring horizontal gene transfer and homologous recombination in Mycobacterium. Microbiology, Vol. 150, 6 (2004), 1707--1712.
[19]
Fedora Lanzas, Petros C Karakousis, James C Sacchettini, and Thomas R Ioerger 2013. Multidrug-resistant tuberculosis in panama is driven by clonal expansion of a multidrug-resistant Mycobacterium tuberculosis strain related to the KZN extensively drug-resistant M. tuberculosis strain from South Africa. Journal of clinical microbiology Vol. 51, 10 (2013), 3277--3285.
[20]
Walter J Le Quesne. 1969. A method of selection of characters in numerical taxonomy. Systematic Biology, Vol. 18, 2 (1969), 201--205.
[21]
Walter J LeQuesne. 1972. Further studies based on the uniquely derived character concept. Systematic Biology, Vol. 21, 3 (1972), 281--288.
[22]
Darren Martin and Ed Rybicki 2000. RDP: detection of recombination amongst aligned sequences. Bioinformatics, Vol. 16, 6 (2000), 562--563.
[23]
Darren P Martin, Ben Murrell, Michael Golden, Arjun Khoosal, and Brejnev Muhire 2015. RDP4: Detection and analysis of recombination patterns in virus genomes. Virus Evolution, Vol. 1, 1 (2015), vev003.
[24]
Christopher A Meacham and George F Estabrook 1985. Compatibility methods in systematics. Annual Review of Ecology and Systematics Vol. 16, 1 (1985), 431--446.
[25]
Adewunmi Onafuwa-Nuga and Alice Telesnitsky 2009. The remarkable frequency of human immunodeficiency virus type 1 genetic recombination. Microbiology and Molecular Biology Reviews Vol. 73, 3 (2009), 451--480.
[26]
Talima Pearson, Joseph D Busch, Jacques Ravel, Timothy D Read, Shane D Rhoton, Jana M U'ren, Tatum S Simonson, Sergey M Kachur, Rebecca R Leadem, Michelle L Cardon, and others 2004. Phylogenetic discovery bias in Bacillus anthracis using single-nucleotide polymorphisms from whole-genome sequencing. Proceedings of the National Academy of Sciences of the United States of America, Vol. 101, 37 (2004), 13536--13541.
[27]
Hervé Philippe, Henner Brinkmann, Dennis V Lavrov, D Timothy J Littlewood, Michael Manuel, Gert Wörheide, and Denis Baurain. 2011. Resolving difficult phylogenetic questions: why more sequences are not enough. PLoS Biol, Vol. 9, 3 (2011), e1000602.
[28]
Paul J Planet, Apurva Narechania, Liang Chen, Barun Mathema, Sam Boundy, Gordon Archer, and Barry Kreiswirth 2016. Architecture of a Species: Phylogenomics of Staphylococcus aureus. Trends in Microbiology (2016).
[29]
DOTREE Plotree and DOTGRAM Plotgram 1989. PHYLIP-phylogeny inference package (version 3.2). cladistics, Vol. 5, 163 (1989), 6.
[30]
Sergei L Kosakovsky Pond, David Posada, Michael B Gravenor, Christopher H Woelk, and Simon DW Frost 2006natexlaba. Automated phylogenetic detection of recombination using a genetic algorithm. Molecular biology and evolution Vol. 23, 10 (2006), 1891--1901.
[31]
Sergei L Kosakovsky Pond, David Posada, Michael B Gravenor, Christopher H Woelk, and Simon DW Frost 2006natexlabb. GARD: a genetic algorithm for recombination detection. Bioinformatics, Vol. 22, 24 (2006), 3096--3098.
[32]
Sergei L Kosakovsky Pond, David Posada, Eric Stawiski, Colombe Chappey, Art FY Poon, Gareth Hughes, Esther Fearnhill, Mike B Gravenor, Andrew J Leigh Brown, and Simon DW Frost 2009. An evolutionary model-based algorithm for accurate phylogenetic breakpoint mapping and subtype prediction in HIV-1. PLoS Comput Biol, Vol. 5, 11 (2009), e1000581.
[33]
David Posada. 2002. Evaluation of methods for detecting recombination from DNA sequences: empirical data. Molecular biology and evolution Vol. 19, 5 (2002), 708--717.
[34]
David Posada and Keith A Crandall 2001. Evaluation of methods for detecting recombination from DNA sequences: computer simulations. Proceedings of the National Academy of Sciences, Vol. 98, 24 (2001), 13757--13762.
[35]
David Posada and Keith A Crandall 2002. The effect of recombination on the accuracy of phylogeny estimation. Journal of molecular evolution Vol. 54, 3 (2002), 396--402.
[36]
MIKA O SALMINEN, JEAN K CARR, DONALD S BURKE, and FRANCINE E McCUTCHAN 1995. Identification of breakpoints in intergenotypic recombinants of HIV type 1 by bootscanning. AIDS research and human retroviruses Vol. 11, 11 (1995), 1423--1425.
[37]
David Sankoff. 1975. Minimal mutation trees of sequences. SIAM J. Appl. Math. Vol. 28, 1 (1975), 35--42.
[38]
Célio D Santos-Júnior, António Veríssimo, and Joana Costa 2016. The recombination dynamics of Staphylococcus aureus inferred from spA gene. BMC microbiology, Vol. 16, 1 (2016), 143.
[39]
SA Sawyer. 1999. GENECONV: A computer package for the statistical detection of gene conversion. Distributed by the author, Department of Mathematics, Washington University in St. Louis (1999).
[40]
Mikkel H Schierup and Jotun Hein 2000. Consequences of recombination on traditional phylogenetic analysis. Genetics, Vol. 156, 2 (2000), 879--891.
[41]
Robert W Scotland and Mike Steel 2015. Circumstances in which parsimony but not compatibility will be provably misleading. Systematic biology, Vol. 64, 3 (2015), 492--504.
[42]
Binshan Shi, Christina Kitchen, Barbara Weiser, Douglas Mayers, Brian Foley, Kimdar Kemal, Kathryn Anastos, Marc Suchard, Monica Parker, Cheryl Brunner, and others 2010. Evolution and recombination of genes encoding HIV-1 drug resistance and tropism during antiretroviral therapy. Virology, Vol. 404, 1 (2010), 5--20.
[43]
John Maynard Smith. 1992. Analyzing the mosaic structure of genes. Journal of molecular evolution Vol. 34, 2 (1992), 126--129.
[44]
Christopher M Thomas and Kaare M Nielsen 2005. Mechanisms of, and barriers to, horizontal gene transfer between bacteria. Nature reviews microbiology Vol. 3, 9 (2005), 711--721.
[45]
David B Wake. 1991. Homoplasy: the result of natural selection, or evidence of design limitations? The American Naturalist Vol. 138, 3 (1991), 543--567.
[46]
Timothy M Walker, Camilla LC Ip, Ruth H Harrell, Jason T Evans, Georgia Kapatai, Martin J Dedicoat, David W Eyre, Daniel J Wilson, Peter M Hawkey, Derrick W Crook, and others. 2013. Whole-genome sequencing to delineate Mycobacterium tuberculosis outbreaks: a retrospective observational study. The Lancet infectious diseases Vol. 13, 2 (2013), 137--146.
[47]
Georg F Weiller. 1998. Phylogenetic profiles: a graphical method for detecting genetic recombinations in homologous sequences. Molecular Biology and Evolution Vol. 15, 3 (1998), 326--335.
[48]
Michael Worobey. 2001. A novel approach to detecting and measuring recombination: new insights into evolution in viruses, bacteria, and mitochondria. Molecular Biology and Evolution Vol. 18, 8 (2001), 1425--1434.

Index Terms

  1. A Compatibility Approach to Identify Recombination Breakpoints in Bacterial and Viral Genomes

      Recommendations

      Comments

      Please enable JavaScript to view thecomments powered by Disqus.

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      ACM-BCB '17: Proceedings of the 8th ACM International Conference on Bioinformatics, Computational Biology,and Health Informatics
      August 2017
      800 pages
      ISBN:9781450347228
      DOI:10.1145/3107411
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 20 August 2017

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. SNPS
      2. bacterial genomes
      3. compatibility
      4. multiple sequence alignments
      5. phylogeny
      6. recombination
      7. viral genomes

      Qualifiers

      • Research-article

      Conference

      BCB '17
      Sponsor:

      Acceptance Rates

      ACM-BCB '17 Paper Acceptance Rate 42 of 132 submissions, 32%;
      Overall Acceptance Rate 254 of 885 submissions, 29%

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • 0
        Total Citations
      • 230
        Total Downloads
      • Downloads (Last 12 months)1
      • Downloads (Last 6 weeks)0
      Reflects downloads up to 19 Dec 2024

      Other Metrics

      Citations

      View Options

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media