Phylogenomics for Systematic Biology.

نویسنده

  • David Posada
چکیده

Next-generation sequencing (NGS) techniques have deeply impacted multiple research areas in biology. In molecular systematics, NGS has boosted the field from being based on a few loci—phylogenetics— to the use of hundreds or thousands of loci— phylogenomics. However, while massive multilocus data sets should facilitate the resolution of complex phylogenetic problems, more data is not a panacea (Delsuc et al. 2005; Jeffroy et al. 2006; Roger and Hug 2006). Although large data sets will reduce sampling error, in the presence of systematic biases they could also lead to wrong answers with strong statistical support (Phillips et al. 2004; Nishihara et al. 2007; RodriguezEzpeleta et al. 2007; Kumar et al. 2012). At the same time, the wealth of data resulting from NGS has forced us to stop ignoring phylogenetic incongruence (Jeffroy et al. 2006; Galtier and Daubin 2008; Salichos and Rokas 2013) and to reconsider the difference between gene trees and species trees (Goodman et al. 1979), to the point that we are witnessing a methodological and conceptual shift (Degnan and Rosenberg 2009; Edwards 2009; Knowles 2009). Hence, phylogenomic analysis not only implies new technical capabilities, but also comes with an explicit recognition of processes like incomplete lineage sorting (ILS), gene duplication and loss (GDL), andhorizontal transfer (HGT) (Maddison 1997; Page and Charleston 1997; Slowinski and Page 1999). Indeed, the phylogenomic pipeline can be very complex, involving multiple challenges concerning the acquisition, manipulation, analysis, and interpretation of massive data sets, including the design of appropriate sequencing strategies, the identification of homologous/orthologous loci, model partitioning among multiple loci and gene/species tree reconstruction. And there are still important open questions regarding the best strategies for all these steps. With the idea of learning about common problems and potential solutions for some of these questions I organized a symposium entitled “Current Advances and Challenges in Practical Phylogenomics” at the 2013 Evolution meeting in Snowbird, Utah (USA) under the auspice of the Society of Systematic Biologists. The word “practical” in the title reflected my intention to push the speakers to tackle on stage some of the real hurdles phylogeneticists face in their daily life when analyzing genome-wide data. My hope was that the public would leave Snowbird’s Ballroom 2 that day with some ideas that would change to some extent the way they construct and/or analyze their phylogenomic data sets. The symposium included six talks that embraced different aspects of the phylogenomic endeavor, from data acquisition to data analysis. Indeed, not all phylogenomic problems were addressed. The speakers formed a diverse group of people encompassing different orientations (biology, computer science, statistics), at distinct career stages (from graduate students to professors), from various parts of the world and representing a mix of genders. The first two talkswere related todifferent strategies forgathering phylogenomic data and their implications. Alan Lemmon (Florida State University, USA—“Anchored phylogenomics and the power of hybrid enrichment data for phylogenetics”) broke the ice describing his methodology for the efficient acquisition of genomewide loci across multiple species, contrasting it with similar approaches like ultraconserved elements (e.g. Faircloth et al. 2012; Gilbert et al. 2015) and exon capture (e.g. Bi et al. 2012; Bragg et al. 2015; Manthey et al. 2016). Next, Mike Harvey (Louisiana State University, USA—“SNPs versus sequences for phylogeography – an exploration using simulations and massively parallel sequencing in a non-model bird”) compared the demographic inferences obtained from the same individuals using single nucleotide polymorphisms (SNPs) or a genotyping-by-sequencing approach (Elshire et al. 2011). Butphylogenomicmatrices are often incomplete, and in the third talk, Lacey Knowles (University of Michigan, USA—“What to do with missing data in next-generation sequences? Unforeseen sampling effects on species-tree analyses”) characterized the effect of missing data on the estimated species relationships The second half of the symposium focused on novel methods for the estimation of species trees from genome-wide data. Leonardo de Oliveira Martins (University of Vigo, Spain—“A probabilistic parsimonious model for species tree reconstruction”) presented a Bayesian method for the reconstruction of species trees able to deal (nonparametrically) with ILS, GDL, andHGT.After that, TandyWarnow (University of Texas atAustin, “Naive Binning Improves Phylogenomic Analysis”) explained a new approach to species tree

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Amniote phylogenomics: testing evolutionary hypotheses with BAC library scanning and targeted clone analysis of large-scale DNA sequences from reptiles.

Phylogenomics research integrating established principles of systematic biology and taking advantage of the wealth of DNA sequences being generated by genome science holds promise for answering long-standing evolutionary questions with orders of magnitude more primary data than in the past. Although it is unrealistic to expect whole-genome initiatives to proceed rapidly for commercially unimpor...

متن کامل

Exploring frontiers in the DNA landscape: an introduction to the symposium "Genome Analysis and the Molecular Systematics of Retroelements".

The emerging field of phylogenomics is influencing both the amount and type of characters being brought to bear on long-standing problems in systematic biology. Moreover, the proliferation of sequence information from genome projects in concert with the development of new informatics tools is widening access to comparative data on retroelements to a broad cross section of investigators. Motivat...

متن کامل

Comparative genomic data of the Avian Phylogenomics Project

BACKGROUND The evolutionary relationships of modern birds are among the most challenging to understand in systematic biology and have been debated for centuries. To address this challenge, we assembled or collected the genomes of 48 avian species spanning most orders of birds, including all Neognathae and two of the five Palaeognathae orders, and used the genomes to construct a genome-scale avi...

متن کامل

INSECT PHYLOGENOMICS. Comment on "Phylogenomics resolves the timing and pattern of insect evolution".

Misof et al. (Reports, 7 November 2014, p. 763) used a genome-scale data set to estimate the relationships among insect orders and the time scale of their evolution. Here, we reanalyze their data and show that their method has led to systematic underestimation of the evolutionary time scale. We find that key insect groups evolved up to 100 million years earlier than inferred in their study.

متن کامل

Phylogenomic inference of protein molecular function: advances and challenges

MOTIVATION Protein families evolve a multiplicity of functions through gene duplication, speciation and other processes. As a number of studies have shown, standard methods of protein function prediction produce systematic errors on these data. Phylogenomic analysis--combining phylogenetic tree construction, integration of experimental data and differentiation of orthologs and paralogs--has bee...

متن کامل

Statistics and truth in phylogenomics.

Phylogenomics refers to the inference of historical relationships among species using genome-scale sequence data and to the use of phylogenetic analysis to infer protein function in multigene families. With rapidly decreasing sequencing costs, phylogenomics is becoming synonymous with evolutionary analysis of genome-scale and taxonomically densely sampled data sets. In phylogenetic inference ap...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Systematic biology

دوره 65 3  شماره 

صفحات  -

تاریخ انتشار 2016