Whole-proteome phylogeny of large dsDNA virus families by an alignment-free method.

Authors

  • Guohong Albert Wu
  • Se-Ran Jun
  • Gregory E Sims
  • Sung-Hou Kim
Abstract

The vast sequence divergence among different virus groups has presented a great challenge to alignment-based sequence comparison among different virus families. Using an alignment-free comparison method, we construct the whole-proteome phylogeny for a population of viruses from 11 viral families comprising 142 large dsDNA eukaryote viruses. The method is based on the feature frequency profiles (FFP), where the length of the feature (l-mer) is selected to be optimal for phylogenomic inference. We observe that (i) the FFP phylogeny segregates the population into clades whose membership agrees remarkably well with the current classification by the International Committee on the Taxonomy of Viruses, with the one exception that the mimivirus joins the phycodnavirus family; (ii) the FFP tree detects potential evolutionary relationships among some viral families; (iii) the relative position of the 3 herpesvirus subfamilies in the FFP tree differs from gene alignment-based analysis; (iv) the FFP tree suggests the taxonomic positions of certain "unclassified" viruses; and (v) the FFP method identifies candidates for horizontal gene transfer between virus families.
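
As a rough illustration of what a feature frequency profile is, the sketch below counts overlapping l-mers across a set of protein sequences, normalizes the counts into a profile, and compares two profiles. It is a minimal sketch and not the authors' implementation: the abstract does not state how profiles are compared or which l is optimal, so the Jensen-Shannon divergence, the value l = 2, the function names, and the toy sequences are all assumptions made for illustration.

```python
from collections import Counter
from itertools import combinations
from math import log2


def ffp(proteins, l):
    """Normalized frequencies of overlapping l-mers over a list of proteins.

    l-mers are counted within each protein separately, so no feature
    spans the boundary between two proteins.
    """
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + l] for i in range(len(seq) - l + 1))
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no l-mers found; every sequence is shorter than l")
    return {feature: n / total for feature, n in counts.items()}


def js_divergence(p, q):
    """Jensen-Shannon divergence (base 2) between two sparse profiles.

    Assumption: this distance is chosen for illustration only.
    """
    div = 0.0
    for feature in set(p) | set(q):
        pi, qi = p.get(feature, 0.0), q.get(feature, 0.0)
        mi = (pi + qi) / 2
        if pi > 0:
            div += 0.5 * pi * log2(pi / mi)
        if qi > 0:
            div += 0.5 * qi * log2(qi / mi)
    return div


# Hypothetical toy "proteomes"; real input would be full viral proteomes.
proteomes = {
    "virus_a": ["MKVLAAGIV", "MSTNPKPQR"],
    "virus_b": ["MKVLSAGIV", "MSTNPRPQR"],
    "virus_c": ["MDDLYWEEF", "MHHCCGGWW"],
}

profiles = {name: ffp(seqs, l=2) for name, seqs in proteomes.items()}

# Pairwise distances; a tree-building method would take this matrix as input.
for a, b in combinations(profiles, 2):
    print(a, b, round(js_divergence(profiles[a], profiles[b]), 4))
```

The resulting pairwise distances could then be passed to a standard distance-based tree-building procedure such as neighbor joining; that step is omitted here.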


Related articles

Paleozoic origin of insect large dsDNA viruses.

To understand how extant viruses interact with their hosts, we need a historical framework of their evolutionary association. Akin to retrovirus or hepadnavirus viral fossils present in eukaryotic genomes, bracoviruses are integrated in braconid wasp genomes and are transmitted by Mendelian inheritance. However, unlike viral genomic fossils, they have retained functional machineries homologous ...


Arabidopsis leaf plasma membrane proteome using a gel free method: Focus on receptor–like kinases

The hydrophobic proteins of the plant plasma membrane still remain largely unknown. For example, in the Arabidopsis genome, receptor-like kinases (RLKs), plasma membrane proteins that function as the primary receptors in the signaling of stress conditions, hormones, and the presence of pathogens, form a diverse family of over 610 genes. A limited number of these proteins have appeared in pr...


2011 German Escherichia coli outbreak: Alignment-free whole-genome phylogeny by feature frequency profiles

Introduction: The accuracy of SNP-based whole-genome phylogeny reconstruction relies heavily on the quality of sequence alignment, which is particularly hindered by poorly assembled genomes. Alignment-free methods might provide additional insights. Here, we constructed a whole-genome phylogeny of 9 outbreak isolates against existing E. coli genomes using the alignment-free feature frequency profile (FFP)...


Evolution and Phylogeny of Large DNA Viruses, Mimiviridae and Phycodnaviridae Including Newly Characterized Heterosigma akashiwo Virus

Nucleocytoplasmic DNA viruses are a large group of viruses that harbor double-stranded DNA genomes with sizes of several 100 kbp, challenging the traditional concept of viruses as small, simple 'organisms at the edge of life.' The most intriguing questions about them may be their origin and evolution, which have yielded the variety we see today. Specifically, the phyletic relationship between t...


AGP: a multimethods web server for alignment-free genome phylogeny.

Phylogenetic analysis based on alignment methods faces major challenges when dealing with whole-genome sequences, for example, recombination, shuffling, and rearrangement of sequences. Thus, various alignment-free methods for phylogeny construction have been proposed. However, most of these methods have not been implemented as tools or web servers. Researchers cannot use these methods easily with...



Journal:
  • Proceedings of the National Academy of Sciences of the United States of America

Volume 106, Issue 31

Pages: -

Publication date: 2009