CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice.

Authors

  • J D Thompson
  • D G Higgins
  • T J Gibson
Abstract

The sensitivity of the commonly used progressive multiple sequence alignment method has been greatly improved for the alignment of divergent protein sequences. Firstly, individual weights are assigned to each sequence in a partial alignment in order to down-weight near-duplicate sequences and up-weight the most divergent ones. Secondly, amino acid substitution matrices are varied at different alignment stages according to the divergence of the sequences to be aligned. Thirdly, residue-specific gap penalties and locally reduced gap penalties in hydrophilic regions encourage new gaps in potential loop regions rather than in regular secondary structure. Fourthly, positions in early alignments where gaps have been opened receive locally reduced gap penalties to encourage the opening of new gaps at these positions. These modifications are incorporated into a new program, CLUSTAL W, which is freely available.
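The first modification, per-sequence weighting, can be illustrated with a small sketch. The abstract does not state the weighting formula, so the scheme below is an assumption: each sequence's weight is accumulated along its path from the root of a guide tree, with every branch length divided by the number of sequences beneath that branch. The tree encoding, the function names (`leaves`, `sequence_weights`) and the branch lengths are hypothetical and chosen only for illustration.

```python
from typing import Dict, List, Optional, Tuple, Union

# Hypothetical tree encoding (an assumption for this sketch):
# a node is either a leaf name, or a tuple of (child, branch_length) pairs.
Tree = Union[str, Tuple[Tuple["Tree", float], ...]]


def leaves(tree: Tree) -> List[str]:
    """Return the names of all sequences (leaves) below a node."""
    if isinstance(tree, str):
        return [tree]
    names: List[str] = []
    for child, _length in tree:
        names.extend(leaves(child))
    return names


def sequence_weights(
    tree: Tree,
    inherited: float = 0.0,
    weights: Optional[Dict[str, float]] = None,
) -> Dict[str, float]:
    """Assign each leaf the sum of its branch lengths on the path from the
    root, with each branch length shared equally among the leaves below it."""
    if weights is None:
        weights = {}
    if isinstance(tree, str):
        weights[tree] = inherited
        return weights
    for child, length in tree:
        share = length / len(leaves(child))  # split a shared branch evenly
        sequence_weights(child, inherited + share, weights)
    return weights


# A and B are close relatives; C is divergent (illustrative lengths only).
example_tree: Tree = (
    ((("A", 0.1), ("B", 0.1)), 0.3),
    ("C", 0.5),
)

if __name__ == "__main__":
    for name, weight in sorted(sequence_weights(example_tree).items()):
        print(f"{name}: {weight:.3f}")
```

In this toy tree the two near-duplicates A and B split their shared branch and end up with a lower weight than the divergent sequence C, which is the qualitative behaviour the abstract describes.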


Similar resources

Taxonomy in a changing world: seeking solutions for a science in crisis.

Maddison, D. R., D. L. Swofford, and W. P. Maddison. 1997. NEXUS: An extensible file format for systematic information. Syst. Biol. 46:590-621. Maddison, W. P., and D. R. Maddison. 2005. Mesquite: A modular system for evolutionary analysis. Version 1.06. http://mesquiteproject.org. Mason-Gamer, R., and E. Kellogg. 1996. Testing for phylogenetic conflict among molecular data sets in the tribe Tr...


Building Optimal Score Computation

a database of sequences that is updated periodically with the accumulation of new sequence data, thereby allowing the periodic reassessment of phylogenetic theories. Obviously the biological components of this study will have to be refined and updated in the future. Most importantly, the performance of different tree-making methods and confidence measures will have to be assessed against real d...


KalignP: Improved multiple sequence alignments using position specific gap penalties in Kalign2

SUMMARY Kalign2 is one of the fastest and most accurate methods for multiple alignments. However, in contrast to other methods Kalign2 does not allow externally supplied position specific gap penalties. Here, we present a modification to Kalign2, KalignP, so that it accepts such penalties. Further, we show that KalignP using position specific gap penalties obtained from predicted secondary stru...


Salmonella Kingabwa Infections and Lizard Contact, United States, 2005

Molecular and phenotypic features for identification of the opportunistic pathogens Ochrobactrum spp. CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position specific gap penalties and weight matrix choice. A simple, fast, and accurate algorithm to estimate large phylogenies by maximum likelihood. Distribution of repetitive DNA seque...


Bayesian Top-Down Protein Sequence Alignment with Inferred Position-Specific Gap Penalties

We describe a Bayesian Markov chain Monte Carlo (MCMC) sampler for protein multiple sequence alignment (MSA) that, as implemented in the program GISMO and applied to large numbers of diverse sequences, is more accurate than the popular MSA programs MUSCLE, MAFFT, Clustal-Ω and Kalign. Features of GISMO central to its performance are: (i) It employs a "top-down" strategy with a favorable asympto...



Journal title:
  • Nucleic acids research

Volume 22, Issue 22

Pages -

Publication date: 1994