Protein distance matrices comparison using sequence alignment techniques
نویسندگان
چکیده
In this paper we present an efficient approach for comparing protein structures based on distance matrices. In the first step, the distance matrices are calculated and normalized in order to avoid the scaling distortion. Further, versions from these matrices are saved on different scales and for each two protein structures we compare their respective equal-scaled matrices. Thus, we avoid the problem of comparing two relatively different sized matrices, which can lead to many ambiguous cases. For the actual comparing of the matrices, a new method is introduced, which combines techniques both of partial dynamic programming for comparing the rows on the matrices, and a greedy approach for consolidating the row-row matching results. Analysis showed that this method is fairly more efficient in terms of time and memory complexity and accuracy as well, in comparison with the MatAlign and DALI algorithms, which are guided from the same idea for matrix alignment.
منابع مشابه
Use of residue pairs in protein sequence-sequence and sequence-structure alignments.
Two new sets of scoring matrices are introduced: H2 for the protein sequence comparison and T2 for the protein sequence-structure correlation. Each element of H2 or T2 measures the frequency with which a pair of amino acid types in one protein, k-residues apart in the sequence, is aligned with another pair of residues, of given amino acid types (for H2) or in given structural states (for T2), i...
متن کاملA Transition Probability Model for Amino Acid Substitutions from Blocks
Substitution matrices have been useful for sequence alignment and protein sequence comparisons. The BLOSUM series of matrices, which had been derived from a database of alignments of protein blocks, improved the accuracy of alignments previously obtained from the PAM-type matrices estimated from only closely related sequences. Although BLOSUM matrices are scoring matrices now widely used for pr...
متن کاملEmpirical determination of effective gap penalties for sequence comparison
MOTIVATION No general theory guides the selection of gap penalties for local sequence alignment. We empirically determined the most effective gap penalties for protein sequence similarity searches with substitution matrices over a range of target evolutionary distances from 20 to 200 Point Accepted Mutations (PAMs). RESULTS We embedded real and simulated homologs of protein sequences into a d...
متن کاملAlgorithm engineering for optimal alignment of protein structure distance matrices
Protein structural alignment is an important problem in computational biology. In this paper, we present first successes on provably optimal pairwise alignment of protein inter-residue distance matrices, using the popular dali scoring function. We introduce the structural alignment problem formally, which enables us to express a variety of scoring functions used in previous work as special case...
متن کاملComparisons of Semi-local Alignment Algorithms for Protein Structures
Protein structures are essential for correct biological functions because similar structures between proteins allow molecular recognition. Identifying similar structures between proteins provide the opportunity to recognize homology that is undetectable by sequence comparison. Thus comparison and alignment of protein structures represents a powerful means of discovering functions, yielding dire...
متن کامل