Efficient Algorithms for Analyzing Segmental Duplications, Deletions, and Inversions in Genomes

نویسندگان

  • Crystal L. Kahn
  • Shay Mozes
  • Benjamin J. Raphael
چکیده

Segmental duplications, or low-copy repeats, are common in mammalian genomes. In the human genome, most segmental duplications are mosaics consisting of pieces of multiple other segmental duplications. This complex genomic organization complicates analysis of the evolutionary history of these sequences. Earlier, we introduced a genomic distance, called duplication distance, that computes the most parsimonious way to build a target string by repeatedly copying substrings of a source string. We also showed how to use this distance to describe the formation of segmental duplications according to a two-step model that has been proposed to explain human segmental duplications. Here we describe polynomial-time exact algorithms for several extensions of duplication distance including models that allow certain types of substring deletions and inversions. These extensions will permit more biologically realistic analyses of segmental duplications in genomes.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Algorithms for Analyzing Human Genome Rearrangements

of “Algorithms for Analyzing Human Genome Rearrangements,” by Crystal L.Kahn, Ph.D., Brown University, May 2011. The human genome exhibits a rich structure resulting from a long history of genomicchanges, including single base-pair mutations and larger scale rearrangements such as in-versions, deletions, translocations, and duplications. The number and order of the genomicchange...

متن کامل

Efficient inversions and duplications of mammalian regulatory DNA elements and gene clusters by CRISPR/Cas9

The human genome contains millions of DNA regulatory elements and a large number of gene clusters, most of which have not been tested experimentally. The clustered regularly interspaced short palindromic repeats (CRISPR)/CRISPR-associated nuclease 9 (Cas9) programed with a synthetic single-guide RNA (sgRNA) emerges as a method for genome editing in virtually any organisms. Here we report that t...

متن کامل

Gorilla genome structural variation reveals evolutionary parallelisms with chimpanzee.

Structural variation has played an important role in the evolutionary restructuring of human and great ape genomes. Recent analyses have suggested that the genomes of chimpanzee and human have been particularly enriched for this form of genetic variation. Here, we set out to assess the extent of structural variation in the gorilla lineage by generating 10-fold genomic sequence coverage from a w...

متن کامل

Analysis of segmental duplications via duplication distance

MOTIVATION Segmental duplications are common in mammalian genomes, but their evolutionary origins remain mysterious. A major difficulty in analyzing segmental duplications is that many duplications are complex mosaics of fragments of numerous other segmental duplications. RESULTS We introduce a novel measure called duplication distance that describes the minimum number of duplications necessa...

متن کامل

Comparing genomes with rearrangements and segmental duplications

MOTIVATION Large-scale evolutionary events such as genomic rearrange.ments and segmental duplications form an important part of the evolution of genomes and are widely studied from both biological and computational perspectives. A basic computational problem is to infer these events in the evolutionary history for given modern genomes, a task for which many algorithms have been proposed under v...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009