Development of a computational strategy to compare repetitive element enrichment between experimental conditions from high-throughput sequencing datasets

نویسندگان

  • Steven Criscione
  • Nicola Neretti
چکیده

Repetitive and transposable elements comprise more than half of the human genome and play diverse roles in many biological processes. Mobile elements including retrotransposons are implicated in the organization of the epigenetic landscape, the progression of tumorigenesis, and the enhancement of genetic diversity. Despite the importance of repetitive and transposable elements these sequences are traditionally ignored in high-throughput sequencing analysis due to the technical difficulty of uniquely mapping reads from repeat DNA sequences. Here we report a new computational method for the analysis of repetitive elements from high-throughput sequencing datasets that accounts for all mapping reads. In our approach, we examine reads that map uniquely and to multiple locations of the genome using two separate strategies to determine a complete estimate of enrichment for repetitive elements. Included in our computational method is an output defined by reads per kilobase of repeat element per million mapped reads (similar to RPKM definition for the exon model) [1]. The calculated repeat element enrichment RPKM allows for the comparisons between repetitive elements as well as between experimental conditions. Our new method for examining repetitive elements from highthroughput sequencing datasets represents an improvement over existing methods because we do not exclude reads from the analysis and we can make comparisons between experimental conditions. To test our method we have examined repetitive element enrichment in the embryonic and adult mouse across different tissues using a variety of high-throughput mouse sequencing datasets available from the mouse ENCODE project and Shen et al. that provide a thorough snapshot of the epigenetic landscape of the embryonic and adult mouse [2]. We compare our method with an existing strategy for estimating repetitive element enrichment proposed by Day et al. [3], and demonstrate the advantages to our strategy. In addition, we test the robustness of our approach for determining differences in enrichment between experimental samples by conducting a comparison between the embryonic and adult mouse.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

STEAK: A specific tool for transposable elements and retrovirus detection in high-throughput sequencing data

The advancements of high-throughput genomics have unveiled much about the human genome highlighting the importance of variations between individuals and their contribution to disease. Even though numerous software have been developed to make sense of large genomics datasets, a major short falling of these has been the inability to cope with repetitive regions, specifically to validate structura...

متن کامل

SAMNetWeb: identifying condition-specific networks linking signaling and transcription

MOTIVATION High-throughput datasets such as genetic screens, mRNA expression assays and global phospho-proteomic experiments are often difficult to interpret due to inherent noise in each experimental system. Computational tools have improved interpretation of these datasets by enabling the identification of biological processes and pathways that are most likely to explain the measured results....

متن کامل

Ductile Damage Evolution under Triaxial Stress Conditions: Computational and Experimental Evaluations

The continuum mechanic simulation of micro-structural damage process is important in the study of ductile fracture mechanics. In this paper, the continuum damage mechanics model formulation proposed by Lematire has been validated against ductile damage evolution experimentally measured in A533B-C1 steel under stress triaxiality conditions. First, a &#10procedure to identify the model parameters...

متن کامل

Ductile Damage Evolution under Triaxial Stress Conditions: Computational and Experimental Evaluations

The continuum mechanic simulation of micro-structural damage process is important in the study of ductile fracture mechanics. In this paper, the continuum damage mechanics model formulation proposed by Lematire has been validated against ductile damage evolution experimentally measured in A533B-C1 steel under stress triaxiality conditions. First, a procedure to identify the model parameters f...

متن کامل

Reduced Representations for Efficient Analysis of Genomic Data; from Microarray to High-throughput Sequencing

OF THE DISSERTATION Reduced Representations for Efficient Analysis of Genomic Data; From Microarray to High-throughput Sequencing by Md Pavel Mahmud Dissertation Director: Prof. Alexander Schliep Since the genomics era has started in the ’70s, microarray technologies have been extensively used for biological applications such as gene expression profiling, copy number variation (CNV) or Single N...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره 6  شماره 

صفحات  -

تاریخ انتشار 2012