Distribution, Diversity, and Long-Term Retention of Grass Short Interspersed Nuclear Elements (SINEs)
نویسندگان
چکیده
Instances of highly conserved plant short interspersed nuclear element (SINE) families and their enrichment near genes have been well documented, but little is known about the general patterns of such conservation and enrichment and underlying mechanisms. Here, we perform a comprehensive investigation of the structure, distribution, and evolution of SINEs in the grass family by analyzing 14 grass and 5 other flowering plant genomes using comparative genomics methods. We identify 61 SINE families composed of 29,572 copies, in which 46 families are first described. We find that comparing with other grass TEs, grass SINEs show much higher level of conservation in terms of genomic retention: The origin of at least 26% families can be traced to early grass diversification and these families are among most abundant SINE families in 86% species. We find that these families show much higher level of enrichment near protein coding genes than families of relatively recent origin (51%:28%), and that 40% of all grass SINEs are near gene and the percentage is higher than other types of grass TEs. The pattern of enrichment suggests that differential removal of SINE copies in gene-poor regions plays an important role in shaping the genomic distribution of these elements. We also identify a sequence motif located at 3' SINE end which is shared in 17 families. In short, this study provides insights into structure and evolution of SINEs in the grass family.
منابع مشابه
RetroPred: A tool for prediction, classification and extraction of non-LTR retrotransposons (LINEs & SINEs) from the genome by integrating PALS, PILER, MEME and ANN
The problem of predicting non-long terminal repeats (LTR) like long interspersed nuclear elements (LINEs) and short interspersed nuclear elements (SINEs) from the DNA sequence is still an open problem in bioinformatics. To elevate the quality of annotations of LINES and SINEs an automated tool "RetroPred" was developed. The pipeline allowed rapid and thorough annotation of non-LTR retrotranspos...
متن کاملMetaSINEs: Broad Distribution of a Novel SINE Superfamily in Animals
SINEs (short interspersed elements) are transposable elements that typically originate independently in each taxonomic clade (order/family). However, some SINE families share a highly similar central sequence and are thus categorized as a SINE superfamily. Although only four SINE superfamilies (CORE-SINEs, V-SINEs, DeuSINEs, and Ceph-SINEs) have been reported so far, it is expected that new SIN...
متن کاملTargeted identification of short interspersed nuclear element families shows their widespread existence and extreme heterogeneity in plant genomes.
Short interspersed nuclear elements (SINEs) are non-long terminal repeat retrotransposons that are highly abundant, heterogeneous, and mostly not annotated in eukaryotic genomes. We developed a tool designated SINE-Finder for the targeted discovery of tRNA-derived SINEs. We analyzed sequence data of 16 plant genomes, including 13 angiosperms and three gymnosperms and identified 17,829 full-leng...
متن کاملSINE_scan: an efficient tool to discover short interspersed nuclear elements (SINEs) in large-scale genomic datasets
Motivation Short Interspersed Nuclear Elements (SINEs) are transposable elements (TEs) that amplify through a copy-and-paste mode via RNA intermediates. The computational identification of new SINEs are challenging because of their weak structural signals and rapid diversification in sequences. Results Here we report SINE_Scan, a highly efficient program to predict SINE elements in genomic DN...
متن کاملThe salmon SmaI family of short interspersed repetitive elements (SINEs): interspecific and intraspecific variation of the insertion of SINEs in the genomes of chum and pink salmon.
The genomes of chum salmon and pink salmon contain a family of short interspersed repetitive elements (SINEs), designated the salmon SmaI family. It is restricted to these two species, a distribution that suggests that this SINE family might have been generated in their common ancestor. When insertions of the SmaI SINEs at 10 orthologous loci of these species were analyzed, however, it was foun...
متن کامل