Microsatellites in different eukaryotic genomes: survey and analysis.
نویسندگان
چکیده
We examined the abundance of microsatellites with repeated unit lengths of 1-6 base pairs in several eukaryotic taxonomic groups: primates, rodents, other mammals, nonmammalian vertebrates, arthropods, Caenorhabditis elegans, plants, yeast, and other fungi. Distribution of simple sequence repeats was compared between exons, introns, and intergenic regions. Tri- and hexanucleotide repeats prevail in protein-coding exons of all taxa, whereas the dependence of repeat abundance on the length of the repeated unit shows a very different pattern as well as taxon-specific variation in intergenic regions and introns. Although it is known that coding and noncoding regions differ significantly in their microsatellite distribution, in addition we could demonstrate characteristic differences between intergenic regions and introns. We observed striking relative abundance of (CCG)(n)*(CGG)(n) trinucleotide repeats in intergenic regions of all vertebrates, in contrast to the almost complete lack of this motif from introns. Taxon-specific variation could also be detected in the frequency distributions of simple sequence motifs. Our results suggest that strand-slippage theories alone are insufficient to explain microsatellite distribution in the genome as a whole. Other possible factors contributing to the observed divergence are discussed.
منابع مشابه
Domain-level differences in microsatellite distribution and content result from different relative rates of insertion and deletion mutations.
Microsatellites (short tandem polynucleotide repeats) are found throughout eukaryotic genomes at frequencies many orders of magnitude higher than the frequencies predicted to occur by chance. Most of these microsatellites appear to have evolved in a generally neutral manner. In contrast, microsatellites are generally absent from bacterial genomes except in locations where they provide adaptive ...
متن کاملAnalysis on Frequency and Density of Microsatellites in Coding Sequences of Several Eukaryotic Genomes
Microsatellites or simple sequence repeats (SSRs) have been found in most organisms during the last decade. Since large-scale sequences are being generated, especially those that can be used to search for microsatellites, the development of these markers is getting more convenient. Keeping SSRs in viewing the importance of the application, available CDS (coding sequences) or ESTs (expressed seq...
متن کاملSurvey of compound microsatellites in multiple Lactobacillus genomes.
Distinct simple sequence repeats with 2 or more individual microsatellites joined together or lying adjacent to each other are identified as compound microsatellites. Investigation of such composite microsatellites in the genomes of genus Lactobacillus was the aim of this study. In silico inspection of microsatellite clustering in genomes of 14 Lactobacillus species revealed a wealth of compoun...
متن کاملApplication of Microsatellite Markers in Conservation Genetics and Fisheries Management: Recent Advances in Population Structure Analysis and Conservation Strategies
Microsatellites are the most popular and versatile genetic marker with myriads of applications in population genetics, conservation biology, and evolutionary biology. These are the arrays of DNA sequences, consisting of tandemly repeating mono-, di-, tri-, and tetranucleotide units, which are distributed throughout the genomes of most eukaryotic species. Microsatellites are codominant in nature...
متن کاملA Genomic Survey of HECT Ubiquitin Ligases in Eukaryotes Reveals Independent Expansions of the HECT System in Several Lineages
The posttranslational modification of proteins by the ubiquitination pathway is an important regulatory mechanism in eukaryotes. To date, however, studies on the evolutionary history of the proteins involved in this pathway have been restricted to E1 and E2 enzymes, whereas E3 studies have been focused mainly in metazoans and plants. To have a wider perspective, here we perform a genomic survey...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Genome research
دوره 10 7 شماره
صفحات -
تاریخ انتشار 2000