The Incompatible Desiderata of Gene Cluster Properties
نویسندگان
چکیده
There is widespread interest in comparative genomics in determining if historically and/or functionally related genes are spatially clustered in the genome, and whether the same sets of genes reappear in clusters in two or more genomes. We formalize and analyze the desirable properties of gene clusters and cluster definitions. Through detailed analysis of two commonly applied types of cluster, r-windows and maxgap, we investigate the extent to which a single definition can embody all of these properties simultaneously. We show that many of the most important properties are difficult to satisfy within the same definition. We also examine whether one commonly assumed property, which we call nestedness, is satisfied by the structures present in real genomic data.
منابع مشابه
جدا نمودن ژن تنظیمی استرپتومایسین فاقد پروموتر [StrR2] از استرپتومایسز گریزئوس
Background and purpose: Polymerase chain reaction (PÇR) is a rather quick and accurate method employed for gene detection and isolation. Primer designing is an important issue in this technique and plays a critical role in considering both the genome properties and cloning of the isolated genes. Streptomycin antibiotic is produced by Streptomyces griseus using str gene cluster with more than 25...
متن کاملDesiderata for Generalization-to-N Algorithms
Systems that perform \generalization-to-N" in explanation-based learning generalize a proof tree by generalizing the shape of the tree, rather than simply changing constants to variables. This paper introduces a formal framework which can be used either to characterize or to specify the outputs of an algorithm for generalizing number. The framework consists of two desiderata, or desired propert...
متن کاملHeterozygosis deficit of polymorphic markers linked to the β-globin gene cluster region in the Iranian population
Objective(s): Iran is considered as one of the high-prevalence areas for β-thalassemia with a rate of about 10% carrier frequency. Molecular diagnosis of the disease is performed both by direct sequencing and indirectly by the use of polymorphic markers present in the beta globin gene cluster. However, to date there is no reliable information on the application of the markers in the Iranian pop...
متن کاملMolecular Characterization and Phylogeny Analysis Based on Sequences of Cytochrome Oxidase gene From Hemiscorpius lepturus of Iran
Abstract: Background: Hemiscorpius lepturus is a medically important scorpion found along the Iranian borders, especially near to Khuzestan Province in the south-west of Iran. This is the only non-buthid scorpion which is potentially lethal in southern Iran and is responsible for severe dermonecrotic scorpionism. OBJECTIVES: In this study, DNA fragment of the mitochondrial cytochrome c oxidase ...
متن کاملDetection of Arctic and European cluster of canine distemper virus in north and center of Iran
Canine distemper virus (CDV) creates a very contagious viral multi-systemic canine distemper (CD) disease that affects most species of Carnivora order. The virus is genetically heterogeneous, particularly in section of the hemagglutinin (H) gene. Sequence analysis of the H gene can be useful to investigate distinction of various lineages related to geographical distribution and CDV molecular ep...
متن کامل